(ML 18.1) Markov chain Monte Carlo (MCMC) introduction

แชร์
ฝัง
  • เผยแพร่เมื่อ 12 พ.ย. 2024

ความคิดเห็น • 71

  • @hannahmctigue9616
    @hannahmctigue9616 5 ปีที่แล้ว +87

    fyi, the two authors referred to as "the wives of Rosenbluth and Teller" have names, as well; they are Arianna W. Rosenbluth and Augusta H. Teller

    • @zioasterbald5962
      @zioasterbald5962 5 ปีที่แล้ว +7

      Exactly the comment I wanted to see. Thanks. :)

    • @gregor3264
      @gregor3264 4 ปีที่แล้ว +5

      thanks!

    • @nemmaadeni
      @nemmaadeni 4 ปีที่แล้ว +4

      YES!

  • @ProfessionalTycoons
    @ProfessionalTycoons 6 ปีที่แล้ว +69

    In general, MC is used by rappers and the evolution of MC have made into pop culture to MC Hammer. Hence when MC Hammer says 'Can't touch this' this means that there is no way to approximate the parameter of the distribution that we want to aim.
    Hence MCMC can't touch this.

    • @mkowal2
      @mkowal2 4 ปีที่แล้ว +1

      LOL

    • @captain_buggle694
      @captain_buggle694 4 ปีที่แล้ว +2

      Now I know everything about everything.

  • @pedrocolangelo5844
    @pedrocolangelo5844 11 หลายเดือนก่อน

    Your enthusiastic way of teaching is so inspiring. Thank you for sharing this great video!

  • @andrashorvath2411
    @andrashorvath2411 8 หลายเดือนก่อน

    Fantastic presentation, easy to follow and giving great intuition, thank you.

  • @danlan4132
    @danlan4132 8 ปีที่แล้ว +158

    I still do not understand

    • @ItsNotAllRainbows_and_Unicorns
      @ItsNotAllRainbows_and_Unicorns 8 ปีที่แล้ว +4

      "A Markov chain is a sequential model that transitions from one state to another in a probabilistic fashion, where the next state that the chain takes is conditioned on the previous state." -theclevermachine.wordpress,com
      Let's apply that to his drawing of the various interconnected green lines. The OP started at a point on the line and progressed along the line. The progression is based on the previous point. The red points as depicted in the drawing are probabilities. Examples in real life are tracking of an aircraft, missile or spacecraft.There is variation and noise in radar. (Actually there is variation and noise in every measuring instrument). Another example can be a missile tracking an aircraft as in a heat seeking missile.

    • @Johnnyboycurtis
      @Johnnyboycurtis 7 ปีที่แล้ว

      You should watch this for an actual example: th-cam.com/video/tre4zz7pnlE/w-d-xo.html

    • @buramjakajil5232
      @buramjakajil5232 5 ปีที่แล้ว +2

      Agreed, the video introduces the MCMC concept in a rather high-level way so you get the intuition, somewhat, but you can't program the MCMC method by yourself with the help of this video. More details and math would be needed.

  • @rebeccacarroll8385
    @rebeccacarroll8385 ปีที่แล้ว +1

    A little dissapointed that Arianna Rosenbluth wasn't included in the list of creators. She was also the first to use the method. Great explanation of MCMC!

  • @ching-chenghsu1423
    @ching-chenghsu1423 6 ปีที่แล้ว +12

    I like your explanation. My physics background helps me a lot :). Many people learn algorithms just by learning their mathematics without knowing why we need that. Many tricks in MCMC, such as Gib sampling, Metropolis-Hastings algorithm come from solving statistical mechanics problems. And that's physics, not computer science :)

    • @raghavendrakaushik4871
      @raghavendrakaushik4871 3 ปีที่แล้ว

      Curious to know how? Can you elaborate a bit on this - algorithm come from solving statistical mechanics problems.
      Or maybe refer a source to know more?

    • @carti8778
      @carti8778 2 ปีที่แล้ว +2

      @@raghavendrakaushik4871 statistical mechanics is study of a large no of particles by applying probabilistic methods. It's all about applying MCMC simulations to spin hamiltonians. Spin is an intrinsic property of an electron(u can call it a direction of an electron) and these hamiltonian equations are not solvable analytically bcz they are non-linear. That's why we employ the MCMC technique (metropolis, heat bath) to sample configurations from a large ensemble of states and calculate our observables on only these sampled states. If we don't use MCMC simulations, we would have to average over all the possible states which is very impractical since for real systems bcz there might be infinitely large no. of states. My phd was also about applying MCMC to quantum many-body physics.

  • @pigdogcheap1773
    @pigdogcheap1773 4 ปีที่แล้ว +5

    Thanks for the lecture, great explanation. Nonetheless, "do the wives of Rosenbluth and Teller have names?"

  • @ivyzhu3981
    @ivyzhu3981 6 ปีที่แล้ว +3

    A masterpiece. Thank you so much for sharing the knowledge.

  • @imkb010
    @imkb010 13 ปีที่แล้ว +3

    So this is where I go after graduating from Khan Academy! Thanks for posting these videos, very helpful!

  • @delrocco
    @delrocco 6 ปีที่แล้ว +1

    This reminds me of simulated annealing in a sense - the fact that the vector of variables was a state of the problem and the algorithm does a sort of hill-climbing to find better states, as well as the fact that annealing has to do with particles...

  • @MrEduardoPessanha
    @MrEduardoPessanha ปีที่แล้ว

    Excellent tutorial.

  • @BerkayCelik
    @BerkayCelik 9 ปีที่แล้ว +2

    Great series of videos. Thank you so much.

  • @afaisaladhamshaazi7519
    @afaisaladhamshaazi7519 5 ปีที่แล้ว +1

    The photos used around 7:45 make the physicists look like gangsters

  • @mariamalmstrom6878
    @mariamalmstrom6878 ปีที่แล้ว +1

    What has the discs to do with the phase diagram? In the moving of the discs, how are Markov chains used?

  • @alexpui82
    @alexpui82 12 ปีที่แล้ว

    This is way more accessible than the Hastie and Tibshirani book :) Excellent Stuff

  • @GodspeedPatches
    @GodspeedPatches 13 ปีที่แล้ว

    Great video, the initial physics application for MCMC was in modelling neutron diffusion

  • @AntonioSzeto
    @AntonioSzeto 11 ปีที่แล้ว +1

    A very good tutorial ! Thank you !

  • @Yzjoshuwave
    @Yzjoshuwave 3 ปีที่แล้ว +4

    This was mostly pretty clear. The pace and tone of it are quite easy to listen to. I was a little unclear about how the circles corresponded to the solid/liquid/gas diagram at the end, or what sort of data was (or would need to be) collected to develop the correspondence. Was the idea that those circles would end up finding the high probability distribution (like the “lightening” from the previous screen) for one of those phase diagrams?

  • @jeremyroy8552
    @jeremyroy8552 4 ปีที่แล้ว +4

    Question about your intuitive description of the algorithm: what happens when you have disconnected regions of high probability? Will MCMC fail to find them?

  • @rafaellima8146
    @rafaellima8146 7 ปีที่แล้ว +2

    I understood the intuition. However, I still did not understand how it works. I believe if you used a numerical example, I would understand much better the idea.

  • @ASHISHDHIMAN1610
    @ASHISHDHIMAN1610 2 ปีที่แล้ว

    This is the next 3blue1brown

  • @petersmuts6267
    @petersmuts6267 8 ปีที่แล้ว +1

    This is great, thanks.

  • @mebstyne
    @mebstyne 12 ปีที่แล้ว +1

    love it! thank you!

  • @AswanthCR7
    @AswanthCR7 4 ปีที่แล้ว

    Thank you for this!

  • @maxsklar
    @maxsklar 12 ปีที่แล้ว +1

    Thanks for the videos!

  • @vuhoangdung
    @vuhoangdung 12 ปีที่แล้ว

    thank you for the excellent lecture. I wonder if you can do about the "Particle filter" (Sequential Monte Carlo Method) as well?
    Thank you.

  • @BillHaug
    @BillHaug ปีที่แล้ว

    thank you

  • @nikilpujari1682
    @nikilpujari1682 12 ปีที่แล้ว

    thanks you very very much great video.........could probably under better than reading

  • @MrPrashath
    @MrPrashath 12 ปีที่แล้ว

    Excellent

  • @ashalyngdoh7622
    @ashalyngdoh7622 4 ปีที่แล้ว +1

    what does high-dimensional probability mean when he said here in the introduction? plss someone help me!!

    • @fenildoshi386
      @fenildoshi386 4 ปีที่แล้ว

      The random variable is not just one dimensional but spans across multiple dimensions.
      Like - Multinomial distribution, Dirchlet distribution, Multivariate Gaussian etc.

  • @erogol
    @erogol 11 ปีที่แล้ว +1

    Way too good explanation...

  • @khanetor
    @khanetor 8 ปีที่แล้ว

    When you draw a box around a molecule, what is it that you are examine? Presence of a molecule?

  • @Ivan-td7kb
    @Ivan-td7kb 5 ปีที่แล้ว

    So is MCMC just random sampling? Similar to numpy.random but for more complicated distributions?

  • @navidmohammadzadeh5890
    @navidmohammadzadeh5890 5 ปีที่แล้ว +1

    It was not clear for me at all. :(

  • @KMulroe
    @KMulroe 2 ปีที่แล้ว

    ❤️‍🔥

  • @gokultalele4264
    @gokultalele4264 9 ปีที่แล้ว

    I found SYSTAT 13 is good software for MCMC

  • @wutch27
    @wutch27 12 ปีที่แล้ว

    what do you do if after one of the iterations the new configuration is not allowed (like two discs intersect). Do you just go back and try again until you step to an allowed configuration? or perhaps reset to some sort of base state?

  • @GreyBillie
    @GreyBillie 12 ปีที่แล้ว

    Thank you :)

  • @sirisaksirisak6981
    @sirisaksirisak6981 4 ปีที่แล้ว

    If one who not understanding clearly each of two theory will cause confusing when mixing it.

  • @minanina8214
    @minanina8214 9 ปีที่แล้ว +3

    when we use MC ?and MCMC and what is the problem in MC so we use MCMC?

    • @Johnnyboycurtis
      @Johnnyboycurtis 7 ปีที่แล้ว

      Can you clarify your question? Are you asking for the difference between MC and MCMC?

    • @enesalbay9707
      @enesalbay9707 6 ปีที่แล้ว

      As I understand, MC jsut sampling data but MCMC use Markov property for sampling. So, you do not just randomly sample data in MCMC but you follow a sampling pattern regarding the previous sampling. That is called Markov property.

  • @soohyunnieee.k
    @soohyunnieee.k 4 ปีที่แล้ว

    voice sounds very similar to sal khan

  • @happysirishful
    @happysirishful 11 ปีที่แล้ว

    Can you please add a lecture on Reversible jump MCMC thank you

  • @DrINTJ
    @DrINTJ 8 ปีที่แล้ว +21

    Bad education is when you over-simplify the stupid and easy parts and hurriedly skip over the hard parts.
    Value gained = 0

    • @YashwantMarathe
      @YashwantMarathe 7 ปีที่แล้ว +7

      It's a video series. Watch the other videos for the hard parts

    • @Johnnyboycurtis
      @Johnnyboycurtis 7 ปีที่แล้ว

      I've a MS in Statistics, and I can say that these videos are not helpful. He doesn't explain the algorithms well. I made my own video where I actually have an example: th-cam.com/video/tre4zz7pnlE/w-d-xo.html

    • @ching-chenghsu1423
      @ching-chenghsu1423 6 ปีที่แล้ว

      Hi man, this is one of the best MCMC introduction I have seen. Just for your information, these big guys who developed MCMC are all physicists. not statisticians. They developed this for attacking real problems not playing math tricks.

  • @IshratBadami
    @IshratBadami 9 ปีที่แล้ว

    What is the software that you are using?

  • @jerryys
    @jerryys 9 ปีที่แล้ว

    Thank you so so much !
    So my understanding is MCMC is Monte Carlo simulation for a vector (array) of variables. The number of variables is infinite. Is my understanding correct?

    • @Johnnyboycurtis
      @Johnnyboycurtis 7 ปีที่แล้ว

      I don't think your variables can be infinitely. You have p-dimensional problem with p-unknown parameters you are trying to estimate via sampling from the marginal distribution. You sample from the marginal distribution by MCMC. See my video: th-cam.com/video/tre4zz7pnlE/w-d-xo.html

  • @fitzgerald5032
    @fitzgerald5032 3 ปีที่แล้ว

    This was awful

  • @step7steveX
    @step7steveX 5 ปีที่แล้ว

    Not a great explanation

  • @andres8bg247
    @andres8bg247 5 ปีที่แล้ว

    terrible video, could hardly watch it despite being very interested in the topic.