An Introduction to the Hypergeometric Distribution

  • Published on 22 Sep 2024
  • An introduction to the hypergeometric distribution. I briefly discuss the difference between sampling with replacement and sampling without replacement. I describe the conditions required for the hypergeometric distribution to hold, discuss the formula, and work through 2 simple examples.
    I also discuss the relationship between the binomial distribution and the hypergeometric distribution, and a rough guideline for when the binomial distribution can be used as a reasonable approximation to the hypergeometric. I finish with a brief example involving the multivariate hypergeometric distribution.
    For those using R, here is the R code to find the probabilities for the examples in this video:
    The probability of picking exactly 4 red balls when picking 5 balls from a source containing 6 red and 14 yellow.
    Without replacement (hypergeometric):
    choose(6,4)*choose(14,1)/choose(20,5)
    [1] 0.01354489
    or
    dhyper(4,6,14,5)
    [1] 0.01354489
    With replacement (binomial):
    dbinom(4,5,6/20)
    [1] 0.02835
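The first example can also be reproduced from the counting argument directly. The following is a minimal sketch, not code from the video; the helper name `hyper_pmf` and the argument names `N` (population size), `a` (successes in the population), and `n` (sample size) are assumptions chosen for illustration.

```r
# Hypergeometric pmf written out from the counting argument.
# N = population size, a = successes in the population, n = sample size.
# (Helper name and argument names are illustrative assumptions.)
hyper_pmf <- function(x, a, N, n) {
  choose(a, x) * choose(N - a, n - x) / choose(N, n)
}

# Exactly 4 red balls in 5 draws from 6 red and 14 yellow.
p_manual  <- hyper_pmf(4, a = 6, N = 20, n = 5)
p_builtin <- dhyper(4, m = 6, n = 14, k = 5)

# The probabilities over all possible values of x should add up to 1.
total <- sum(hyper_pmf(0:5, a = 6, N = 20, n = 5))

print(c(manual = p_manual, builtin = p_builtin, total = total))
```

Note that `dhyper()` takes the number of successes and the number of failures in the population as separate arguments, whereas the helper takes the number of successes and the total population size.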
    The probability of picking exactly 7 females when randomly sampling 10 students from a school with 1100 female and 900 male students.
    Without replacement (hypergeometric):
    choose(1100,7)*choose(900,3)/choose(2000,10)
    [1] 0.1664901
    or
    dhyper(7,1100,900,10)
    [1] 0.1664901
    With replacement (binomial):
    dbinom(7,10,1100/2000)
    [1] 0.1664783
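The two examples above differ mainly in how large the sample is relative to the population. The sketch below puts them side by side; the helper name `compare_models` is an assumption for illustration. A commonly quoted rule of thumb is that the binomial approximation is reasonable when the sample is no more than about 5% of the population, but that threshold is an assumption here, since the description does not state the guideline used in the video.

```r
# Compare the hypergeometric probability with its binomial approximation.
# a = successes in the population, N = population size, n = sample size.
# (Helper name is an illustrative assumption.)
compare_models <- function(x, a, N, n) {
  hyper <- dhyper(x, a, N - a, n)
  binom <- dbinom(x, n, a / N)
  c(sampling_fraction = n / N,
    hypergeometric    = hyper,
    binomial          = binom,
    abs_diff          = abs(hyper - binom))
}

ball_example   <- compare_models(4, a = 6,    N = 20,   n = 5)
school_example <- compare_models(7, a = 1100, N = 2000, n = 10)

print(rbind(ball_example, school_example))
```

In the ball example the sample is a quarter of the population and the two probabilities differ noticeably; in the school example the sample is half a percent of the population and they agree to about four decimal places.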
    Multivariate hypergeometric: the probability of picking exactly 3 Democrats, 2 Republicans, and 1 independent in a sample of 6 drawn from a group of 12 Democrats, 24 Republicans, and 8 independents.
    choose(12,3)*choose(24,2)*choose(8,1)/choose(44,6)
    [1] 0.06881377
    or, with the extraDistr package installed:
    dmvhyper(c(3,2,1),c(12,24,8),6)
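The multivariate calculation generalizes to any number of groups without extra packages. This is a minimal base-R sketch; the function name `mv_hyper_pmf` and its argument names are assumptions for illustration, not part of any package.

```r
# Multivariate hypergeometric pmf in base R.
# x           = number drawn from each group
# group_sizes = number of items of each group in the population
# (Function and argument names are illustrative assumptions.)
mv_hyper_pmf <- function(x, group_sizes) {
  stopifnot(length(x) == length(group_sizes))
  prod(choose(group_sizes, x)) / choose(sum(group_sizes), sum(x))
}

# 3 Democrats, 2 Republicans, 1 independent from groups of 12, 24, and 8.
p_politics <- mv_hyper_pmf(c(3, 2, 1), c(12, 24, 8))
print(p_politics)

# With only two groups it reduces to the ordinary hypergeometric pmf.
p_two_groups <- mv_hyper_pmf(c(4, 1), c(6, 14))
print(all.equal(p_two_groups, dhyper(4, 6, 14, 5)))
```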

Comments • 243

  • @aymenechchalim4654 1 year ago +51

    Huge admiration for your work, it's nice to realize that something you did 9 years ago is still remarkably useful to many of us, thank you

    • @jbstatistics 1 year ago +18

      Thanks for the kind words! I tried to build them to stand the test of time :)

  • @lazy_hiker_guy 10 years ago +126

    You are better at explaining the basic concepts of statistics than my college professors. You have become my shifu!

    • @jbstatistics 10 years ago +36

      I'm always happy to be somebody's shifu!

    • @lazy_hiker_guy 10 years ago +4

      Could you please upload video on Gamma Distribution?

    • @pesa2164 3 years ago +7

      I had to Google shifuh…

    • @phenomenal821 1 year ago +1

      @@pesa2164 🤣

    • @bruhidk2817 1 year ago

      Ikr watching this bc my high school teacher like explains it once than just uses homework

  • @yaribsuarez8725 8 years ago +118

    You're a gifted professor, I really enjoy and understand!! 👍

    • @jbstatistics 8 years ago +13

      +Yarib Suárez Thanks for the compliment! I'm glad to be of help.

  • @kniix 2 years ago +7

    I absolutely love that you compare each dist type to Binomial, so that we can understand WHY they are different. Well done!

  • @danielrojas4843 8 years ago +27

    your voice is so peaceful I dont know if Im studying or hearing music.
    Great videos man

  • @sknight7511 3 years ago +7

    who knew a video from years ago would help me so much with my uni, hope you're still teaching because omg your explanation is so good.

  • @narasimhann2814 1 year ago +3

    Dude!!
    Your series is literally gold.
    Precise explanation, with examples and a tiny bit of humor. Hats off to you!!!
    I've learnt more now than during my college. Thank you :)

  • @Cleisthenes2 10 months ago +11

    Huh. So why is it 'hyper-geometric'?

  • @orlandowan5847 7 years ago +2

    One of the better instructional videos on this topic here on YouTube. You also explain similarity and trade-off with the binomial distribution as well as cater for scenarios where there are more than two possible outcomes. Thanks.

  • @nishadr.7637 2 years ago +3

    this is basically 5 hours of lectures summed up in 15 min

  • @benjaminli3808 3 years ago +3

    I literally had a list of distributions i needed to review, and you just lined up all the videos, XD, amazing.

  • @sunghyunpark2055 8 years ago +7

    Wow, This is really good. I am currently preparing for the preliminary exam for actuarial science, and your lecture is so clear and super-helpful.

    • @jbstatistics 8 years ago +1

      +Sung Hyun Park Thanks!

  • @bokangfalatsi3736 1 year ago +3

    Okay Professor, you are guilty of being the best Mathematical Statistics Prof.

  • @abhinavbichal8798 2 years ago

    You are the best statistics teacher I have ever found!

  • @instant_mint 6 years ago +2

    These videos are helping me so much! I feel like I'm about to give up this course but then I come home and watch another video from this series, and suddenly things make sense. Thanks for making great videos

    • @jbstatistics 6 years ago +1

      Thanks for the kind words! I'm glad I could be of help.

  • @TheOfficialYossi 10 years ago +12

    Man I LOVE YOU!! I have an exam in the morning and this is so useful!! I have an exam in all the distributions, wish me luck! haha

  • @jbstatistics 11 years ago +1

    Thanks Mohamed. I appreciate the compliment! I'm glad you like my videos, and I'll definitely be adding more soon.

  • @jbstatistics 11 years ago +3

    Thanks! I hope you find many more of my videos helpful. Cheers.

  • @jbstatistics 11 years ago +1

    Thanks! I'm glad you find them useful.

  • @goksenumutguler2179 7 years ago

    A very clear and simplified version of everything teachers generally tend to make complex as hell. You, sir, are a legend.

    • @jbstatistics 7 years ago

      Thanks! I'm glad to be of help

  • @rishabhnarula1999 10 months ago +1

    thank you sir, i have become a great admirer of your works now, and they are really useful. 😊👍

    • @jbstatistics 10 months ago +1

      You are very welcome. I'm glad to be of help!

  • @Mariam-so3nb 9 years ago +1

    I wish I found your videos a little bit earlier! Watching your videos give some sense of hope for my exam tomorrow. Thank you! :-)

  • @faracanthaceae4946 7 years ago +26

    the boss example hilarious :D :D :D

  • @gabriel-braga-uc 1 year ago

    Your videos are serving as amazing insights and have been helping me a lot throughout my graduation. Thank you so much for this peak content!

  • @markusbjerkas6675 1 year ago

    That boss example literally made my day

    • @jbstatistics 1 year ago

      It was a long time ago, but still one of my faves :)

  • @ozchelseagirl 1 year ago

    I am watching all your videos and I second, these are all relevant, clearly explained, brilliant videos. Thank you so much for your brilliant work!

  • @philandthai 5 years ago

    I am sure that it is a significant effort to create these videos, which are so smooth, so clear and so polished. I always think that statistics is an intrinsically difficult area of mathematics. While I am an advocate of book learning, your videos are a great supplement and I really appreciate your contribution.

    • @jbstatistics 5 years ago +3

      Thanks so much for the very kind words. I'm very glad I could be of help. These videos do take some time (sometimes *a lot* of time), but I hope they provide value that far outweighs my effort. Thanks again.

  • @TravellingDon 1 year ago

    Amazing video makes the concept so much easier to understand.

  • @krishnabharadwaj4715 8 years ago

    You are the king of all teachers.

  • @LoayAkmal 7 years ago

    Alright, you've just saved my grades and more importantly, taught me statistics. Thank you!

    • @jbstatistics 7 years ago +1

      I'm glad to be of help!

  • @fahada1921 7 years ago

    Wow thank you.
    i love people who share their knowledge and make it easier for others.

    • @jbstatistics 7 years ago

      You are very welcome. I'm glad I could help!

  • @rehabmohsin2071 7 years ago +1

    You are an absolute legend!!! Thank you!!

    • @jbstatistics 7 years ago

      You are very welcome. A legend in my own mind, at least!

  • @asanyal296 5 years ago

    Crystal clear explanation. Very helpful. I detect a slight Canadian accent which I have come to associate with the best math and stats videos on YouTube. Thanks prof wherever you are.

    • @philandthai 5 years ago

      Canadian accent? I thought we Canadians didn’t have accents. Isn’t it everyone else who does?

    • @asanyal296 5 years ago

      Phil Bakes - that is correct. However, we who live South of the border - and also have no accent, of course - notice a few deviations around our means :)

  • @umeshtiwari9249 1 year ago

    Hats off to you Sir, the way you explain really fits into the mind. Thanks

  • @asababenard2045 1 year ago

    U are just the best; May God just bless you

  • @mgkillex6636 2 years ago

    THANK YOU for teaching so well

  • @UncleLejin88 9 years ago

    Thank you so much. You explain it so much better than my professor.

  • @theodoresweger4948 2 years ago

    Thank you for adding the 3rd component; I was wondering about that. Thanks very much

  • @jemshussdsbc4775 8 months ago +1

    You are the best teacher thanks alot

    • @jbstatistics 8 months ago

      You are very welcome, and thanks for the compliment!

  • @jbstatistics 11 years ago

    Hi inmwt. I'm glad my video helped!

  • @aakarshitsrivastava6888 4 years ago

    All doubts cleared by seeing your lecture thanks a lot

  • @VK-sp4gv 1 year ago

    Very well explained, thanks.

  • @halajbail7007 5 years ago

    The BOSS of statistics!!!

  • @violalagonigro-m4d 5 months ago

    10 yrs later and still going strong

  • @hanwenguo9371 1 year ago +2

    Hey 9 years later, I still like your joke

  • @sthefanyguzman8209 2 years ago

    bro...I think you are good. Glad to find you, thank you for all

  • @jeffaggas2205 9 years ago

    This is a great explanation, very clear and really good examples.

  • @ViniciusCamatti01 10 years ago +1

    Very good job! I learned a lot and very rapidly.

  • @ravitegar6582 6 years ago

    Here I am, 2 hours before the exam and just starting to dig into the hypergeometric distribution. Thanks for the video, wish me luck.

  • @bhanusinghal1918 5 years ago

    Finally I am not bored of studying stats!!!!!

  • @naved591 3 years ago

    Excellent explanation, Now the exam will be good, thanks to you

  • @nimas1638 5 years ago

    Stats exam this Saturday.... thanks for these videos :)

  • @theodoresweger4948 2 years ago

    As usual you do an excellent job of explaining this distribution. Probability is a very popular subject for me. I'm having a lot of trouble with BINGO and would like to see examples of this!!!

  • @DexterPires 6 years ago

    I honestly was really getting frustrated with this whole part of statistics, I would work really hard (Reading, trying examples and exercises) but I just couldn't get it. Your videos just made it so simple and easy, Thank you!!

    • @jbstatistics 6 years ago

      You are very welcome Luis! I'm glad to be of help!

  • @narisup272 1 year ago

    Thank you for the explanation. It helps a lot!

  • @isupportargentina 9 years ago

    I think this needs to be added to your "Discrete distributions" playlist?
    Thanks for these once again! :)

    • @jbstatistics 9 years ago +1

      Shaun Roberts Thanks Shaun, I didn't notice that it wasn't in that playlist. I've added it.

  • @stellaofthelake3451 5 years ago +5

    Lol. You sampled 2 ppl 6 times lol. Fired.

  • @tvvt005 6 months ago

    7:09 but n-(N-a) is just n-(no of failures in the sample+ number of successes outside the sample) right? So why do we consider number of successes outside sample

  • @manellallem3523 2 years ago

    Your videos are saving me thank you so much sir you are amazing!

  • @amitbhatt575 8 years ago

    sr it really helped me a lot, U explained just like sal khan

  • @ololademafimidiwo7288 7 years ago

    Thank you so much! This made things so much clearer.

  • @suryat1103 6 years ago

    Excellent Explanation.
    I am a newbie to stats with a good math background.
    I was able to grasp the content.
    Many Many Thanks

    • @jbstatistics 6 years ago

      You are very welcome, and thanks for the compliment!

  • @sandisiwebuthelezi3192 9 years ago

    Good explanation with clear examples...Thank you :)

  • @moypatel5554 8 years ago +2

    Thank you for your knowledge sir

  • @noor777300 7 years ago

    thank you so much. Very clear explanation and good comparisons to avoid misunderstanding. Keep it up sir

    • @jbstatistics 7 years ago +2

      You are very welcome! Thanks for the kind words.

  • @someshshahi6014 10 years ago +1

    This is so helpful, should help me ace my stats exam. Do you have a video on conditional probability ?

  • @royalarindam 2 years ago

    Great video man!

  • @MileeWilson 10 years ago +8

    Mathematicians, have a very good sense of humour. He said if you told your boss you sampled the same person more than once, you will be looking for another job in the near future,,, LOL.

  • @aaaaaawda 7 years ago +2

    Why are you such a lifesaver?!?!? Whyy tell me!!! Tell it!!!

    • @jbstatistics 7 years ago +4

      I'm just doing my best to help out.

  • @arnatri1503 5 years ago

    Great explanation. Thank you!

  • @snoop1029 4 years ago

    At 1:25,
    you said that selecting any sample of 5 balls is equally likely, but it's seems wrong to me, because selecting 5 yellows is much more likely than selecting 5 reds...
    what am I missing?
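One way to look at the question above is to distinguish a specific set of 5 balls from a colour outcome. The sketch below is an illustration added here, not a reply from the thread or code from the video: it treats the 20 balls as individually labelled, enumerates every possible set of 5, and counts how many sets give each number of reds. Each individual set is equally likely, but far more sets consist of 5 yellows than of 5 reds. All variable names are assumptions for illustration.

```r
# Label the 20 balls individually: positions 1-6 are red, 7-20 are yellow.
ball_colours <- c(rep("red", 6), rep("yellow", 14))

# Every possible set of 5 distinct balls; each column is one set.
samples <- combn(20, 5)

# Number of red balls in each set.
n_red <- apply(samples, 2, function(idx) sum(ball_colours[idx] == "red"))

# How many equally likely sets produce each red count, and the implied probabilities.
counts <- table(n_red)
probs  <- counts / ncol(samples)

print(counts)
print(round(probs, 6))
```

Dividing the counts by the total number of sets reproduces the hypergeometric probabilities returned by `dhyper(0:5, 6, 14, 5)`.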

  • @seanki98 6 years ago

    Professor, the reason why the binomial coefficients are multiplied is because of the "And" rule in probability right? We are making our choices simultaneously

  • @sushmitanigam4979 7 years ago +2

    thanx a lot. incredible explanation

  • @davinpaharia6265 8 years ago

    Sir could you please explain in the last example why the case for failure was not taken??

  • @dgamma1 10 years ago

    extremely well explained - and professional!!

  • @thepag52 7 years ago

    You're a blessing. Thank you so much for what you do. But I wish I had the definition of a hypergeometric distribution like the others so I can distinguish when to use it. I have been writing each definition down.

  • @mskgt8466 8 years ago

    Thank you very much for such a great explanation. It was like 2+2=4 👍👍 you have clarified my queries :)

  • @abhisekbhuyan8854 6 years ago

    I have a doubt!! What if the last question was asked with replacement? Is it going to be the addition of 3 probabilities, i.e., P(3 Democrats) + P(2 Republicans) + P(1 Independent)?

  • @temrenalican4570 8 years ago

    Wherever I look up the variance formula I'm finding a bunch of Einstein formulas that don't make any sense to me. Can you write a brief summary of variance or give me a link to a source that is easy to understand?

  • @dr.mosesmuthinja7838 6 years ago

    This is brilliant, I like it. some text books had made me a stressed man.

    • @jbstatistics 6 years ago

      Thanks for the compliment! I'm glad I could be of help!

  • @oprabin 7 years ago

    that "boss example" was savage!
    XD

    • @jbstatistics 7 years ago +2

      Thanks! I found it amusing :)

  • @joshuabryant536 2 years ago

    When you said you used the binomial distribution with the replacement, did you mean the negative binomial distribution formula? Bc that was the formula you used, not the Binomial

    • @jbstatistics 2 years ago

      No, I meant the binomial and used the binomial pmf. The negative binomial looks a little like it, but that's the binomial pmf.

  • @studunders8354 10 years ago

    This video is excellent! Keep up the good work!
    Oh where are my manners, thank you for doing this, much appreciated!

    • @jbstatistics 10 years ago

      You are very welcome Stu!

  • @xyz39r1abc 9 years ago

    Very good explanation :D Thank you

  • @koachonghow2691 5 years ago

    better explanation than my professor

  • @ДмитроКостюшко 8 years ago

    Thanks, this has really helped me, although I did not understand why at 7:10 "x" has to be no greater than not only the number of successes but also the size of the data set. Aren't they codependent? How can the number of successes be greater than the size of the data set those successes are located in?

  • @andreyrublyov8061 8 years ago

    Thank you. I'm crying right now! :')

  • @lilmonkianime8084 4 years ago

    Hello sir, very good video, very helpful. Are you sure that you used the combination formula with the right calculation in the last exercise?

  • @polarbear986 4 years ago

    this is life saver

  • @UncleLejin88 9 years ago

    I love stats, but my professor's lectures are dry and not as insightful as yours. You are awesome!!!

  • @coolgreensnake 11 years ago

    love ur videos ,keep up the good work

  • @jyotiranjanjally_ 4 years ago

    Very helpful! Thank you.

  • @vishweshmishra8134 5 years ago

    Thankyou so much great video

  • @Ali124hdkflc 3 years ago

    You're a gem.

  • @AloofMusician 9 years ago

    Hi, I'm just a bit confused with the bit about Max (0, n-(N-a)) and Min (a,n). What do you mean by the maximum value x can take on is going to be the minimum of (a,n), and what does it mean that the min value x can take on is the max of 0, n + a - N? Thanks for your help.

    • @jeffaggas2205 9 years ago +2

      The maximum value of successes cannot be greater than 1) a (the number of successes in the population) or 2) n (the number of samples drawn). From the school example, you can't draw 11 girls if you only draw 10 people.
      The minimum number of successes is MAX( 0, n-(N-a) ). You cannot have less than 0 successes so that is where that comes in. n-(N-a) is a little more intricate. Consider this case:
      10 balls (8 red, 2 green)
      You will draw 5, no replacement.
      Consider red to be success
      In this situation, the worst you can possibly do is pick 2 green balls. Thus, the minimum number of successes you can have is n-(N-a) or 5 - (10-8) or 3. However, you must keep in mind this depends on what you define as a "success". Consider the same situation except let green equal "success". In that case, the worst you can do is 0 (if you picked all red balls) since n-(N-a) or 5 - (10-2) = -3. Thus the minimum number of successes would be MAX(0,-3).
      Hope that helped.

    • @AloofMusician 9 years ago

      Jeff Aggas Thank you for your detailed and useful reply :)
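The bounds discussed in this thread can be checked numerically. The following is a minimal sketch using the thread's notation (N = population size, a = successes in the population, n = sample size); the helper name `hyper_support` is an assumption for illustration.

```r
# Smallest and largest possible number of successes in the sample.
# N = population size, a = successes in the population, n = sample size.
# (Helper name is an illustrative assumption.)
hyper_support <- function(N, a, n) {
  c(min = max(0, n - (N - a)), max = min(a, n))
}

# 10 balls (8 red, 2 green), 5 drawn without replacement.
red_success   <- hyper_support(N = 10, a = 8, n = 5)  # red counted as success
green_success <- hyper_support(N = 10, a = 2, n = 5)  # green counted as success
print(red_success)
print(green_success)

# dhyper() assigns zero probability to values outside these bounds.
print(dhyper(0:5, m = 8, n = 2, k = 5))
```

With red as the success, only 3, 4, or 5 successes are possible, because at most 2 of the 5 draws can be green; with green as the success, the count can range from 0 to 2.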

  • @adnanmohamed6517 3 years ago

    I request from YouTube to add the feature of golden likes!

  • @abrish338 2 years ago

    Excellent

  • @harisrg92 7 years ago

    In the last example about different politicians, if the question explicitly mentioned that the choosing was done WITH REPLACEMENT, could we have used binomial formula in some sort of way?

    • @jbstatistics 7 years ago +1

      The last example involves 3 groups, so if the sampling were done with replacement, we would use the multinomial distribution to answer the question. (The multinomial distribution is a straightforward generalization of the binomial distribution.)
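The with-replacement version mentioned in the reply above can be computed with base R's `dmultinom()`. This is a sketch, assuming the group sizes 12, 24, and 8 from the R code in the video description; the variable names are illustrative.

```r
# Group sizes taken from the R code in the video description.
group_sizes <- c(democrat = 12, republican = 24, independent = 8)
counts      <- c(3, 2, 1)

# With replacement: multinomial, with probabilities equal to the group proportions.
p_with <- dmultinom(counts, prob = group_sizes / sum(group_sizes))

# Without replacement: multivariate hypergeometric, as in the video description.
p_without <- prod(choose(group_sizes, counts)) /
  choose(sum(group_sizes), sum(counts))

print(c(with_replacement = p_with, without_replacement = p_without))
```

The probabilities of the three group counts are not added. The multinomial pmf multiplies the group proportions raised to the observed counts and then multiplies by the number of orderings of the 6 draws.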

  • @hassanabdi9280 9 years ago

    I like this explanation .
    It can help me in my studies.
    Best regards

  • @prakharbansal9079 7 years ago

    brilliant explanation....hats off...😊