Chi-square tests: Goodness of Fit for the Binomial Distribution

  • Published on 2 Oct 2024
  • I work through an example of testing the null hypothesis that the data comes from a binomial distribution. I do this for two tests, one in which the probability of success is specified in the null hypothesis, and one where it is estimated from the data.
    Data reference
    The Larry Bird free throw data based on information in:
    Wardrop, R.L. (1995). Simpson's paradox and the hot hand in basketball. The American Statistician, 49 (1), 24-28.

Comments • 70

  • @hassannauman6194
    @hassannauman6194 2 years ago

    THANKYOUUUUU 9 YEARS LATER AND STILL HELPFUL

  • @jbstatistics
    @jbstatistics 11 years ago +20

    I'm glad to be of help. I've got lots of good stuff on this channel :)

  • @hotandcutericky
    @hotandcutericky 10 years ago +41

    This is SOOOOOO MUCH easier to understand than a two hour lecture

    • @jbstatistics
      @jbstatistics 10 years ago +7

      Thanks Ricky. I'm glad to be of help!

  • @rewrose2838
    @rewrose2838 4 years ago

    Thank you for this clear and concise explanation.
    My teacher gave an assignment with questions about the goodness-of-fit procedure without covering the topic in class 😒

  • @MrAndreaCaso
    @MrAndreaCaso 7 years ago +4

    We need you! Thank you so much again. Come back and publish something, please. And if you have a channel on Patreon I'd be happy to support you!

    • @jbstatistics
      @jbstatistics 7 years ago

      Hi Andrea! Thanks for the kind words. Every day I think about getting back to producing videos. (In fact, just this morning I was mapping some out in my head.) I'm still crushed for time, but I'll force myself to find the time to produce one and go from there. Thanks for the inspiration!

  • @jbstatistics
    @jbstatistics 11 years ago +7

    I'd love to do videos on order statistics, and may get around to it at some point in the future, but I've got a lot of other topics lined up before that. Cheers.

  • @geunik
    @geunik 11 years ago +9

    A day before my final math exam, I'm going through your video, thinking
    WHY COULDN'T I FIND THIS BEFORE D:

  • @assansanogo1343
    @assansanogo1343 8 years ago +10

    it's so clear, it becomes art. Thx for such a great job!!!

  • @yasinzamani9467
    @yasinzamani9467 5 years ago +2

    Why didn't you calculate two-tail p-value?!

  • @rahsaansargusingh6557
    @rahsaansargusingh6557 4 years ago +2

    Thank you so much, this was very, very, very simple and straightforward. This video answered all my questions with no waffle added.

  • @SupFreshJR
    @SupFreshJR 7 years ago

    These videos are ok. My teacher makes me watch and answer questions about it online which is like deciphering hieroglyphics when it actually comes time to use any of it in R.

  • @wronski11
    @wronski11 9 years ago +1

    Very nice video. Actually, the entire set of videos on statistics is very good. Concerning this video: how would the above methodology generalize if we had a multidimensional contingency table? Say we count the elements in 5 boxes, where each box can hold 5 different elements. In this case we would have 16 degrees of freedom. More precisely, how would we compute the theoretical frequencies for the cells, assuming a uniform distribution?

  • @loislovescheese4776
    @loislovescheese4776 9 months ago

    What if you had 3 replicates for each variable? Say I have variables a, b, c, d and 3 replicate values for each (e.g. variable a has values 0.1, 0.2, 0.3). Can I do the chi-squared test on the 4 variables? Do I have to find the mean of each variable and then do the chi-squared test? Or is there a better test for that? Thanks!

  • @haileyzen1430
    @haileyzen1430 3 years ago

    Thank you for this, but does the 2nd example, where the degrees of freedom are reduced by 2, work the same way for the Poisson distribution as well?

    • @jbstatistics
      @jbstatistics 3 years ago +1

      Yes. Using the data to estimate lambda would cause a loss of one degree of freedom, and the DF would end up being # cells - 2.

    • @haileyzen1430
      @haileyzen1430 3 years ago

      @jbstatistics Understood, thank you!
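The degrees-of-freedom rule discussed in this thread (number of cells, minus 1, minus one more for each parameter estimated from the data) can be written as a small helper. This is only a sketch: the function name `gof_degrees_of_freedom` is made up for illustration and does not come from the video.

```python
def gof_degrees_of_freedom(num_cells: int, num_estimated_params: int = 0) -> int:
    """Degrees of freedom for a chi-square goodness-of-fit test.

    Hypothetical helper: cells - 1, minus one per parameter estimated
    from the data (e.g. p for a binomial, lambda for a Poisson).
    """
    df = num_cells - 1 - num_estimated_params
    if df < 1:
        raise ValueError("not enough cells for this many estimated parameters")
    return df


# Three cells (0, 1 or 2 makes), p stated in the null hypothesis
print(gof_degrees_of_freedom(3))
# Three cells, p estimated from the data
print(gof_degrees_of_freedom(3, num_estimated_params=1))
```

With the three cells of the free-throw example this gives 2 when p is specified in the null hypothesis and 1 when p is estimated. A Poisson fit with an estimated lambda follows the same pattern, i.e. number of cells minus 2.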

  • @Ivon_vlog
    @Ivon_vlog 5 years ago

    Why don't you find the percentiles in the chi-square table and compare them with the test statistic?

  • @ThanhThuy-wb6lm
    @ThanhThuy-wb6lm 8 months ago

    thanks for your very understandable explanation :D

  • @ahmedel-hindawi9226
    @ahmedel-hindawi9226 2 years ago

    that first example was kinda hard to observe for non-american people

  • @balajitj
    @balajitj 5 years ago +2

    Thanks for your wonderful videos. Love the way you teach concepts.

  • @slayvenom5900
    @slayvenom5900 4 months ago

    tq for the df point
    exam in 2hrs

  • @MrCodmaster777
    @MrCodmaster777 10 years ago +5

    thanks alot for the help

    • @jbstatistics
      @jbstatistics 10 years ago +1

      You are very welcome.

  • @Leonardo-jv1ls
    @Leonardo-jv1ls 4 years ago

    Thank you a lot for the help. Your videos are amazing. But just one question here. In the last part, in the test of the binomial assumption, shouldn't the hypothesis be "H0: Larry Bird's number of successes...a binomial distribution WITH p = CALCULATED P"? I mean, why isn't the calculated probability value mentioned when it's used to check whether the binomial is a good approximation? I really have this question. My other doubt is: how can I be sure to use a binomial distribution as an approximation if the chi-squared test doesn't prove it's a good approximation, but only shows that it can't be rejected? It is so complex.

  • @supercitty90
    @supercitty90 1 year ago

    legend for getting me to understand degrees of freedom using a basic example. THIS IS HOW TEACHING SHOULD BE DONEEEE

  • @PandemicGameplay
    @PandemicGameplay 3 years ago

    Clear, concise, easily visible to read from a dark screen, what's not to like?

  • @rattlesnake255
    @rattlesnake255 2 years ago

    Thank you!!! Amazing video.
    When calculating expected values it looks like you have to double the value where x=1 because you can do that 2 ways.
    Make/Miss or Miss/Make.
    Am I thinking about that right??
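On the make/miss question above: one way to check the "two ways" reasoning is to enumerate every two-shot sequence and add up the probabilities by number of makes. The sketch below assumes independent shots with p = 0.8 and the 338 pairs mentioned elsewhere in the comments. The variable names are illustrative only.

```python
from itertools import product

p = 0.8        # per-shot success probability, as discussed in the comments
n_pairs = 338  # number of free-throw pairs mentioned in the comments

# Total probability for each possible number of makes in a pair
prob_by_makes = {0: 0.0, 1: 0.0, 2: 0.0}

# Each outcome is a pair such as (1, 0) = make then miss
for outcome in product((0, 1), repeat=2):
    prob = 1.0
    for shot in outcome:
        prob *= p if shot == 1 else 1 - p
    prob_by_makes[sum(outcome)] += prob

for makes, prob in prob_by_makes.items():
    print(makes, round(prob, 4), round(n_pairs * prob, 2))
```

Two sequences, (make, miss) and (miss, make), land in the "1 make" cell, which is where the factor of 2 in the binomial formula comes from. Under these assumptions the expected counts for 0 and 1 makes come out at 13.52 and 108.16, the figures quoted in another thread here.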

  • @phungijuba4170
    @phungijuba4170 10 years ago +6

    Thank you very much.

    • @jbstatistics
      @jbstatistics 10 years ago

      You are very welcome!

    • @samchen8412
      @samchen8412 8 years ago +2

      +jbstatistics your new album Purpose is awesome!

  • @simongomez2895
    @simongomez2895 6 years ago +1

    Thank you Justin Bieber, you are the very best. I love you and i hope you keep doing your thang. If you can, come to Bogotá, Universidad de Los Andes. People here love you and watch your videos every day.

    • @jbstatistics
      @jbstatistics 6 years ago

      I'm always happy to help my friends in Colombia! I'm glad to hear you find my videos helpful. Perhaps I'll visit someday!

    • @realdvgarg
      @realdvgarg 4 years ago

      hol up ur name is justin beiber?

    • @serrataban930
      @serrataban930 3 years ago +1

      @realdvgarg lol no :d it is actually Dr. Jeremy Balka, a professor at the University of Guelph

  • @ashutoshsharma22
    @ashutoshsharma22 1 year ago

    I don't know how I ended up here, but boy am I glad that I found you. Great content and to the point. Cheers.

  • @jayaprakashksalian
    @jayaprakashksalian 4 years ago

    Hi,
    How can we perform a significance test for non-binomial data like ARPU?

  • @AnshLaxmiBoutique
    @AnshLaxmiBoutique 5 years ago

    Why is p 0.8?

  • @manostube
    @manostube 6 years ago

    Let's say Larry made a) 1 of 2: 120 times, b) 2 of 2: 200 times and, of course, c) 0 of 2: 18 (338-320) times. The H0 that the number of makes follows a binomial distribution would not be rejected (p=0.13). However, when examining only one claim of H0 (p=0.8) by considering the total number of shots made, we would have 520 makes observed (120 + 2 x 200) and 156 misses vs 540.8 and 135.2 expected, and the chi-square test now suggests that the probability of making each free throw cannot be 0.8 (p-value 0.04). The strange thing for me is that, conditioning on independence of each pair of free throws, the two tests should have close p-values; however, this is not the case. What am I missing? Any intuitive explanation? Thanks a lot!

    • @yaweli2968
      @yaweli2968 2 years ago

      I don't quite understand your question, but I think the second test is reasonable and isn't supposed to have a p-value close to the first. The original data has 5 pairs with 0 made against an expectation of 13.52, whereas your 18 missed pairs are considerably closer to 13.52 than those 5 missed pairs are. 120 is likewise closer to 108.16, even though you are rejecting the null with a p-value below 5%. I don't know if this is the question you are asking, or if it's something else.
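The figures in this thread can be reproduced with a short script. What follows is a sketch, not the method used in the video: it takes the hypothetical counts from the question (18, 120 and 200 pairs with 0, 1 and 2 makes), assumes p = 0.8 under the null, and uses closed-form right-tail areas that hold only for 1 and 2 degrees of freedom. The helper names are illustrative.

```python
import math


def chi_square_stat(observed, expected):
    """Pearson chi-square statistic: sum of (O - E)^2 / E over the cells."""
    return sum((o - e) ** 2 / e for o, e in zip(observed, expected))


def chi_square_right_tail(x, df):
    """Area to the right of x; closed forms exist for df = 1 and df = 2."""
    if df == 1:
        return math.erfc(math.sqrt(x / 2))
    if df == 2:
        return math.exp(-x / 2)
    raise ValueError("closed form only implemented for df = 1 or 2")


p = 0.8
n_pairs = 338
pair_counts = [18, 120, 200]  # hypothetical counts of 0, 1, 2 makes

# Test 1: three cells (makes per pair), p specified, so df = 3 - 1 = 2
pair_expected = [n_pairs * q for q in ((1 - p) ** 2, 2 * p * (1 - p), p ** 2)]
pair_stat = chi_square_stat(pair_counts, pair_expected)
pair_p = chi_square_right_tail(pair_stat, df=2)

# Test 2: two cells (made vs missed shots), p specified, so df = 2 - 1 = 1
makes = 120 + 2 * 200
shots = 2 * n_pairs
shot_stat = chi_square_stat([makes, shots - makes], [shots * p, shots * (1 - p)])
shot_p = chi_square_right_tail(shot_stat, df=1)

print(round(pair_stat, 2), round(pair_p, 3))
print(round(shot_stat, 2), round(shot_p, 3))
```

Under these assumptions the two statistics are nearly equal (about 4.01 and 4.00), but they are referred to chi-square distributions with 2 and 1 degrees of freedom respectively. That gives p-values of roughly 0.13 and 0.046, in line with the numbers quoted in the question.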

  • @Kkohli
    @Kkohli 6 years ago +1

    Complicated stuff

  • @NotJames1
    @NotJames1 6 years ago

    Great video. The old value of chi^2 reappeared at 12:30

  • @nabajyotidey2832
    @nabajyotidey2832 6 months ago

    i see we are doing hit and trial kind of a thing here .....

    • @jbstatistics
      @jbstatistics 6 months ago

      I don't understand.

  • @mieproductions6209
    @mieproductions6209 4 years ago

    very well explained video and clear voice. Thank you

  • @OnyinyechiVera
    @OnyinyechiVera 10 months ago

    It was really helpful

  • @منالمنصور-ث5خ
    @منالمنصور-ث5خ 7 years ago

    Thank you. Please, I need a book containing this part.

  • @siqili204
    @siqili204 4 years ago

    I did not get it, why DF-1-1???

  • @kanacaredes
    @kanacaredes 4 years ago

    Excellent video!!! Very clear...Well done!!!!

  • @husnainutube
    @husnainutube 10 years ago

    Simply GREAT !!! Fantastic Job

  • @waqasmeers
    @waqasmeers 6 years ago

    I didn't understand where 338 came from.

  • @samareenzubair4414
    @samareenzubair4414 2 years ago

    beautiful

  • @hisokaamorou4211
    @hisokaamorou4211 4 years ago

    it'd have been a lot easier if you had shown the process to calculate expected proportion as well!

    • @jbstatistics
      @jbstatistics 4 years ago

      I don't know what you mean. I go through the expected proportion calculations at 1:52. I don't pull out a calculator and punch the values in, but I show the formulas and the resulting value.

    • @hisokaamorou4211
      @hisokaamorou4211 4 years ago

      @jbstatistics At 9:32 there is number of makes, expected proportion and expected number. You've found the expected numbers, but I'm clueless about the calculation process there. It would be helpful if you could enlighten me on this. E.g. for number of makes = 1 there is: (2 choose 1) × .864^1 × (1-.864)^1 = .2351. Could you please tell me how this .2351 came about (I mean, how to do the calculation)?
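The calculation asked about here is the binomial probability formula, (n choose x) × p^x × (1-p)^(n-x), with n = 2 shots per pair. A minimal sketch follows, assuming the rounded estimate .864 quoted in the comment and the 338 pairs mentioned elsewhere in the thread. The function name `binomial_pmf` is illustrative.

```python
from math import comb


def binomial_pmf(x: int, n: int, p: float) -> float:
    """P(X = x) for a binomial(n, p): C(n, x) * p^x * (1 - p)^(n - x)."""
    return comb(n, x) * p ** x * (1 - p) ** (n - x)


p_hat = 0.864   # rounded estimate quoted in the comment above
n_pairs = 338   # number of free-throw pairs mentioned in the comments

for makes in range(3):
    proportion = binomial_pmf(makes, 2, p_hat)
    # expected number = total pairs * expected proportion
    print(makes, round(proportion, 4), round(n_pairs * proportion, 2))
```

For 1 make this evaluates 2 × .864 × .136, which is about .2350 with the rounded .864. The .2351 quoted above presumably comes from carrying more decimal places of the estimate.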

  • @benhorspool7750
    @benhorspool7750 4 years ago

    Super helpful, thank you

  • @jamessmith8097
    @jamessmith8097 6 years ago

    im an idiot. i dont understand any of this.. ergghh

    • @jbstatistics
      @jbstatistics 6 years ago +1

      That doesn't make you an idiot! :)

  • @pvsmouli7727
    @pvsmouli7727 7 years ago

    What is that software you are talking about, through which I can find the p-value?

    • @jbstatistics
      @jbstatistics 7 years ago

      Many different software packages can calculate the area under a chi-square distribution. I usually use the (free) statistical software R, and in that software the command pchisq(x,df) yields the area to the left of x under a chi-square distribution with df degrees of freedom.

    • @pvsmouli7727
      @pvsmouli7727 7 years ago

      thanks bro
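For readers without R: the left-tail area that pchisq(x, df) returns can be approximated in plain Python, and the right-tail p-value of the test is then 1 minus that area. The sketch below is an assumption-laden stand-in, not R's implementation. It uses a series expansion of the regularized lower incomplete gamma function, and the name `chi_square_cdf` is made up for illustration.

```python
import math


def chi_square_cdf(x: float, df: int) -> float:
    """Area to the left of x under a chi-square curve with df degrees of freedom.

    Series for the regularized lower incomplete gamma function P(df/2, x/2);
    adequate for moderate x, slow for very large x.
    """
    if x <= 0:
        return 0.0
    a, z = df / 2, x / 2
    term = 1.0 / a
    total = term
    k = 1
    # Add terms z^k / (a (a+1) ... (a+k)) until they stop mattering
    while term > 1e-15 * total:
        term *= z / (a + k)
        total += term
        k += 1
    return total * math.exp(-z + a * math.log(z) - math.lgamma(a))


x, df = 4.0, 1  # illustrative values, not taken from the video
left_area = chi_square_cdf(x, df)
print(round(left_area, 4), round(1 - left_area, 4))  # left area, right-tail p-value
```

The chi-square goodness-of-fit p-value is the area to the right of the test statistic, so with a left-tail function like this one it is `1 - chi_square_cdf(x, df)`.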

  • @theuknowndreamer9402
    @theuknowndreamer9402 7 years ago

    How did you calculate the p-value?

    • @jbstatistics
      @jbstatistics 7 years ago

      I typically use R to find these areas, but there are a variety of other statistics packages that will do it (e.g. SAS, STATA, Excel, SPSS).

  • @navinkrishnan9419
    @navinkrishnan9419 6 years ago

    Thank you!