Confidence interval of difference of means | Probability and Statistics | Khan Academy

แชร์
ฝัง
  • เผยแพร่เมื่อ 5 ก.ย. 2024
  • Courses on Khan Academy are always 100% free. Start practicing-and saving your progress-now: www.khanacadem...
    Confidence Interval of Difference of Means
    Watch the next lesson: www.khanacadem...
    Missed the previous lesson?
    www.khanacadem...
    Probability and statistics on Khan Academy: We dare you to go through a day in which you never consider or use probability. Did you check the weather forecast? Busted! Did you decide to go through the drive through lane vs walk in? Busted again! We are constantly creating hypotheses, making predictions, testing, and analyzing. Our lives are full of probabilities! Statistics is related to probability because much of the data we use when determining probable outcomes comes from our understanding of statistics. In these tutorials, we will cover a range of topics, some which include: independent events, dependent probability, combinatorics, hypothesis testing, descriptive statistics, random variables, probability distributions, regression, and inferential statistics. So buckle up and hop on for a wild ride. We bet you're going to be challenged AND love it!
    About Khan Academy: Khan Academy offers practice exercises, instructional videos, and a personalized learning dashboard that empower learners to study at their own pace in and outside of the classroom. We tackle math, science, computer programming, history, art history, economics, and more. Our math missions guide learners from kindergarten to calculus using state-of-the-art, adaptive technology that identifies strengths and learning gaps. We've also partnered with institutions like NASA, The Museum of Modern Art, The California Academy of Sciences, and MIT to offer specialized content.
    For free. For everyone. Forever. #YouCanLearnAnything
    Subscribe to KhanAcademy’s Probability and Statistics channel:
    / @khanacademyprobabilit...
    Subscribe to KhanAcademy: www.youtube.co...

ความคิดเห็น • 49

  • @mattjelani
    @mattjelani 2 ปีที่แล้ว +5

    This video helped a little but it's all over the place, definitely needs more organization.

  • @sumedhakappagantula9824
    @sumedhakappagantula9824 3 ปีที่แล้ว +16

    I owe Khan Academy 50% of my academic career

  • @megan2584
    @megan2584 2 ปีที่แล้ว +5

    Wish I could like this video twice! You are single handedly getting me through my Biostats module in my masters. Thank you!!!!

  • @1994sammahdi
    @1994sammahdi ปีที่แล้ว +2

    Thought I'd just add a tidbit here since I find the terminology a bit confusing, and find some of the setup, especially the conclusion, unexplained or poorly explained (this results in some of the confusion in the comments). Anyone, please correct me if I'm wrong.
    The intention of determining the confidence interval, is to see 1) Is there weight loss between the 2 diets and 2) How much? To that end, we need to determine, on average, what is the weight difference between the 2 groups (this is why you take the difference of the means). So now you have a new distribution, the distribution of the weight differences between group 1 and group 2. The next step, is to determine, within 95% confidence (i.e. 95% chance if I were to pick a random person between group 1 and group 2, they would have lost this amount of weight), how much weight is lost between group 1 and group 2. You do all the math, and you arrive with a confidence interval of 0.7 to 3.12lbs. This means if you were to pick 2 random people between group 1 and group 2, there is a 95% chance the weight lost between the 2 would be between 0.7lbs to 3.12lbs. Now to answer the 2 initial questions.
    Is there weight loss between the 2 diets? Yes, because even the lowest value (0.7lbs) is above 0lbs. Again, this is not an absolute 100% there is weight loss (maybe if you do 99.9999% your range will be -1lbs to 5lbs, in which case there are people who haven't lost weight), but you are 95% confident (at least that's how I look at it). As to how much? Again, not an absolute 100%, but I am 95% confident with the data given, it is between 0.7-3.12lbs.

  • @purplepandasrock43
    @purplepandasrock43 2 ปีที่แล้ว +2

    its crazy how he talks so fast when he constantly stumbles over words and repeats himself and is always correcting himself. makes it hard to learn

  • @Ihatenicknames1
    @Ihatenicknames1 9 ปีที่แล้ว +22

    Thank you for the great video Sal! :D
    I was thinking, maybe I am mistaken, but I don't believe that a 95% confidence interval means that there is a 95% chance that the confidence interval contains the population mean. Because if that was true, then, no matter how "far off" your sample mean was, there would always be a 95% chance that the confidence interval around it contains the population mean. This makes it seem as if the population mean is "moving around".
    A 95% confidence interval means that 95% of all the SAMPLES you take, will contain the population mean. Not that one sample has a 95% chance of containing the population mean.

    • @liverpooler1997
      @liverpooler1997 8 ปีที่แล้ว

      +Ihatenicknames1 yeah I was wondering the same thing.

    • @clb8645
      @clb8645 7 ปีที่แล้ว

      Yes...the probability that the true difference between the population means lies within the confidence interval calculated from the difference between his sample means is either 0% or 100%.

    • @user-mi5sc9kf6z
      @user-mi5sc9kf6z 7 ปีที่แล้ว +8

      I believe the correct wording is "we are 95% confident that this interval overlaps the true population mean"

  • @opejojosephjedidiah
    @opejojosephjedidiah 10 หลายเดือนก่อน

    All I can say may the good Lord bless more with wisdom 😊😊

  • @geraldh78
    @geraldh78 11 ปีที่แล้ว +4

    Thank you for helping me through Biostats.

  • @mikelcuvet1470
    @mikelcuvet1470 ปีที่แล้ว

    Thank you for being the goat of explaining stuff my that my teachers cannot

  • @nicolasscicolone2112
    @nicolasscicolone2112 4 ปีที่แล้ว +6

    Question, since the sample standard deviations of both samples were used (as opposed to population standard deviations) why was the z-distribution (and thus z-table) used to estimate the critical value for the confidence interval? Wouldn't a t-distribution and t-table be more appropriate? Thanks!

    • @stevewang314
      @stevewang314 4 ปีที่แล้ว +6

      Because the sample size is 100, which is greater than 30, meaning that it would be appropriate to use normal distribution instead of t -distribution. Thus, we look up the z-table.

    • @thutomongale7637
      @thutomongale7637 ปีที่แล้ว

      Exactly! I also felt like they are not correct and I used t-distibutuon and I found that the difference between population means is between 0.69 and 3.13

  • @Faisal-wb1nu
    @Faisal-wb1nu ปีที่แล้ว +1

    I want to ask.. should the difference between two means be always postive? I mean X1 should be the bigger one so the result is positive

  • @sijialiu6240
    @sijialiu6240 ปีที่แล้ว

    The expected value of sample variance is an unbiased estimation of population variance, thats why the s is used to "replace" the σ of the population X1

  • @LaBar0ness
    @LaBar0ness 9 ปีที่แล้ว +1

    Thank you SO much for all of your video's Sal!! They have helped me SO much!

  • @chopper84a
    @chopper84a 11 ปีที่แล้ว +4

    I don't get the intuition behind why amalgamating the two sample means tells us anything?

  • @luthojilimane1262
    @luthojilimane1262 4 ปีที่แล้ว +1

    Please include the formulas in your calculations

  • @dfsfklsj
    @dfsfklsj 12 ปีที่แล้ว +3

    Hi khan, I think you might have a mistake here... You assumed the sample means are the true means of the population. Isn't it better if you can calculate a 95% confidence interval of the distribution of sample means of x1 and the control, and then say that the mean of their differences is 95%*95% between the differences of the calculated condifence intervals?

  • @CCbean63
    @CCbean63 7 ปีที่แล้ว +6

    I love your video's, I t is annoying though that you repeat everything you say as soon as you say it.

    • @keshavbansal5148
      @keshavbansal5148 3 ปีที่แล้ว +1

      Sometimes it helps getting the point across some dumb students like myself. :-( . But okay.

  • @sujeongmoon7761
    @sujeongmoon7761 2 ปีที่แล้ว

    dont we need to calculate pooled variance?

  • @fraidym7793
    @fraidym7793 4 ปีที่แล้ว

    How come the true mean is x1-x2 and not x1+x2 divided by 2?? Thanks Sal for the video(s)

  • @DrAKMAnisurRahman
    @DrAKMAnisurRahman ปีที่แล้ว

    Is it 1.96* sd or 1.96*SE (standard error?)

  • @desrucca
    @desrucca 2 ปีที่แล้ว +1

    I thought 95% is mean ± 2*stdev ??

  • @leetodvi
    @leetodvi 12 ปีที่แล้ว

    thank you for all !!!!!!!!!

  • @1rachellund
    @1rachellund 12 ปีที่แล้ว

    please - please- please number these :) thanks for the help cheers

  • @Oleander_Sky
    @Oleander_Sky 2 ปีที่แล้ว

    What if x bar - x bar = 0???

  • @Nina-kv4vn
    @Nina-kv4vn 8 ปีที่แล้ว

    Is this course, Probability and Statistics, available in PDF, Sal?

    • @fly869
      @fly869 7 ปีที่แล้ว

      Nina 1 ppp

  • @nO_d3N1AL
    @nO_d3N1AL 11 ปีที่แล้ว

    very explicit. Thanks

  • @kallievartt8070
    @kallievartt8070 3 ปีที่แล้ว +1

    would the difference of the sample means, 1.91, be the point estimate?

  • @vivienj6831
    @vivienj6831 7 ปีที่แล้ว +3

    I'm confused.... what makes this a z test rather than a t test?

    • @suuup4711
      @suuup4711 7 ปีที่แล้ว +3

      if a sample size is 30 or more we use z, if not we use t.

    • @vivienj6831
      @vivienj6831 7 ปีที่แล้ว

      Okay thank you do much!

    • @tricky92x
      @tricky92x 7 ปีที่แล้ว +1

      Only if you are fairly certain that the samples are from a normally distributed population. If you look, 100 degrees of freedom on the Student's t table for a 95% confidence interval is still significantly off from the relative Z score.

  • @RuthSarona
    @RuthSarona ปีที่แล้ว +1

    Idk, but the way he narrates is repetitive and confusing because of so much repetition. It's so hard to understand with the way it is explained.

  • @rohitashvaraj8399
    @rohitashvaraj8399 4 ปีที่แล้ว

    isn't Z score = ( X - Mean)/ std dev .. so at 06:20 it should be 1.96* std dev + mean , right ??

    • @14december89
      @14december89 3 ปีที่แล้ว +1

      You are right, but at 06:20 he was just calculating how many standard deviations the (X-mean) is,
      not until 13:20 then he calculated the X (the actual mean) which are within the (1.96* std dev + mean) with 95% confidence interval

    • @keshavbansal5148
      @keshavbansal5148 3 ปีที่แล้ว

      We just have the mean of the sample distributions, and not the actual mean of the population. x1(bar)-x2(bar) doesn't actually represent the population mean value. Which is why we cannot exactly use the formula you mentioned. Sal is trying to find the CI for the actual mean around the calculated sample mean.

  • @shekarsubramanian9562
    @shekarsubramanian9562 6 ปีที่แล้ว +2

    But why did we choose 95% confidence interval... why not 99% or some other value??

    • @ericaheng3905
      @ericaheng3905 3 ปีที่แล้ว

      Because we are taking samples the closest percentage we can take us 95% in this subject. I think thats wat my lecturer said 🤣

    • @ryanwong7426
      @ryanwong7426 3 ปีที่แล้ว +2

      you can choose whatever confidence interval u want to, depending on how confident you want to be in your results

  • @PerpetualTiredness
    @PerpetualTiredness 12 ปีที่แล้ว

    What does that 95% mean in plain English?

    • @gamerdio2503
      @gamerdio2503 5 ปีที่แล้ว +1

      95% of samples contain the mean in the interval