R-squared, Clearly Explained!!!

แชร์
ฝัง
  • เผยแพร่เมื่อ 20 ก.ย. 2024

ความคิดเห็น • 714

  • @statquest
    @statquest  4 ปีที่แล้ว +134

    NOTE: When I first made this video, I was thinking about how R-squared relates to Linear Regression, which will not fit a line worse than the mean of the y-axis values. This is because if the values along the x-axis are truly useless in terms of predicting y-axis values, then the slope of the line used to make predictions will be 0, and the intercept will equal the mean. However, it is possible to simply draw a line that fits the data worse than the mean and get a negative R^2.
    Support StatQuest by buying my book The StatQuest Illustrated Guide to Machine Learning or a Study Guide or Merch!!! statquest.org/statquest-store/

    • @mattkilgore7323
      @mattkilgore7323 4 ปีที่แล้ว +1

      With enough variables in the data set, it would be easy to create a set of r-squared values so that the cumulative percent "explained by" the different variables goes over 100%. That's why I was never a fan of that terminology. Students think it implies causation when it doesn't. Otherwise, great video.

    • @statquest
      @statquest  4 ปีที่แล้ว +28

      @@mattkilgore7323 Maybe I should have made it more clear, but if you have a large model with a lot of variables, then you don't add together a bunch of individual R-squared values to find the total R-squared. You calculate a single r-squared value fro the entire model. In other words r-squared refers to the models, not the individual variables.

    • @huahua154
      @huahua154 4 ปีที่แล้ว

      StatQuest with Josh Starmer If you only consider all unbiased lines, (mean of predicted ys equal mean of real ys), then no negative R^2.

    • @sunilkumarsamji8507
      @sunilkumarsamji8507 4 ปีที่แล้ว +1

      @@mattkilgore7323 Hi Matt can you explain the point you trying to make in a bit more detailed manner

    • @mattkilgore7323
      @mattkilgore7323 4 ปีที่แล้ว +2

      The phrase "explained by" can be deceptive, as students often think it means "caused by." But this is not what it means in the context of r-squared. Does that help?

  • @dannysnee4945
    @dannysnee4945 4 ปีที่แล้ว +231

    So glad this channel exists. It's rare that TH-cam videos on stats are this well done

    • @statquest
      @statquest  4 ปีที่แล้ว +6

      Thanks!

    • @shashankkhare1023
      @shashankkhare1023 4 ปีที่แล้ว +4

      @@statquest You are a lifesaver. I am surprised you dont have more subsciptions. I would recommend your channel to my colleagues, thank you so much :)

    • @statquest
      @statquest  4 ปีที่แล้ว +7

      @@shashankkhare1023 Thank you very much!!! Recommending my channel to your colleagues is the best complement you can give me. :)

    • @VinodKumar-nn7go
      @VinodKumar-nn7go 2 ปีที่แล้ว

      you can watch and learn from Dr. Ami Gates. Her videos are great..

  • @joenah5651
    @joenah5651 3 ปีที่แล้ว +39

    Thank you so much for making this sooooooo clear, I've struggled to understand the meaning of R2 for a week and you just made it clear to me in 10 min.

  • @ramprakash7872
    @ramprakash7872 5 ปีที่แล้ว +12

    You have explained the concept so neatly,clearly ( most importantly in an easier manner ) so that one could get deeper understanding of the concept, a fact that lot many text books / videos / articles failed to do. Keep making such videos !

  • @sanketbadhe3572
    @sanketbadhe3572 5 ปีที่แล้ว +6

    I read a lot on R square from different books and articles but this was the really different and very intuitive approach. Visualization is the best way to understand statistics and I think most books lack there.

  • @dikshyapattanaik3528
    @dikshyapattanaik3528 3 ปีที่แล้ว +6

    Your channel is blessing in disguise. Visual aids and the explanations are so smooth and easy to understand. Thank you very much.

    • @statquest
      @statquest  3 ปีที่แล้ว

      Thank you very much! :)

  • @alecvan7143
    @alecvan7143 4 ปีที่แล้ว +67

    I can't believe the simple relationship between R^2 and R was never made clear to me! Amazing as always!

    • @statquest
      @statquest  4 ปีที่แล้ว +5

      Awesome!!!! Thank you very much.

    • @pablo_brianese
      @pablo_brianese 3 ปีที่แล้ว

      I also appreciated his comments on the subject, and him sharing his opinions and intuitions.

    • @bleakmess
      @bleakmess ปีที่แล้ว

      Just a quick question?

  • @muffinman1
    @muffinman1 4 ปีที่แล้ว +29

    "sniff/weight relationship" debunked by StatQuest. Give this man a Nobel. :)

  • @DD-hh4tz
    @DD-hh4tz 6 ปีที่แล้ว +7

    Your videos are so easy to understand, and also explains the intuition behind. I really love the way you start the video, unlike other bouring lectures.

    • @statquest
      @statquest  6 ปีที่แล้ว +1

      Thank you so much! :)

  • @cartulinito
    @cartulinito ปีที่แล้ว +1

    I started following you 4 months ago, now I'm starting over from the very first video, I'll watch them all and understand everything.
    Thank you very much for this content.

  • @vamanieperumal5262
    @vamanieperumal5262 4 ปีที่แล้ว +6

    this is the best channel ever that can exist about statistics :D wonderful explanation and illustrations and the music! :) am glad I found this at the right time !

    • @statquest
      @statquest  4 ปีที่แล้ว

      Thank you so much 😀

    • @baxtables
      @baxtables 4 ปีที่แล้ว

      Are u glad u found it or are u asking us if u are glad??😆😆

  • @amirmohamed2428
    @amirmohamed2428 3 ปีที่แล้ว +6

    I had stats exam coming up and didn't know this particularly well, Thanks for making it much more simpler!

    • @statquest
      @statquest  3 ปีที่แล้ว

      Good luck on the exam! :)

  • @AdilKhan-sh9fv
    @AdilKhan-sh9fv 3 ปีที่แล้ว +3

    This channel has become my go-to resource for anything stat related.

    • @statquest
      @statquest  3 ปีที่แล้ว +1

      Bam! :)

    • @khanhdovanit
      @khanhdovanit 3 ปีที่แล้ว +1

      @@statquest Love your Bam and your singing

  • @gri189
    @gri189 ปีที่แล้ว +1

    Just recently found your channel. These are by FAR the most straight forward explanations I found so far. You sir are a godsend.

  • @yashshah1936
    @yashshah1936 4 ปีที่แล้ว +6

    Josh you are the best!!! Your every video has been helpful to god knows how many times in my studies. Much much love

    • @statquest
      @statquest  4 ปีที่แล้ว

      Thank you very much! :)

  • @kinvert
    @kinvert 4 ปีที่แล้ว +271

    So all this time I spent sniffing rocks to grow bigger was for nothing???

    • @statquest
      @statquest  4 ปีที่แล้ว +21

      Ha! You made me laugh. :)

    • @PeterXLuo
      @PeterXLuo 3 ปีที่แล้ว +3

      haha

    • @antonbagaev1771
      @antonbagaev1771 3 ปีที่แล้ว +1

      only if you are mouse

    • @andresrossi9
      @andresrossi9 2 ปีที่แล้ว +1

      I love you hahahaha

    • @muslimmukhtarkhanov8194
      @muslimmukhtarkhanov8194 2 ปีที่แล้ว +1

      You should have made a powder out of rocks, that would speed up your growing. Especially if your powder of white color🤣🤣🤣

  • @srikanth9450
    @srikanth9450 3 ปีที่แล้ว +4

    I have found a great channel for stats and trix.... BAM! it covers all the areas I want to learn.. Double BAM!! It's indeed clearly explained... Triple BAM!!!

  • @ivandeetlefs
    @ivandeetlefs 2 ปีที่แล้ว +2

    I would rather name this video VERY CLEARLY EXPLAINED. Thank you.

  • @tanmaymhatre6370
    @tanmaymhatre6370 3 ปีที่แล้ว +2

    love the way you explain things in casual manner

  • @sanjaykrish8719
    @sanjaykrish8719 6 ปีที่แล้ว +8

    Very beautifully explained. Many thanks to the folks of Genetics Department at the University of North Carolina at Chapel Hill.

  • @hewhomustnotbenamed5912
    @hewhomustnotbenamed5912 4 ปีที่แล้ว +4

    Added this to my useful tutorials and math playlists.
    Thanks StatQuests.

  • @ailsasun9308
    @ailsasun9308 ปีที่แล้ว +1

    The introductions are the cutest thing I have ever seen - the videos are also super duper helpful!

  • @firasal-nasir1909
    @firasal-nasir1909 6 หลายเดือนก่อน +1

    watching this again, thank you very much. I jumped into more advanced stuff because of your videos. 🙏🙏

    • @statquest
      @statquest  5 หลายเดือนก่อน

      Awesome!

  • @kowsergazi
    @kowsergazi 6 ปีที่แล้ว +1

    No one could make me understand R Squared in such easy way. Watched many videos. All made it complicated. Thanks.

    • @statquest
      @statquest  6 ปีที่แล้ว

      Hooray!!! I'm glad to hear the video was helpful! :)

  • @samuelliaw951
    @samuelliaw951 3 ปีที่แล้ว +2

    cool! you have cleared all the fogs around r2 in my head once for all. appreciate your explanation!

    • @statquest
      @statquest  3 ปีที่แล้ว

      Glad to help!

  • @ashokcgl
    @ashokcgl 2 ปีที่แล้ว +1

    Just impeccable. I don't think any other better illustration exists other than this. Thank you

  • @prateeksachdeva1611
    @prateeksachdeva1611 ปีที่แล้ว +1

    Could not have even imagined such intuitive explanation of this topic before watching this video. Thanks Josh!

  • @stevenpatterson8119
    @stevenpatterson8119 ปีที่แล้ว +1

    This helped me clearly understand R^2. Trying to grasp this from reading a textbook was impossible for me.

  • @Steve-go3zp
    @Steve-go3zp 3 ปีที่แล้ว +1

    I have been looking at a variety of stats videos and these are clearly the best. I am so impressed with StatQuest that I renamed my four dogs, "StatQuest," "StatQuest," "StatQuest," and "John Stamos" because, of course...

  • @mohammadhassanjafari481
    @mohammadhassanjafari481 2 ปีที่แล้ว +1

    I did not see anyone explain the statistics better than you
    God bless you ...

  • @rajsingh9869
    @rajsingh9869 ปีที่แล้ว +1

    really boom...I was confused from past 3 days to understand regression value ...now I understand. Thanks

  • @senwang8468
    @senwang8468 5 ปีที่แล้ว +5

    这样的创作者请给我来一百个!thank you for your videos!I really appreciate what you have done, and look forward to seeing more of them~

    • @statquest
      @statquest  5 ปีที่แล้ว

      Thank you very much! :)

  • @wayne02058
    @wayne02058 4 ปีที่แล้ว +3

    a simple concept explained simply. thank you for the straight forward explanation

    • @statquest
      @statquest  4 ปีที่แล้ว

      Thank you very much! :)

  • @thisis7734
    @thisis7734 5 ปีที่แล้ว +5

    Amazing explanation!!Made it very simply for me to understand!! :)
    I went through so much content for this..thank you

    • @statquest
      @statquest  5 ปีที่แล้ว +2

      Hooray! I'm glad the video was helpful.

  • @minilamabianco
    @minilamabianco 4 ปีที่แล้ว +8

    You are a rarity ❤️ really love how you explain statistics! Please tell us more 🙏🏻♥️♥️♥️

    • @statquest
      @statquest  4 ปีที่แล้ว +1

      Thanks! :)

  • @manikyar7115
    @manikyar7115 2 ปีที่แล้ว +3

    Really enjoying your videos. Moreover everything is crystal clear and I am able to understand them. Double BAM

    • @statquest
      @statquest  2 ปีที่แล้ว

      Hooray!!! That's great news. BAM! :)

  • @AydinCGur
    @AydinCGur 2 ปีที่แล้ว +1

    Wonderful explanation again. I easily understood the concept. I'm grateful.

  • @rabbisheryl9613
    @rabbisheryl9613 4 ปีที่แล้ว +4

    aha! we meet again, and I thank you again!!! wow, I wish you were teaching my class!!

    • @statquest
      @statquest  4 ปีที่แล้ว

      Ha! I'm glad my videos are so helpful. :)

  • @kumarransing8489
    @kumarransing8489 ปีที่แล้ว +1

    Really good explanation as to why r squared is significant in describing variation in data. Thank you!

    • @statquest
      @statquest  ปีที่แล้ว

      Glad you liked it!

  • @goodester6924
    @goodester6924 5 ปีที่แล้ว +2

    Was struggling to understand this concept but this video explained everything!

    • @statquest
      @statquest  5 ปีที่แล้ว +1

      Hooray! :)

  • @nicholesutter80
    @nicholesutter80 3 ปีที่แล้ว +1

    Working on my MPA stats final and this video has been so helpful

  • @dearwriter9659
    @dearwriter9659 2 ปีที่แล้ว +1

    Now I understand the R squared much better! Thank goodness for this video!

    • @statquest
      @statquest  2 ปีที่แล้ว

      Glad it helped!

  • @thyang3999
    @thyang3999 4 ปีที่แล้ว +2

    Thank you very much, your video is very easy way to understand, makes me want to go through the Statistics course again.

    • @statquest
      @statquest  4 ปีที่แล้ว

      You can do it!

  • @fredyrojas7884
    @fredyrojas7884 2 ปีที่แล้ว +1

    These videos are pretty cool. I can always come back and refresh concepts.

    • @statquest
      @statquest  2 ปีที่แล้ว

      Glad you like them!

  • @mohdbasryt
    @mohdbasryt 4 ปีที่แล้ว +3

    Great teachers make everything interesting! Thanks Josh

    • @statquest
      @statquest  4 ปีที่แล้ว

      Thank you! :)

  • @shinhyelee3169
    @shinhyelee3169 5 ปีที่แล้ว +4

    Ah! this is the best video explaining R squared! Thank a lot!

    • @statquest
      @statquest  5 ปีที่แล้ว

      Thank you! :)

  • @samuelgrubb2976
    @samuelgrubb2976 ปีที่แล้ว +1

    I never comment, but today is not that day. Thank you so much for this!!!! I am in graduate school and am still struggling to understand these concepts, you're a life saver

  • @aamuz1cool
    @aamuz1cool 6 ปีที่แล้ว +9

    Adding to my previous comment , R2 value can be negative when the variance explained by the line is lesser than the variance explained by mean.
    For example var(mean) = 30 and var(line) = 40
    Then R2 = -0.3
    There exists such models , perhaps that could be worst models.

    • @statquest
      @statquest  6 ปีที่แล้ว +8

      This is technically correct, but practically speaking, R-squared is always positive because it is used to compare the least squares residuals for the best fitting model to the least squares residuals for the mean, and the best fitting model can't have larger residuals than the mean, otherwise the best fitting model would be the mean. Does that make sense?

    • @aamuz1cool
      @aamuz1cool 6 ปีที่แล้ว +1

      Completely agree with you in terms of practicality. It doesn't make sense at all. At the end of day you want a model which performs better than the base model. My point was it can be negative. Nevertheless i really like your videos. That comment of mine was just to clarify my understanding and to reach out to you.

    • @statquest
      @statquest  6 ปีที่แล้ว +4

      I was thinking more about the negative R-squared and how it could be used in practice. I mean, like you said, even if your model is terrible, worse than the mean, it still might be nice quantify how terrible it is - and that's where the negative R-squared could come in handy. It still has the same meaning, except now you're quantifying how much worse your model is than the mean. Interestingly, it still works out even if var(terrible model) is so bad that the R-squared is less than -1. For example, if var(mean) = 50 and var(terrible model) = 100, then R-squared = (50 - 100) / 50 = -1, so "terrible model" is 100% worse than the mean. If var(terrible model) = 150, then R-squared = (50 - 150) / 50 = -2, and now terrible model is 200% worse.

    • @aamuz1cool
      @aamuz1cool 6 ปีที่แล้ว +4

      Right , That's my point. From my own experience , I used to train multiple models on a sample dataset and compute their respected R-squared value to choose the best among those models. There I encountered some models returning negative R-squared value. Those models are practically useless and if you agree that happens when your training data is so huge and the algorithm you are using is so insignificant, like using a multi variant regression for a heavily skewed target variable.That was the motivation behind my comment. I appreciate your time to reply back to my comments. I am glad that it grabbed your attention Mr. Josh.

    • @spacedustpi
      @spacedustpi 5 ปีที่แล้ว

      @@statquest I asked a question about this too and I assumed you meant the best fitting line (even though it was not explicitly stated in the video), or at least one that performed better than the mean line.

  • @anacarolinamartinelli9065
    @anacarolinamartinelli9065 2 ปีที่แล้ว +2

    This video is absoluted amazing! R^2 and R finaly understood!

  • @Love4ever1223
    @Love4ever1223 4 ปีที่แล้ว +1

    god bless, i have been searching high and low for this kind of video. Thank you!!!!

  • @pablo_brianese
    @pablo_brianese 3 ปีที่แล้ว +2

    Thank you for this precious material

    • @statquest
      @statquest  3 ปีที่แล้ว +1

      I'm glad you like it!

  • @adnanshahnawaz6808
    @adnanshahnawaz6808 ปีที่แล้ว +1

    "time spent sniffing a rock"! had me cracking😂.... btw thanks josh for putting such great content up... this channel is the my primary source of building my statistics foundations....

    • @statquest
      @statquest  ปีที่แล้ว +1

      Glad you like them!

  • @mmhamed1
    @mmhamed1 2 ปีที่แล้ว +1

    You know what i decided to start watching your videos from the beginnimg .. baaam .. thanks

    • @statquest
      @statquest  2 ปีที่แล้ว

      Awesome! Thank you!

  • @paololara3115
    @paololara3115 4 ปีที่แล้ว +1

    Thank you, this was a life-giver! Josh Starmer, you just might have become a part of something which will be big

    • @statquest
      @statquest  4 ปีที่แล้ว

      Wow, thanks!

  • @toast34
    @toast34 ปีที่แล้ว

    Came here from the Pearson's correlation video. Thank you so much for this
    I just wish that you could show in the video:
    • how (Var(mean)-Var(line)) / Var(mean) is equal to [Covar(x,y) / (Var(x)^-2)(Var(y)^-2)]^2
    • whether (Var(mean)-Var(line)) / Var(mean) using mean and differences from the x-axis also yields the same value
    Again, thank you for the video

    • @statquest
      @statquest  ปีที่แล้ว +1

      I'll keep that in mind.

  • @abdullahattia2491
    @abdullahattia2491 ปีที่แล้ว +1

    my dude I understood and I am happy
    8-year-old video is this good
    liked, subbed and thank you!

    • @statquest
      @statquest  ปีที่แล้ว +1

      My dude! Thank you very much! :)

  • @elfmas
    @elfmas 3 ปีที่แล้ว +1

    Thanks, every question/doubt that I had instantly got answered about 10 seconds later.

  • @donfeto7636
    @donfeto7636 4 ปีที่แล้ว +3

    Thank You Statquest your video and my knowledge of R^2 have a R^2 of 99.99

  • @yasinzamani9467
    @yasinzamani9467 5 ปีที่แล้ว +4

    Thank you for this easy to understand video :-)
    I have two suggestion!
    - Time 0:50 -- instead of `strongly related` it is better to say `strongly linear related`! We know that `R` can't explain nonlinear relationships (e.x. Y = X^2)!
    - Time 10:00 -- instead of `0.7^2 = 0.5` it is better to say `0.7^2 \approx (is approximately equal to) 0.5` ;-)

    • @statquest
      @statquest  5 ปีที่แล้ว +3

      Interestingly, and little known, but R^2 can be calculated for equations like y = a + b*x^2. That equation makes a curve, which is not linear, but the equation is _linear in its parameters_ (the parameters are 'a' and 'b', not 'x^2'), and that is what makes a "linear model" linear. A linear model doesn't have result in a straight line, but it must be linear in its parameters. That means you can calculate R^2 for y = a + b*x^2 or even y = a + b*sin(x). Not many people know this though since they don't understand what the "linear" in "linear models" actually refers to.

    • @yasinzamani9467
      @yasinzamani9467 5 ปีที่แล้ว +2

      Yes, and in y = a + b*x^2 or y = a + b*sin(x) it is better to say `y` has a linear relationship with `x^2` or `sin(x)`, not `x`!

  • @KIKI-NJ
    @KIKI-NJ 3 ปีที่แล้ว +1

    So proud of me because I'm watching these videos. Very very goood job thanks 😊 👍 👏

  • @funnyclipsutd
    @funnyclipsutd 4 ปีที่แล้ว +2

    After watching your videos, I aced my stats module!

    • @statquest
      @statquest  4 ปีที่แล้ว +1

      TRIPLE BAM!!! Congratulations!!! :)

  • @anilsarode6164
    @anilsarode6164 4 ปีที่แล้ว +4

    hats off and thanks a lot you will make me cry. thanks once again.

    • @statquest
      @statquest  4 ปีที่แล้ว

      Thanks! :)

    • @baxtables
      @baxtables 4 ปีที่แล้ว

      Why cry mate???

  • @DiegoMachida
    @DiegoMachida 3 ปีที่แล้ว +1

    You goddamm beautiful man, Im eating your videos like candy nowadays, Im finishing an electrical and comms engineering degree and working with some computer science and I usually get hammered with statistical questions when I finish presenting my models, thanks to your uploads i've held my own against some nasty expert old timers, thank you for this.

    • @statquest
      @statquest  3 ปีที่แล้ว

      Awesome! TRIPLE BAM! :)

  • @sunilkumarsamji8507
    @sunilkumarsamji8507 4 ปีที่แล้ว +1

    Hi Josh Starmer Thanks a lot for the explaination. By eye we can see that variation around the mean is higher than variation around the blue line. @ 6 : 15 it is mentioned that size - weigth relationship accounts for the 81% of total variation in the data. However, i feel its the otherway. Variation around the mean contributes higher percentage to the total variation in the data which is 81 % that has been reduced by considering variation around the mean. This means to say that size-wieight relationship contributes to 19 % variation in the data. I have used the word "contributes" instead of "accounts for". Please correct me If am wrong.

    • @statquest
      @statquest  4 ปีที่แล้ว

      I think there are two potential problems with your alternative phrasing. 1) To say "height contributes 19% of the variation in weight", or even " the height-weight relationship contributes 19% of the variation in the data" is to suggest that "height" causes 19% of the variation in "weight". This may be true, but it might not. Since correlation doesn't mean causation, you could run into trouble here.
      2) The other thing is about using non-standard terminology that is the reverse of what most people use - this can lead to confusion and unexpected consequences. So you could run into trouble here as well.

  • @ahmadulfijihaddzulqornain6790
    @ahmadulfijihaddzulqornain6790 2 ปีที่แล้ว

    Hello, as an statistics student im so glad this channel exist. I was thinking, can i make an Indonesian version of this channel so i can share my knowledge about statistic for Indonesian Student? Because, most of us struggling understanding at class in this covid era 😆. Thank you!

    • @statquest
      @statquest  2 ปีที่แล้ว

      If you would like to make Indonesian subtitles, you can contact me through my website and we can work something out: statquest.org/contact/

  • @sebastianc09
    @sebastianc09 2 ปีที่แล้ว +1

    I'm truly grateful for your videos!

  • @sweetheart.nikkilee430
    @sweetheart.nikkilee430 4 ปีที่แล้ว +2

    wow this is the best explanation ive ever seen, thank u!!!!!

    • @statquest
      @statquest  4 ปีที่แล้ว

      Thank you very much! :)

  • @kwanpakshing
    @kwanpakshing 3 ปีที่แล้ว +1

    The best explaination I can find

  • @jiayoongchong2606
    @jiayoongchong2606 4 ปีที่แล้ว +1

    6:05 explains R^2 accounts for the variation of relationships

  • @xruan6582
    @xruan6582 4 ปีที่แล้ว +1

    (8:30) R^2 is square of R only when you are fitting a linear regression line. Apparently, the square relationship does not hold for regressions with quadratic term(s).

    • @statquest
      @statquest  4 ปีที่แล้ว

      That is correct, because normal correlation is only defined for straight lines.

  • @footballistaedit25
    @footballistaedit25 2 ปีที่แล้ว +1

    Thanks for sharing, Sir. It helps me a lot

    • @statquest
      @statquest  2 ปีที่แล้ว +1

      Glad to hear that!

  • @carbon273
    @carbon273 4 ปีที่แล้ว +1

    Zedstatistics coupled with statQuest is just absolutely magnifique

  • @isa..333
    @isa..333 ปีที่แล้ว +1

    this is bizarelly useful for my exam tomorrow

    • @statquest
      @statquest  ปีที่แล้ว +1

      Good luck! :)

  • @PeterXLuo
    @PeterXLuo 3 ปีที่แล้ว +1

    best video explaining R and R squared ever!

  • @koonsickgreen6272
    @koonsickgreen6272 ปีที่แล้ว +1

    Dude..this friggin rocks.. THAKS YOU!!!!!

  • @junepark9591
    @junepark9591 ปีที่แล้ว +1

    Beutifully explained. Thank you so much.

  • @sudhitiwaridwivedi3096
    @sudhitiwaridwivedi3096 5 ปีที่แล้ว +2

    Undoubtedly d best and to the point explanation. Thanks a lot

  • @busyshah
    @busyshah 4 ปีที่แล้ว +3

    You know only a level of mastery can achieve this level of ease.

  • @unlearningcommunism4742
    @unlearningcommunism4742 3 ปีที่แล้ว +1

    I've stared binge watching the entire channel :D

    • @statquest
      @statquest  3 ปีที่แล้ว

      BAM! :)

    • @unlearningcommunism4742
      @unlearningcommunism4742 3 ปีที่แล้ว

      @@statquest Do you know someone doing TDA (topological data analysis) / AYASDI Software?
      Man it's a good stuff...

    • @statquest
      @statquest  3 ปีที่แล้ว

      @@unlearningcommunism4742 I'll look into it.

    • @unlearningcommunism4742
      @unlearningcommunism4742 3 ปีที่แล้ว +1

      @@statquest Obscure stuff, my postdoc actually, but it makes everything - better. I've personally tested a lot of things with it from images, to soil samples, languages, FTIR spectra... And it gives the edge every time.

  • @benitshetty8492
    @benitshetty8492 4 ปีที่แล้ว +2

    Hi Josh,
    Is there any videos that explains the Degrees of Freedom? I find it difficult to understand this concept. Pls provide link if there is.

    • @statquest
      @statquest  4 ปีที่แล้ว +1

      Not yet. It's something I would love to do as soon as possible.

  • @AmosFolarin
    @AmosFolarin 5 ปีที่แล้ว +12

    Beautiful explanation :)

    • @statquest
      @statquest  5 ปีที่แล้ว +1

      Thank you! :)

  • @nithyashreevenkataraman3
    @nithyashreevenkataraman3 3 ปีที่แล้ว +1

    Thank you so much, I've learned so much from you in the past week! Very grateful

    • @statquest
      @statquest  3 ปีที่แล้ว

      Thank you very much! :)

  • @robhuntington8504
    @robhuntington8504 5 ปีที่แล้ว +4

    Why do most textbooks suck so bad at explaining statistics?? This was very helpful.

    • @statquest
      @statquest  5 ปีที่แล้ว +2

      Thank you!!! :)

    • @JasTheKariol
      @JasTheKariol 5 ปีที่แล้ว

      because they think that teaching is related to sniffing a rock

  • @gbchrs
    @gbchrs 3 ปีที่แล้ว +1

    woah thanks for this one too Josh! finally gets R2

    • @statquest
      @statquest  3 ปีที่แล้ว

      Triple bam! :)

  • @aamuz1cool
    @aamuz1cool 6 ปีที่แล้ว +2

    Explanation of R squared was perfect and thanks for that. Plain old R is just a square root of R squared. I believe you don't mean pearson correlation coefficient(r) as plain old R.

    • @statquest
      @statquest  6 ปีที่แล้ว +1

      Exactly. The other two common measures of correlation are Spearman's Rank, which is denoted with the greek character "rho", not "r", and Kendall's Rank, which is denoted with the greek character "tao", not r.

    • @kikokimo2
      @kikokimo2 5 ปีที่แล้ว +1

      What is this 'R' explained here then called? What is it's name? just "Correlation coefficient"?

    • @shvm_bh
      @shvm_bh 4 ปีที่แล้ว

      @@statquest whats the difference between the 3 (pearson, spearman and kendall Rank)?
      And which one did you talk about in this vide

    • @statquest
      @statquest  4 ปีที่แล้ว

      @@shvm_bh In this video the "r" (or plain old) is "Pearson's Correlation Coefficient". R-squared is the square of that "r'. For more details about Pearson's Correlation Coefficient, see: th-cam.com/video/qtaqvPAeEJY/w-d-xo.html and th-cam.com/video/xZ_z8KWkhXE/w-d-xo.html
      Spearman and Kendall's correlations replace the actual measurements with ranks (so the largest measurement gets rank = 1, the second largest measurement gets rank = 2, etc.) and it looks to see if two samples have similar ranks.

  • @lilmoesk899
    @lilmoesk899 7 ปีที่แล้ว

    Super useful as always. Please continue with the videos (for example, prediction interval vs. confidence interval or maybe p-values vs. randomization tests or logistic regression...)! I liked your explanation because it never occurred to me that R^2 was basically the same as calculating percent change (diff/original)x100.

  • @gvtronrox4204
    @gvtronrox4204 5 หลายเดือนก่อน +1

    This is sooo good... wish i found this video earlier.

    • @statquest
      @statquest  5 หลายเดือนก่อน

      Glad you liked it!

  • @jonathanlee8755
    @jonathanlee8755 4 ปีที่แล้ว +2

    Amazing video! Thank you very much!

  • @ThomasHaberkorn
    @ThomasHaberkorn 4 ปีที่แล้ว +1

    One day I'm gonna binge watch all of your videos and make notes.. How long could it take?

    • @statquest
      @statquest  4 ปีที่แล้ว +1

      I think it would take a while! :)

  • @sub17was
    @sub17was 3 ปีที่แล้ว +1

    Legen"wait-for-it and yeah this guy make statistics feel so easy"dary

  • @mohitsrivastava5880
    @mohitsrivastava5880 4 ปีที่แล้ว +1

    Thanks for the simple explanation. Much appreciated.

    • @statquest
      @statquest  4 ปีที่แล้ว +1

      Thanks! :)

  • @MrClevermind
    @MrClevermind 4 ปีที่แล้ว +1

    Thank you, you're helping us with these videos.

    • @statquest
      @statquest  4 ปีที่แล้ว

      Thank you very much! :)

  • @terryliu3635
    @terryliu3635 4 ปีที่แล้ว +1

    Very good explanation, thanks!

    • @statquest
      @statquest  4 ปีที่แล้ว

      Thank you! :)

  • @scottcooke5641
    @scottcooke5641 3 ปีที่แล้ว +1

    Great videos! What if you get a model that has an ok correlation (with a very significant P-value) but a low R^2? Can this still be meaningful? IE. There is a statistically significant relationship between the two variables, however there are probably other correlated variables as well so this model is not good for making accurate predictions, but it does let us know these two variables are correlated which can still be useful?

    • @statquest
      @statquest  3 ปีที่แล้ว

      I'm not sure I understand your question because you can't have an OK correlation but a low R^2. The two are linked (the square root of R^2 is the correlation).

    • @scottcooke5641
      @scottcooke5641 3 ปีที่แล้ว

      @@statquest I think this is what I was trying to get at maybe?
      blog.minitab.com/en/adventures-in-statistics-2/how-to-interpret-a-regression-model-with-low-r-squared-and-low-p-values. Perhaps I have correlation and regression mixed up. I thought they were the same thing kind of. I'm trying to see if we can potentially have a model that is a poor predictor because there is tons of variation in the data but there is still a significant relationship between the dependent and independent variables? And this can still be useful?

    • @statquest
      @statquest  3 ปีที่แล้ว

      @@scottcooke5641 Yes, you can definitely have a small p-value and a small r^2 value - if you have enough data, that's what happens. So having a small p-value is not enough to say something is interesting or biologically relevant. You need to have a small p-value and a relatively large r^2 for the result to be interesting. I talk about this is one of my other videos but I can't remember which one. That said, if you want to learn about regression, check this out: th-cam.com/video/nk2CQITm_eo/w-d-xo.html

  • @VivaldiHeroes
    @VivaldiHeroes 4 ปีที่แล้ว +1

    Hey, when you say the variation around the line can never be greater than the variation around the mean, why is that? What if the model (line) is very misspecified?

    • @statquest
      @statquest  4 ปีที่แล้ว +1

      When I made this video, I was only thinking in terms of linear regression, which will not fit a line worse than the mean to the data. This is because if the values along the x-axis are truly useless in terms of predicting y-axis values, then the slope of the line used to make predictions will be 0, and the intercept will equal the mean. However, it is possible to simply draw a line that fits the data worse than the mean and get a negative R^2.

    • @VivaldiHeroes
      @VivaldiHeroes 4 ปีที่แล้ว +1

      @@statquest thx for getting back to me. Makes sense! And I've seen this assumption elsewhere. I was curious because throughout those videos you don't make assumptions about inferring the line (e.g. OLS)

    • @statquest
      @statquest  4 ปีที่แล้ว +2

      @@VivaldiHeroes It's good you checked!

  • @guthrie_the_wizard
    @guthrie_the_wizard 3 ปีที่แล้ว +1

    Thanks very much! Your videos rock. Great pacing, excellent points.

    • @statquest
      @statquest  3 ปีที่แล้ว

      Thank you very much! :)

  • @ricardoagnelo2995
    @ricardoagnelo2995 5 ปีที่แล้ว +5

    Good explanation and your videos really are funny... good job

    • @statquest
      @statquest  5 ปีที่แล้ว +1

      Hooray! I'm glad you like the videos and my silly jokes. ;)

  • @mishtimaithli
    @mishtimaithli 2 ปีที่แล้ว +1

    phew...!!! finally this concept is clear in my head :) thank you sooo much

    • @statquest
      @statquest  2 ปีที่แล้ว

      Glad it helped!

  • @bradleylignoski6887
    @bradleylignoski6887 10 หลายเดือนก่อน +1

    I love your videos, and I have a question. My understanding is that "variation" is not a technical term. I'm pretty sure it doesn't refer to a number and it has different meanings in different contexts.
    When Josh says "variation" in the video, does he mean variance? I find this confusing. I would love it if someone would clarify this for me.

    • @statquest
      @statquest  10 หลายเดือนก่อน

      In this video I use the terms "variation" and "variance" interchangeably. This is a fairly common practice, but I'm sorry if it was confusing.

  • @tanyofish
    @tanyofish 5 ปีที่แล้ว +1

    It’s my first time to learn R and R^2. I’m not quite sure why R2 is easier and more intuitive than R. I understand the case of your example that R=0.7 and R=0.5. But what about when R=0.5 and R=0.25, R2s become not easier to compare. Should I see other videos first to understand this? Thanks a lot!

    • @statquest
      @statquest  5 ปีที่แล้ว +2

      You give a good example for why R^2 is easier to interpret than R. When R=0.5, then R^2=0.25, and the model accounts for 25% of the variance. When R=0.25, then R^2=0.0625, and the model accounts for 6.25% of the variance. Thus, when R=0.5, the model accounts for 0.25/0.0625=4 times as much variance. As you just saw, the math was much easier with R^2 and far less obvious with R.

    • @tanyofish
      @tanyofish 5 ปีที่แล้ว +1

      @@statquest gotchya. I thought "easier" meant even ratio (0.7/0.5 vs 0.49/0.25). Thank you!