R-squared or coefficient of determination | Regression | Probability and Statistics | Khan Academy

แชร์
ฝัง
  • เผยแพร่เมื่อ 22 ต.ค. 2024

ความคิดเห็น • 148

  • @alleyygal
    @alleyygal 11 ปีที่แล้ว +102

    I think I've fallen in love with Khan. We've been through everything together; Statistics, Chemistry, you name it.

  • @MrKevlax
    @MrKevlax 11 ปีที่แล้ว +101

    and after all of the money that I spent on education... I just learned more without any pants on at home.

    • @10thpass93
      @10thpass93 4 ปีที่แล้ว +1

      hy i am from india,i also maths teacher

    • @ayushsoni786
      @ayushsoni786 4 ปีที่แล้ว +7

      @@10thpass93 to?? kya??

    • @bennyblanco3322
      @bennyblanco3322 2 ปีที่แล้ว

      You watched video after your wedding night

  • @oscarbergman7173
    @oscarbergman7173 8 ปีที่แล้ว +392

    Khan "Let me do this in an other color" Academy

    • @Cash4Fruit
      @Cash4Fruit 7 ปีที่แล้ว +2

      XD

    • @dakshsagar2839
      @dakshsagar2839 7 ปีที่แล้ว +2

      This is EPIC !

    • @youtubeaccount0x073
      @youtubeaccount0x073 4 ปีที่แล้ว +3

      Oscar Bergman another*

    • @NM-vw6xq
      @NM-vw6xq 3 ปีที่แล้ว +1

      Lol all joking aside, the colors help visualize it a lot better since there's usually a ton of stuff up on the "board" by the end of the video.

  • @NM-vw6xq
    @NM-vw6xq 3 ปีที่แล้ว +25

    Thank you Sal! This was the best explanation of the R squared error I've seen ... makes a LOT more sense than just machine learning courses saying "its a percentage of how much it explains the variability of the data". This shows why that is, brilliant. Such a simple concept when broken down and explained visually step-by-step. You and 3B1B (also a former Khan Academy educator) are amongst the best instructors on YT.

    • @satishchandramedi9729
      @satishchandramedi9729 3 ปีที่แล้ว

      Had you watched Applied AI course video on R squared error and came here to understand it in depth?

  • @mevlutsoylu7481
    @mevlutsoylu7481 8 ปีที่แล้ว +52

    This video "literally" saved my butt. Thank you very much.

  • @angelbythewings
    @angelbythewings 3 ปีที่แล้ว +3

    Sal makes no presumptions about students coming to watch his lectures, and that's what has been helping me out. I have never met an educator better than him in my entire life, PERIOD

  • @benjamcg
    @benjamcg 2 ปีที่แล้ว +2

    1:03 What defines the error?
    1:20 What defines total square error?
    1:37 How y = (mx+b) relates to the error.
    2:55 "What % of the variation in y describes the variation in x?
    6:30 What defines the squared error of the mean of y? (end)
    6:59 What is not described by the variation in x?
    12:38 What is the coefficient of determination, aka R²? (end)
    10:23 What does R² say about the regression line?

  • @chocolatebubbles91
    @chocolatebubbles91 12 ปีที่แล้ว +2

    thank god for him because my own math teacher has absolutely no idea what he's teaching. he's not a statistician, it's his first year teaching stats, so i don't blame him.. but i'd be lost without sal & his vids. thanks again!

    • @nocturnalvisionmusic
      @nocturnalvisionmusic 2 ปีที่แล้ว

      Don't think I've ever heard the term *statistician* before. Thanks :D

  • @skulqerX
    @skulqerX 3 ปีที่แล้ว +2

    Mr khan , youve been with me since i was at secondary school . And now im trying to understand how to do my papers and statistics, suprised to still see there is so much i can learn from you . Thank you for all of ur lectures.

    • @shubhanshuagarwal901
      @shubhanshuagarwal901 3 ปีที่แล้ว

      Can u explain what is the meaning of line " % not explained by regression line" as that ratio regression/mean error

  • @RajvimalCrystal
    @RajvimalCrystal 6 ปีที่แล้ว +1

    Khan you are prodigy!!! only prodigy can teach as clear as crystal

  • @coconutjoy
    @coconutjoy 11 ปีที่แล้ว +1

    Best explanation ever, imagine how many students around the world you have enriched!

  • @NirmalKumar-sc5tk
    @NirmalKumar-sc5tk 5 ปีที่แล้ว +2

    Thank you Khan Academy for clearing my doubts in so many topics.

  • @boredomgotmehere
    @boredomgotmehere ปีที่แล้ว

    Always enjoy the explanations of Khan Academy. Precise, succinct and. Intuitively explained.

  • @Drew0Ranger
    @Drew0Ranger 2 ปีที่แล้ว

    I am glad I watched this, I am glad...glad..I am glad I watched this.

  • @fridaylovebird
    @fridaylovebird 11 ปีที่แล้ว +3

    Note potential confusion: "SE" in this video is the (sum of the) Squared Errors. However, usually "s.e." is standard error. Since, technically, we NEVER observe errors (because they are a function of the true regression line--we can estimate residuals, though), what is called "SE" here is actually the Sum of the Squared Residuals (SSR in some texts). I am torn whether to recommend this video to my stats students because of this likely confusion.

    • @numericalmethodsguy
      @numericalmethodsguy 6 ปีที่แล้ว

      It is the sum of the squares of the residuals, not errors.

  • @totalinsight9888
    @totalinsight9888 7 ปีที่แล้ว +4

    Wonderful Mr. Khan. You have made this problematic concept seem so simple and 'intuitive'. Keep up this GOOD WORK.

  • @georgedestino4788
    @georgedestino4788 3 ปีที่แล้ว +2

    9:58 r squared is the Coefficient of determination

  • @stephenaloia6695
    @stephenaloia6695 3 ปีที่แล้ว

    Brilliantly simple and straightforward. So good.

  • @dychui
    @dychui 11 ปีที่แล้ว +1

    Khan you are my statistics hero!

  • @familycohen4758
    @familycohen4758 2 ปีที่แล้ว

    Wonderful explanation. Thank you so much!

  • @jerilphilip4835
    @jerilphilip4835 4 ปีที่แล้ว +4

    Sal the saviour of pilgrims seeking conceptual clarity, as always.

  • @sayak
    @sayak 3 ปีที่แล้ว

    thanks this was clear explanation

  • @dychui
    @dychui 10 ปีที่แล้ว +8

    Great video. I really appreciate it.

  • @ibrahimbarro2814
    @ibrahimbarro2814 4 ปีที่แล้ว

    Man this saved my life. Thanks so much

  • @MeghaShankarArt
    @MeghaShankarArt 5 ปีที่แล้ว +1

    Thankyou so much for this video! Wonderfully explained!

  • @xgutierrezc
    @xgutierrezc 4 ปีที่แล้ว +2

    I loved the explanation. Came here trying to understand the r2_score function in Python.

  • @goodnesskabelo
    @goodnesskabelo 5 ปีที่แล้ว +1

    I wish everyone else would like and comment after enjoying a video, these likes and comments motivate us when we are in panic to quickly click into the video and cram before exam

  • @andattam
    @andattam 6 ปีที่แล้ว +29

    When you say " described by the variation in X," you should just write "described by the line," rather than X. It's really confusing for the rest of the video for those who didn't catch that.

    • @henryksarat
      @henryksarat 5 ปีที่แล้ว +2

      An Luu this comment helped a lot thank you

    • @10thpass93
      @10thpass93 4 ปีที่แล้ว +1

      i am maths teacher at india. i want to help students in maths subject

  • @NamalIndrajith-MCP
    @NamalIndrajith-MCP 4 ปีที่แล้ว

    superb explanation, understood well

  • @keinsent
    @keinsent 6 หลายเดือนก่อน

    THANKYOU

  • @bryanporinski5308
    @bryanporinski5308 4 ปีที่แล้ว

    Brilliant teaching!

  • @Chris-ng9zi
    @Chris-ng9zi ปีที่แล้ว +3

    Excellent explanation! But why is the Coefficient of Determination equal to r _squared vs some other quantity? Why not r- cubed or ( r_squared -1)?

    • @beinghimself
      @beinghimself ปีที่แล้ว

      Did u find the answer?

  • @shehlamalik5392
    @shehlamalik5392 3 ปีที่แล้ว

    Your voice is so sweet.

  • @ansariarbaz3374
    @ansariarbaz3374 3 ปีที่แล้ว

    just one word, awesome!!

  • @yusufnzm
    @yusufnzm 2 ปีที่แล้ว +2

    Why sesubline/sesuby is "What % of variation is not described by the Regression line " instead of "described by"?

    • @19_088SHOVONSHARMA
      @19_088SHOVONSHARMA 10 หลายเดือนก่อน

      buddy,i had this problem too. but think it like this.when we talk about the "proportion of unexplained variation," we mean the fraction of the total variability that is not captured or explained by the regression model which means the error or SSE(more fancy term XD). hope this helps u.

  • @andrerossa8553
    @andrerossa8553 5 ปีที่แล้ว

    Great video. Tks

  • @anesdj4282
    @anesdj4282 9 ปีที่แล้ว +3

    There is no easier way to explain regression as simple as that. thanks

  • @yileiwu4040
    @yileiwu4040 6 ปีที่แล้ว +2

    Thanks,I really appreciate it.

  • @jaskaransingh0304
    @jaskaransingh0304 ปีที่แล้ว

    Just awesome

  • @kennguyen1066
    @kennguyen1066 5 ปีที่แล้ว

    Very nicely explained!

  • @nglazgow
    @nglazgow 5 ปีที่แล้ว

    Thanks!

  • @studyselection2881
    @studyselection2881 2 ปีที่แล้ว

    Good channel.

  • @yangzhou981
    @yangzhou981 10 ปีที่แล้ว

    great video,many thanks

  • @UZPvNUCaaQdF
    @UZPvNUCaaQdF ปีที่แล้ว

    I love you, man.

  • @annashchukina3768
    @annashchukina3768 5 ปีที่แล้ว +1

    Thank you for video, it was really helpful. I have a question.
    What if we have data, which we approximated well with a line L, but L is almost horizontal and close to y=mean(y). The ratio of SE(line) to SE(y) is supposed to be close to 1, thus r^2 is close to 0, although L is a good fit.

  • @rushikeshkalkar5162
    @rushikeshkalkar5162 ปีที่แล้ว

    Sal Sir = LEGEND🙌

  • @bhoomeendra
    @bhoomeendra 6 ปีที่แล้ว +1

    GREAT EXPLANATION !! the concept is very clean in head now

  • @serenalizinnqui8474
    @serenalizinnqui8474 3 ปีที่แล้ว

    How do we know that the amount of variation in y not caused by variation in x is SE_line -SE_ymean? What does the difference in errors between the two lines have to do with anything?

  • @innovation2440
    @innovation2440 8 ปีที่แล้ว +14

    What is "variation described by the line" mean? Can someone help explain please..

    • @李映潔-l1w
      @李映潔-l1w 8 ปีที่แล้ว +1

      Inno Vation needing this also

    • @spacedustpi
      @spacedustpi 6 ปีที่แล้ว +1

      He is referring to the diagonal line. Often called the regression line. Represented by mx+b. The variation of the line is equal to the sum of all the (y-(mx + b))^2 equations he wrote up there in the video.

    • @Luiferhoyos
      @Luiferhoyos 6 ปีที่แล้ว +23

      If all of the points were exactly on the regression line, then the regression line would explain all of the variation. In other words, SE(line) would be zero. Anywhere that the points do NOT lie on the line is variation that is undescribed by the regression.
      For example, a regression line might show that for each hour a student studies, he gets 10% higher on a test. 5 hours earns a 50, 6 hours earns a 60, 7 hours earns a 70. If a student studied for 8 hours and only got a 75, that variation between 75 and the expected 80 would not be described by the regression line and would add to the SE(line).
      Taken from this comment: www.khanacademy.org/video/r-squared-or-coefficient-of-determination?qa_expand_key=kaencrypted_c228fca3982c98caf8466db5f21d3293_97ad4ce3e39b15af2b59bffb1589cc545a0a13b897657bf6c9694237018a45585fa77017167815c3fb5868e6cd97f5a15499b0ead13d4f8bdae6f9823d7f95f6f9c067fbf53f053b72e5221cd3341c5f842cbd96c5140b0868e8e68a46855aa5af32a4cc1fe19ffd136df5b5a492e8a83fd3a28e20965323f00c242a29d98de4c9bf954e71ca493b1e661f7d719a521c5a74b8710cac5e6b8b9f268a9170fad433006065db869f0da574bb11fdd142868bff9ffd8014d6433eebdcfc59aec0e9

    • @razorblake393
      @razorblake393 4 ปีที่แล้ว +2

      How far the point are from the line

  • @chris7toronto
    @chris7toronto 14 ปีที่แล้ว

    You are the best!

  • @riderblack6401
    @riderblack6401 6 ปีที่แล้ว

    Great video! Thanks sal!

  • @pdwivedi1990
    @pdwivedi1990 9 ปีที่แล้ว

    Great tutorial..

  • @razorblake393
    @razorblake393 4 ปีที่แล้ว +1

    What do you mean by "not described by the variation in x"

  • @pradeepkumar9586
    @pradeepkumar9586 6 ปีที่แล้ว

    khan acadamy rocks..

  • @JuSpeJule
    @JuSpeJule 8 ปีที่แล้ว +2

    Oh my god I finally got this! Thank you so much. x

  • @atuntalib9474
    @atuntalib9474 8 ปีที่แล้ว +3

    i dont hv basic in statistic but need to understand this for managerial economic. what do you mean by mean-of-y? is that the average for all the Y?

  • @produdeyay
    @produdeyay 4 ปีที่แล้ว

    I don't understand the need to interchange "describe by variation in X" and "describe by the line", are they the same thing?

  • @bhavindhedhi
    @bhavindhedhi 6 ปีที่แล้ว

    The error that you are referring to [ 0:58] is really residuals, right? just clearing things up.

  • @changw_w9413
    @changw_w9413 4 หลายเดือนก่อน

    Why "how much of total variation is not described by the regression line" = SE(line)? Isn't SE(line) describes the total variation for the regression line? I'm confused by this sentence? Could anyone help to enlighten me? Thanks :)

  • @stefanobourscheid8255
    @stefanobourscheid8255 7 ปีที่แล้ว +2

    I read somewhere that Rˆ2 may be negative, I don't understand how since it is essentially a %

    • @FoxxMulderr
      @FoxxMulderr 7 ปีที่แล้ว +2

      that's an adjusted or predicted R2, because you can't have negative with regular R2.

  • @rodaf2
    @rodaf2 9 ปีที่แล้ว +1

    Minor correction: it should be 'fraction' instead of 'percentage' at several parts of the video.

  • @AsgharKhan-fd1ul
    @AsgharKhan-fd1ul 12 ปีที่แล้ว

    KHAN IS KING

  • @ChaceBonanno
    @ChaceBonanno 4 ปีที่แล้ว

    Gotta get those Ms and Bs

  • @rutger5000
    @rutger5000 5 ปีที่แล้ว

    Just checking you can't get postive R^2 values higher than 1 right?

  • @TriNguyen-xi8ji
    @TriNguyen-xi8ji 11 ปีที่แล้ว +2

    I love you. you saved my life, keep up the good work :D

  • @Lucyferandtheson003
    @Lucyferandtheson003 ปีที่แล้ว

    How did he estimate the regression line?? Did he just guess? Or he used some technique?

  • @SandeepAcademy9
    @SandeepAcademy9 11 ปีที่แล้ว +1

    Can anyone tell me, How SEline (total square error) gives the total variation not described by regression line?
    Also, How does the regression line describe the variation in x?

  • @GThinks
    @GThinks 9 ปีที่แล้ว

    what do "the variation of y" and "the variation of y described by the variation of x" mean again.. ?

  • @aayishatabassum9174
    @aayishatabassum9174 7 หลายเดือนก่อน

    what does variation in x mean exactly ?

  • @mawais2560
    @mawais2560 4 ปีที่แล้ว

    what are possible interpretations and justifications for low r square values in management sceince?

  • @ernestamoore4385
    @ernestamoore4385 8 ปีที่แล้ว +1

    How do you estimate the significance of the betas?

  • @sera_venus
    @sera_venus 5 ปีที่แล้ว

    maybe i should have studied for my midterm this week

  • @mdnguyen287
    @mdnguyen287 13 ปีที่แล้ว

    thank you,it is extremely easy to understand what you are saying, absolutely different from my lecture notes. why can't you be my professor? :)

  • @kn0wled9e
    @kn0wled9e 13 ปีที่แล้ว

    thank u so much sir....u r awsome

    • @unhealthyavacado
      @unhealthyavacado 2 ปีที่แล้ว

      dude it's been 10 years since you wrote this. are you still alive :D

  • @user-zr8vc8nw7e
    @user-zr8vc8nw7e ปีที่แล้ว

    What is it meant by "describe"? "percentage of variation described by percentage of error"?" DESCRIBED"?! what's that mean?

  • @jgeorgiou94
    @jgeorgiou94 9 ปีที่แล้ว +87

    this guy must have repetitive strain injury by now repetitive strain injury by now. by now.

    • @junehurt4897
      @junehurt4897 8 ปีที่แล้ว +3

      +Jonathan Georgiou Makes it harder and harder for me to watch Khan Academy math videos. They are good at explaining, but the lecturer's repetitive phrases drive me nuts!

    • @carlos_f
      @carlos_f 8 ปีที่แล้ว +46

      maybe there's people watching these videos that aren't as fluent in english as you are mate so by repeating phrases those peolpe might have another chance to catch the meaning of the phrase.. my 2c

    • @riderblack6401
      @riderblack6401 6 ปีที่แล้ว +3

      It's not his problem, sometimes it is that you don't understand the math.

    • @jazzhopkid
      @jazzhopkid 6 ปีที่แล้ว +5

      Repetition is key to memory.

  • @adam2073
    @adam2073 5 ปีที่แล้ว +1

    Why does he refer to the "variation of X" and the "the line" as the same thing ?

  • @harshitjain7428
    @harshitjain7428 4 ปีที่แล้ว

    that R is pearson's coefficient right? don't we denote that with 'r' instead of 'R'?

  • @hectichive889
    @hectichive889 4 ปีที่แล้ว

    I still don’t understand why in the world you would use the mean of y instead of just the residual from the regression line?

  • @shannonmccosky6258
    @shannonmccosky6258 8 ปีที่แล้ว

    What program is used to take these notes?

  • @mishuga
    @mishuga 14 ปีที่แล้ว

    I'm with v2prc. This is the first statement in the playlist that has me confused. Help would be much appreciated.

  • @PetStuBa
    @PetStuBa 6 ปีที่แล้ว +3

    I think it's n-1 ... not n ... there are n-1 degrees of freedom .. (5:57)

    • @mindloop4070
      @mindloop4070 2 ปีที่แล้ว

      he is not calculating variance here just to compare with population..just a avearage Squared error so n makes sense

    • @PetStuBa
      @PetStuBa 2 ปีที่แล้ว

      @@mindloop4070 yeah good point

  • @jianzhao8986
    @jianzhao8986 8 ปีที่แล้ว

    What's the relationship btw r and r squared?

  • @t2hmn
    @t2hmn 13 ปีที่แล้ว

    @hapolian He is probably writing with a tool called graphic tablet.

  • @vana-t7u
    @vana-t7u 2 ปีที่แล้ว

    love you

  • @SimAlgyaxSaTun
    @SimAlgyaxSaTun 3 ปีที่แล้ว

    Sal is the GOAT

  • @willsonbasyal7883
    @willsonbasyal7883 2 ปีที่แล้ว +1

    khaaaaaaaaaannnnnn

  • @bunnybunny1988
    @bunnybunny1988 6 ปีที่แล้ว

    how to write on the computer screen like this?

  • @ldpy33
    @ldpy33 4 ปีที่แล้ว

    So, they both tell us how strong the relationship is between two variables?!

    • @ldpy33
      @ldpy33 4 ปีที่แล้ว

      Here's the question: IS THE ANSWER E?
      Which of the following tells us how strong the relationship is between two variables?
      a) the slope of a line
      b) the intercept of a line
      c) the coefficient of determination
      d) the coefficient of correlation
      e) both C and D are correct

  • @ajufsd
    @ajufsd 9 ปีที่แล้ว

    Thank you.. but I badly need the notes just made :p

  • @santiagorozo2530
    @santiagorozo2530 5 ปีที่แล้ว +2

    If you play this a 1.5x speed he sounds like Jeff Goldblum

  • @ansonlai9153
    @ansonlai9153 5 ปีที่แล้ว

    why are you so good in explaining like the how the fk...

  • @wencerich4550
    @wencerich4550 3 ปีที่แล้ว

    I don't know where 30% obtain and 1 in the formula for sure am not understand

  • @fusionplaysgames
    @fusionplaysgames 8 หลายเดือนก่อน

    Me learning after 13 years

  • @Himashree_333
    @Himashree_333 2 ปีที่แล้ว

    That's not error that's residual

  • @sunilguptamaths
    @sunilguptamaths 4 ปีที่แล้ว

    Hy

  • @MrMr-en8fo
    @MrMr-en8fo 5 ปีที่แล้ว

    lecture Urdu ma dayn smjh n ari

  • @kanishka.b8550
    @kanishka.b8550 3 ปีที่แล้ว

    730K views and 2.3K likes? 0.o wut?

  • @sierraclark6129
    @sierraclark6129 3 ปีที่แล้ว +1

    “If you declare with your mouth “Jesus is Lord,” and believe in your heart that God raised Him from the dead, you will be saved” (Romans 10:9). Now is the time to accept Jesus as your personal Lord and Savior. Obey His commands and repent of your sins because Jesus is coming back soon. Tomorrow isn’t promised.

  • @markchristophergemzon1052
    @markchristophergemzon1052 4 ปีที่แล้ว

    Who is here because of machine learning?