Covariance vs Correlation with simple data | Covariance vs Correlation Coefficient

  • Published on 21 Jul 2024
    #CovarianceVSCorrelation #UnfoldDataScience
    Hello,
    My name is Aman and I am a Data Scientist.
    About this video:
    In this video, I explain covariance and correlation. This is an important statistics concept to know, so I have explained the difference between correlation and covariance using a simple dataset. The following topics are covered in this video:
    1. Covariance vs Correlation with simple data
    2. Covariance vs Correlation Coefficient
    3. What is the difference between correlation and covariance
    4. Understanding Correlation vs Covariance
    5. How is covariance different from correlation
    About Unfold Data Science: This channel helps people understand the basics of data science through simple examples. Anybody without prior knowledge of computer programming, statistics, machine learning, or artificial intelligence can get a high-level understanding of data science through this channel. The videos are not very technical, so viewers from different backgrounds can grasp them easily.
    If you need Data Science training from scratch, please fill in this form (Please note: training is chargeable)
    docs.google.com/forms/d/1Acua...
    Book recommendation for Data Science:
    Category 1 - Must Read For Every Data Scientist:
    The Elements of Statistical Learning by Trevor Hastie - amzn.to/37wMo9H
    Python Data Science Handbook - amzn.to/31UCScm
    Business Statistics By Ken Black - amzn.to/2LObAA5
    Hands-On Machine Learning with Scikit Learn, Keras, and TensorFlow by Aurelien Geron - amzn.to/3gV8sO9
    Category 2 - Overall Data Science:
    The Art of Data Science By Roger D. Peng - amzn.to/2KD75aD
    Predictive Analytics By Eric Siegel - amzn.to/3nsQftV
    Data Science for Business By Foster Provost - amzn.to/3ajN8QZ
    Category 3 - Statistics and Mathematics:
    Naked Statistics By Charles Wheelan - amzn.to/3gXLdmp
    Practical Statistics for Data Scientist By Peter Bruce - amzn.to/37wL9Y5
    Category 4 - Machine Learning:
    Introduction to machine learning by Andreas C Muller - amzn.to/3oZ3X7T
    The Hundred Page Machine Learning Book by Andriy Burkov - amzn.to/3pdqCxJ
    Category 5 - Programming:
    The Pragmatic Programmer by David Thomas - amzn.to/2WqWXVj
    Clean Code by Robert C. Martin - amzn.to/3oYOdlt
    My Studio Setup:
    My Camera : amzn.to/3mwXI9I
    My Mic : amzn.to/34phfD0
    My Tripod : amzn.to/3r4HeJA
    My Ring Light : amzn.to/3gZz00F
    Join Facebook group :
    groups/41022...
    Follow on medium : / amanrai77
    Follow on quora: www.quora.com/profile/Aman-Ku...
    Follow on twitter : @unfoldds
    Get connected on LinkedIn : / aman-kumar-b4881440
    Follow on Instagram : unfolddatascience
    Watch Introduction to Data Science full playlist here : • Data Science In 15 Min...
    Watch Python for data science playlist here:
    • Python Basics For Data...
    Watch statistics and mathematics playlist here :
    • Measures of Central Te...
    Watch End to End Implementation of a simple machine learning model in Python here:
    • How Does Machine Learn...
    Learn Ensemble Model, Bagging and Boosting here:
    • Introduction to Ensemb...
    Build Career in Data Science Playlist:
    • Channel updates - Unfo...
    Artificial Neural Network and Deep Learning Playlist:
    • Intuition behind neura...
    Natural Language Processing playlist:
    • Natural Language Proce...
    Understanding and building recommendation system:
    • Recommendation System ...
    Access all my codes here:
    drive.google.com/drive/folder...
    Have a different question for me? Ask me here : docs.google.com/forms/d/1ccgl...
    My Music: www.bensound.com/royalty-free...

Comments • 186

  • @saikiranreddymekala1346
    @saikiranreddymekala1346 1 year ago +5

    In the formula for cov(x, y), the denominator is N-1. Can you please correct this, sir?

    • @UnfoldDataScience
      @UnfoldDataScience 1 year ago

      I will pin this on top of the video.

    • @pkavenger9990
      @pkavenger9990 1 year ago +10

      Actually, if you are applying this formula to a sample, the denominator is N-1; if you are applying it to a population, it is N.

    • @vaibhavpandey7398
      @vaibhavpandey7398 1 year ago +1

      @@pkavenger9990 thanks
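
The sample-versus-population distinction raised in this thread can be sketched in a few lines of Python. This is a minimal illustration, not the code from the video: the function name `covariance`, its `sample` flag, and the data are all assumptions made for the example.

```python
def covariance(xs, ys, sample=True):
    """Covariance of two equal-length sequences.

    sample=True divides by N-1 (sample covariance);
    sample=False divides by N (population covariance).
    """
    n = len(xs)
    if n != len(ys):
        raise ValueError("xs and ys must have the same length")
    if n < 2:
        raise ValueError("need at least two observations")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of products of deviations from each mean.
    cross = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    return cross / (n - 1 if sample else n)


# Hypothetical data, not taken from the video.
x = [1, 2, 3, 4, 5]
y = [2, 4, 6, 8, 10]

print(covariance(x, y, sample=True))   # divides by N-1
print(covariance(x, y, sample=False))  # divides by N
```

The only difference between the two calls is the divisor, so the sample value is always the larger of the two in magnitude.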

  • @prasadgpa6813
    @prasadgpa6813 3 years ago +5

    You nail it in every one of your videos. You sell simplified knowledge. Keep it up and may God bless you. Can't wait for more such videos :)

  • @user-pb6pt4rw1l
    @user-pb6pt4rw1l 1 year ago

    Amazing video! Such simple explanation, you earned a loyal subscriber today :)

  • @SomnathGupta-gu2rm
    @SomnathGupta-gu2rm 8 months ago +1

    You are a Genius Sir. Thank You So Much for making these Concepts Simple and Lucid. May God Bless You 🙏🙏💐💐

  • @gaytriray7019
    @gaytriray7019 3 years ago +10

    You are amazing at what you do! Your passion and dedication is beyond words! Thankyou so much sir.

    • @UnfoldDataScience
      @UnfoldDataScience 3 years ago +1

      Thanks Gayatri for motivating me through your comment.

  • @niii5715
    @niii5715 1 year ago

    How calm you are while talking, man. Excellent explanation! Hats off 🙇‍♂

  • @karthikaanilkumar
    @karthikaanilkumar 1 year ago

    IT'S REALLY AN EASY CLASS AND THE WAY OF YOUR PRESENTATION IS GREAT.

  • @omdodmani3205
    @omdodmani3205 10 months ago

    Loved it 😊! you made it very easy to understand thanks !

  • @sexycurse
    @sexycurse 2 years ago

    Amazing explanation brother..Good Job..👍👍👍

  • @best_movies3736
    @best_movies3736 2 years ago

    You are simply the best! The MasterBlaster in Data Science

  • @kavyanagesh8304
    @kavyanagesh8304 1 year ago

    You are the best teacher! Got goosebumps while listening to your lecture. Thank you so much!

    • @UnfoldDataScience
      @UnfoldDataScience 1 year ago

      Thanks Kavya. Pls share with friends as well. Keep learning

  • @yoharihernandez
    @yoharihernandez 2 years ago

    Thank you so much! it was hard for me to understand this concept until I found this video. Please keep doing more videos!

  • @andresgrtz
    @andresgrtz 1 year ago

    Thank you! Great teacher!

  • @ayushagarwal7284
    @ayushagarwal7284 1 year ago

    Awesome video sir...Keep shining😀

  • @soheilaahmadi4807
    @soheilaahmadi4807 2 years ago +1

    It was amazing. Happy to find your tutorials on YouTube.

  • @pokejishnu
    @pokejishnu 3 years ago +1

    Amazing explanation Aman bhai ... Made it sound so simple. Thanks this helps.

  • @Nannyhere
    @Nannyhere 1 month ago

    Just watched it before my exam and understood the concept in one go ❤ Keep making such videos, sir 🥰🥰

  • @siddheshbhalerao3013
    @siddheshbhalerao3013 1 year ago

    Thank you sir for clean explanation.

  • @ketakishitut2713
    @ketakishitut2713 1 year ago

    Very well explained, thanks

  • @sameerpandey5561
    @sameerpandey5561 3 years ago +1

    Beautifully explained!!...Thanks for such content

  • @anaskhan4841
    @anaskhan4841 2 years ago

    Thank you Aman❤

  • @vidyaanvekar
    @vidyaanvekar 2 years ago +1

    Thanks for the wonderful explanation. You made my understanding very concrete.

  • @user-qd2dp8ru9w
    @user-qd2dp8ru9w 1 month ago

    V good, thanks

  • @haidiazaman3326
    @haidiazaman3326 3 years ago

    fantastic channel, def deserves more views

  • @shabiyaahlam3217
    @shabiyaahlam3217 2 years ago

    Helped me a lot!

  • @financenanchahal8801
    @financenanchahal8801 2 years ago

    Amazing explanation sir.....

  • @champabanerjee1208
    @champabanerjee1208 2 years ago

    You made it so clear....thank you so much 🙏❤️

  • @onuragmaji
    @onuragmaji 2 years ago

    Good work bro ur teaching style is very cool, keep posting such great content

  • @rishabhsheoran6959
    @rishabhsheoran6959 2 years ago +1

    Explained really well, brother! Good work!

    • @UnfoldDataScience
      @UnfoldDataScience 2 years ago +1

      Thanks Rishabh, please share it in your groups :)

  • @parol3271
    @parol3271 3 years ago

    Thanks a lot. A very detailed explanation. Great, my friend 🙏👍👌

  • @_itachi7904
    @_itachi7904 3 years ago

    after so much of head banging finally I understood covariance & correlation....thank you so much...

  • @ajaykumarsahoo1404
    @ajaykumarsahoo1404 3 years ago +6

    Hey, I have a doubt. When you change the value of a y variable from 32 to 48, will the mean remain the same? If it changes, how can you subtract the previous mean from the new value of y?

    • @UnfoldDataScience
      @UnfoldDataScience 3 years ago

      Let me check once.

    • @simplicityandchaos976
      @simplicityandchaos976 2 years ago +4

      Changing the value of the Y variable from 32 to 48 changes the overall mean of Y, so the previous mean should not be subtracted from the new value.
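
A small check of the point in this thread, as a sketch with made-up numbers (only the 32 and 48 come from the comments; the rest of the data and the helper names are assumptions). The mean of y does change when 32 becomes 48. For the covariance numerator specifically, the choice of the constant subtracted from y does not change the sum, because the deviations of x from its own mean add up to zero. The spread of y, which feeds the standard deviation used in correlation, does depend on using the updated mean.

```python
def cross_deviation_sum(xs, ys, y_center):
    """Covariance numerator: sum of (x - mean_x) * (y - y_center)."""
    mean_x = sum(xs) / len(xs)
    return sum((x - mean_x) * (y - y_center) for x, y in zip(xs, ys))


def squared_deviation_sum(values, center):
    """Sum of squared deviations of values from a chosen center."""
    return sum((v - center) ** 2 for v in values)


# Hypothetical data: only the last y value differs between the two lists.
x = [1, 2, 3, 4]
y_old = [8, 16, 24, 32]
y_new = [8, 16, 24, 48]

old_mean = sum(y_old) / len(y_old)  # mean before the change
new_mean = sum(y_new) / len(y_new)  # mean after the change

# Covariance numerator for the new data, centered two different ways.
numerator_new_mean = cross_deviation_sum(x, y_new, new_mean)
numerator_old_mean = cross_deviation_sum(x, y_new, old_mean)

# Spread of the new y, centered two different ways.
spread_new_mean = squared_deviation_sum(y_new, new_mean)
spread_old_mean = squared_deviation_sum(y_new, old_mean)

print(old_mean, new_mean)
print(numerator_new_mean, numerator_old_mean)
print(spread_new_mean, spread_old_mean)
```

In this example the two numerators agree while the two spreads differ, so recomputing the mean is the safe habit whenever a data point changes.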

  • @k.vvishwanathan7341
    @k.vvishwanathan7341 1 year ago

    Please clear my doubt. Covariance shows the way two variables move together, whether positive or negative, but what is the correlation of those variables? Does it show how much they have moved? For example, if the correlation is close to -1, do the two variables move in opposite directions?

  • @bayz4918
    @bayz4918 4 months ago

    it is good keep it up

  • @sanjeevkmr5749
    @sanjeevkmr5749 3 years ago

    Amazing explanation. Please keep doing the great work. This channel deserves more!!!
    I have heard that before training an ML model, it is advisable to remove highly correlated features. Can you explain why?

    • @UnfoldDataScience
      @UnfoldDataScience 3 years ago +1

      Hi Sanjeev, I created a detailed video for this question, Watch it on my channel today 7pm IST. :)

    • @sanjeevkmr5749
      @sanjeevkmr5749 3 years ago

      @@UnfoldDataScience Thanks a lot!

  • @jigyasasoni1812
    @jigyasasoni1812 1 year ago

    Within 1:30, I can say you are the best teacher.

    • @UnfoldDataScience
      @UnfoldDataScience 1 year ago

      Thank you Jigyasa. Your comments mean a lot to me

  • @ragess4rari100
    @ragess4rari100 2 years ago +1

    Thank you sir. You're a great teacher.

  • @adityapatki151
    @adityapatki151 3 years ago +4

    Hey, I am a data scientist too and really like your content. Can you make a video about how to statistically select the control group size for a marketing campaign?

    • @UnfoldDataScience
      @UnfoldDataScience 3 years ago +1

      Thanks Aditya. Let me think through it. Thanks for asking.

  • @humanafees8059
    @humanafees8059 3 years ago

    Thank you so much. I am doing an MSc in Business Analytics in Scotland, and seriously, for every question that hits me I come to your channel. I am your new subscriber. God bless you.

  • @sudhakarvasa2688
    @sudhakarvasa2688 1 year ago

    When the Yi value is changed from 32 to 48, the mean of y will also change.

  • @tusharbedse9523
    @tusharbedse9523 2 years ago

    Nicely explained again Aman!

  • @animetalks2129
    @animetalks2129 1 year ago

    Sir, can you suggest a Tableau course for me? I am confused about where to learn from.

  • @vinayverma9121
    @vinayverma9121 2 years ago

    You are amazing buddy you explained it so simply

    • @UnfoldDataScience
      @UnfoldDataScience 2 years ago

      Thanks Vinay for your positive feedback. Please share with others as well who could be benefited from such content.

  • @sadhnarai8757
    @sadhnarai8757 3 years ago +1

    Much needed,thanks :)

  • @RamanKumar-ss2ro
    @RamanKumar-ss2ro 3 years ago +1

    Thanks a lot for this topic.

  • @m12gaming81
    @m12gaming81 6 months ago

    sir ur doing great work luv u

  • @emineakpnar6215
    @emineakpnar6215 3 years ago

    it helped me a lot, thank you🙌

  • @SumitSinghXd
    @SumitSinghXd 3 years ago +1

    Great work. These two terms were alien to me, and the online websites complicated them further. Thanks to you, I have understood them completely. Just a single doubt: on many websites the denominator for variance is N-1, and in your video it is N. Which one should I go for?

  • @kanamarlapudinaresh9934
    @kanamarlapudinaresh9934 3 years ago

    Thanks very much.
    In the last formula, you mentioned (standard deviation of x) (standard deviation of y) in the denominator. How do we calculate these? Can you explain?
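
One way to compute the standard deviations in that denominator by hand is sketched below. The helper names `std_dev` and `correlation`, the `sample` flag, and the data are assumptions for illustration; the sketch uses the N-1 divisor throughout, and the divisor cancels in the correlation as long as the same one is used for the covariance and both standard deviations.

```python
import math


def std_dev(values, sample=True):
    """Standard deviation: square root of the mean squared deviation.

    sample=True divides by N-1, sample=False divides by N.
    """
    n = len(values)
    mean = sum(values) / n
    squared = sum((v - mean) ** 2 for v in values)
    return math.sqrt(squared / (n - 1 if sample else n))


def correlation(xs, ys):
    """Pearson correlation: cov(x, y) / (std_dev(x) * std_dev(y))."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys)) / (n - 1)
    return cov / (std_dev(xs) * std_dev(ys))


# Hypothetical data, not taken from the video.
x = [1, 2, 3, 4, 5]
y = [2, 4, 6, 8, 10]

print(std_dev(x), std_dev(y))
print(correlation(x, y))
```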

  • @karthickk6587
    @karthickk6587 2 years ago

    Amazing Aman

  • @sandipansarkar9211
    @sandipansarkar9211 2 years ago

    finished watching

  • @sudheerrao9820
    @sudheerrao9820 3 years ago

    Thank you for the video, Aman. If four features are positively correlated and four features are negatively correlated out of 10 features in a dataset, what should we do? I mean, which ones need to be dropped, and why?

    • @UnfoldDataScience
      @UnfoldDataScience 3 years ago

      Good question, Sudheer. To keep it short, choose the variables that are most highly correlated with your target variable (either positively or negatively).
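
The suggestion in this reply can be sketched as ranking features by the absolute value of their correlation with the target, so that strongly negative features rank as high as strongly positive ones. This is only an illustration of that one heuristic: the feature names, the data, and the `pearson` helper are hypothetical.

```python
import math


def pearson(xs, ys):
    """Pearson correlation of two equal-length sequences.

    Assumes neither sequence is constant (a constant column would
    give a zero denominator).
    """
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cross = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    spread_x = math.sqrt(sum((x - mean_x) ** 2 for x in xs))
    spread_y = math.sqrt(sum((y - mean_y) ** 2 for y in ys))
    return cross / (spread_x * spread_y)


# Hypothetical features and target.
features = {
    "f_pos": [1, 2, 3, 4, 6],   # rises with the target
    "f_neg": [5, 4, 3, 2, 1],   # falls as the target rises
    "f_weak": [3, 1, 4, 1, 5],  # only loosely related
}
target = [10, 20, 30, 40, 50]

# Rank by strength of relationship, ignoring the sign.
ranked = sorted(
    features,
    key=lambda name: abs(pearson(features[name], target)),
    reverse=True,
)

for name in ranked:
    print(name, round(pearson(features[name], target), 3))
```

Sorting on the absolute value is the design choice that matters here: a correlation of -0.9 carries as much linear signal as +0.9.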

  • @Storiesbymanas
    @Storiesbymanas 3 years ago

    nicely explained. thank you!

  • @omkarnarayankar5275
    @omkarnarayankar5275 2 years ago

    Best explanation

  • @UnfoldDataScience
    @UnfoldDataScience 2 years ago

    Access Hindi and English courses here - www.unfolddatascience.com/s/store
    Please register on the website.

  • @launchdome3219
    @launchdome3219 3 years ago +1

    very helpful...good job

  • @jrsolomon5960
    @jrsolomon5960 3 years ago

    Thanks...this is very clear.

  • @yogitabasnal
    @yogitabasnal 2 years ago

    Great explanation

  • @jay_with_the_real6381
    @jay_with_the_real6381 3 years ago +1

    Excellent thanks!

  • @nagamuthu4382
    @nagamuthu4382 3 years ago

    I would like to compare the level of advances in banks. Is covariance useful for comparing the gross advances of public and private sector banks over 10 years?

    • @UnfoldDataScience
      @UnfoldDataScience 3 years ago +1

      Maybe you can use a more advanced technique. Covariance is a simple technique.

  • @jananiravinag
    @jananiravinag 3 years ago +1

    Great resource!

  • @soumyaranjansethi1790
    @soumyaranjansethi1790 3 years ago

    Amazing sir thank you

  • @bipintiwari8751
    @bipintiwari8751 2 years ago

    Why is the exponential calculated? Can you please explain that as well?

  • @salmanjaved2816
    @salmanjaved2816 3 years ago +1

    Thanks bro 👍

  • @udayteja6595
    @udayteja6595 1 year ago +1

    Sir, when you change 32 to 48, the mean should also change. So it will affect all the deviation terms in the numerator, not only the last one.

  • @kiranpol1601
    @kiranpol1601 3 years ago +1

    wow... What a explanation

  • @afn8370
    @afn8370 1 year ago

    thanksss

  • @vcalls9146
    @vcalls9146 3 years ago +1

    How is new data handled after the model is moved to production? For example, during model development the categorical data is converted to 1s and 0s using one-hot encoding. When new data arrives in production, how is the categorical or text data processed?

    • @UnfoldDataScience
      @UnfoldDataScience 2 years ago

      Good question. You encode the new data again in the same way and then predict.
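
A minimal sketch of what encoding the new data "again" can look like in practice: the category columns are learned once from the training data and the same columns are reused at prediction time. The helper names, the colour data, and the choice to map an unseen category to an all-zero row are assumptions of this example, not something stated in the reply.

```python
def fit_categories(values):
    """Learn the column order from the training data only."""
    return sorted(set(values))


def one_hot(value, categories):
    """Encode one value against the columns learned during training.

    A category never seen in training becomes an all-zero row
    (an assumption of this sketch; other policies are possible).
    """
    return [1 if value == category else 0 for category in categories]


# Hypothetical training data: the columns are fixed from this list.
train_colors = ["red", "green", "blue", "green"]
categories = fit_categories(train_colors)

# Hypothetical production data, including an unseen category.
new_colors = ["green", "purple"]
encoded_new = [one_hot(value, categories) for value in new_colors]

print(categories)
print(encoded_new)
```

Reusing the stored column list keeps the width and order of the encoded rows identical to what the model saw during training.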

  • @balajinatarajan1051
    @balajinatarajan1051 2 years ago +1

    Excellent video

  • @lakshaykhandelwal4284
    @lakshaykhandelwal4284 2 years ago

    very well explained.. 👍

  • @deepseaoflove3683
    @deepseaoflove3683 3 years ago +1

    Sir, do we divide by N or (N-1) to find the covariance? Some lectures show (N-1).

    • @UnfoldDataScience
      @UnfoldDataScience 3 years ago +1

      It makes little difference if your sample size is large.
      See these answers:
      math.stackexchange.com/questions/2936143/do-i-use-n-or-n-1-as-the-denominator-for-covariance

    • @johnsubhash
      @johnsubhash 3 years ago

      @@UnfoldDataScience Then will it matter in the case of a small dataset?
      What should we use then?
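
How much the choice of divisor matters at different sample sizes can be checked directly. Dividing by N-1 instead of N multiplies the result by N/(N-1), so the relative gap is 1/(N-1). The function name `relative_gap` is an assumption for this sketch.

```python
def relative_gap(n):
    """Relative increase from dividing by N-1 instead of N.

    (1/(n-1)) / (1/n) - 1 simplifies to 1/(n-1).
    """
    if n < 2:
        raise ValueError("need at least two observations")
    return n / (n - 1) - 1


for n in (5, 30, 1000):
    print(n, round(relative_gap(n) * 100, 2), "percent")
```

With a handful of observations the gap is large enough to matter, which is why the N-1 form is the usual choice when the data are a sample.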

  • @suhatharsanyoganathapillai2203
    @suhatharsanyoganathapillai2203 2 years ago +1

    Thank you

  • @mosama22
    @mosama22 2 years ago +1

    Thank you Amen :-) :-)

  • @madhurakhaire6583
    @madhurakhaire6583 2 years ago

    Amazing explanation

  • @RishabhRishab
    @RishabhRishab 3 years ago

    What purpose does the covariance value serve? Suppose I say that the sign of the covariance tells the nature of the relationship, its value tells the strength of the relationship, and there is no need for correlation. How is this statement wrong?

    • @UnfoldDataScience
      @UnfoldDataScience 2 years ago

      Good question. However, covariance is the basis of correlation, so that concept came first.
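
One concrete reason the covariance value alone is hard to read as "strength" is that it changes with the units of measurement, while correlation does not. The sketch below uses hypothetical height and weight data and helper names chosen for this example.

```python
import math


def covariance(xs, ys):
    """Sample covariance (divides by N-1)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    return sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys)) / (n - 1)


def correlation(xs, ys):
    """Covariance rescaled by both standard deviations."""
    return covariance(xs, ys) / math.sqrt(covariance(xs, xs) * covariance(ys, ys))


# Hypothetical data: the same heights expressed in two units.
height_cm = [150, 160, 170, 180]
height_m = [h / 100 for h in height_cm]
weight_kg = [50, 58, 66, 80]

cov_cm = covariance(height_cm, weight_kg)
cov_m = covariance(height_m, weight_kg)
corr_cm = correlation(height_cm, weight_kg)
corr_m = correlation(height_m, weight_kg)

print(cov_cm, cov_m)    # differ by a factor of 100
print(corr_cm, corr_m)  # agree up to rounding
```

The relationship between height and weight is the same in both cases; only the unit changed, and only the covariance moved.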

  • @binitkumarsingh8409
    @binitkumarsingh8409 2 years ago

    How do we find the standard deviation of x and y, which is basically that denominator?

    • @UnfoldDataScience
      @UnfoldDataScience 2 years ago

      We do not need to compute it from scratch; a tool will do it for us.

  • @megabuzu2726
    @megabuzu2726 3 years ago +1

    Perfect explanation

  • @priyamkataria573
    @priyamkataria573 2 years ago

    Great content 🔥🔥

  • @leilyb5224
    @leilyb5224 2 years ago +1

    Perfect 👍👍👍👍

  • @shubhamkumarjain1329
    @shubhamkumarjain1329 3 years ago

    Thank you so much. It helped a lot. But I want to know why it ranges between -1 and 1 and not beyond that.

    • @UnfoldDataScience
      @UnfoldDataScience 3 years ago

      Welcome, Shubham. It's because of the underlying mathematical formula.

  • @sandipansarkar9211
    @sandipansarkar9211 3 years ago

    great explanation

  • @chikoo8486
    @chikoo8486 3 years ago

    Is the unit you are talking about in covariance the +ve and -ve sign?

  • @gangavijayan8906
    @gangavijayan8906 9 months ago

    Why is covariance divided by the standard deviation?

  • @learningsinlife
    @learningsinlife 2 years ago

    The mean will also change if 48 is made the new observation.

  • @rohitbhosale4614
    @rohitbhosale4614 3 years ago

    You are a champ!

  • @arihantchoudhary
    @arihantchoudhary 2 years ago

    ❤❤❤

  • @dr.sagarfirke2687
    @dr.sagarfirke2687 3 years ago

    Excellent

  • @saurabsen3686
    @saurabsen3686 3 years ago

    Great presentation that is simple and to the point. However, I could not fully grasp why, when calculating correlation, dividing cov(x, y) by sd(x) multiplied by sd(y) yields a value between -1 and 1. Why is that so? Kindly reply.

    • @UnfoldDataScience
      @UnfoldDataScience 2 years ago

      The explanation is a little more mathematical. Please see the discussion here, Saurabh:
      math.stackexchange.com/questions/564751/how-can-i-simply-prove-that-the-pearson-correlation-coefficient-is-between-1-an
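
The usual short argument for the -1 to 1 range is the Cauchy-Schwarz inequality applied to the deviations from the mean. It is sketched below in LaTeX; the shorthand a_i and b_i is introduced here for convenience, and the sketch assumes the same divisor (N or N-1) is used in the covariance and in both standard deviations, so that it cancels.

```latex
% Shorthand (introduced for this sketch): deviations from the mean.
a_i = x_i - \bar{x}, \qquad b_i = y_i - \bar{y}

% Correlation written in terms of the deviations; the 1/N or 1/(N-1)
% factor cancels between numerator and denominator.
r = \frac{\operatorname{cov}(x, y)}{\sigma_x \, \sigma_y}
  = \frac{\sum_i a_i b_i}{\sqrt{\sum_i a_i^2}\,\sqrt{\sum_i b_i^2}}

% Cauchy-Schwarz inequality bounds the numerator by the denominator.
\Bigl|\sum_i a_i b_i\Bigr| \le \sqrt{\sum_i a_i^2}\,\sqrt{\sum_i b_i^2}
\quad\Longrightarrow\quad -1 \le r \le 1
```

Equality holds only when every b_i is the same multiple of the corresponding a_i, which is the case of a perfect straight-line relationship.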

  • @shoooooooooorts8002
    @shoooooooooorts8002 2 years ago

    Please make a video on statistics for data science, A to Z.

  • @usmanzahid3711
    @usmanzahid3711 1 year ago

    What exactly is covariance?

  • @rediscovermath
    @rediscovermath 24 days ago

    When you change 32 to 48, the mean will also change.

  • @Manya2017
    @Manya2017 1 year ago

    Can you share the link to the data science group so that I can also join?

  • @humanafees8059
    @humanafees8059 3 years ago

    One suggestion: can you please make a detailed video on probability for beginners? It is a request.

  • @D.H.Bangalore
    @D.H.Bangalore 2 years ago

    Looks like while increasing the value of y you forgot to increase the mean of y

  • @ashokmeena9631
    @ashokmeena9631 3 years ago +1

    Sir, what about the virtual interview?

    • @UnfoldDataScience
      @UnfoldDataScience 3 years ago

      Fill in the form in the previous video. I will share an invite.

  • @anilkumarsharma8901
    @anilkumarsharma8901 1 year ago

    Bring supercomputer power to the Windows interface and the world will follow. Get research done on Vedic math.

  • @anilkumarsharma8901
    @anilkumarsharma8901 1 year ago

    More computer power means more success

  • @vaibhavpandey7398
    @vaibhavpandey7398 1 year ago

    I want to ask, this T-shirt was on sale, right? You bought it there, didn't you 🤣🤣🤣🤣 Joking. Liked your video.

  • @sanyamsingh4907
    @sanyamsingh4907 3 years ago

    kids learn from udemy
    legends learn from unfold data science

    • @UnfoldDataScience
      @UnfoldDataScience 3 years ago

      Thanks Sanyam. your words are always motivating. :)

  • @khanraiyan123
    @khanraiyan123 2 years ago

    denominator should be N-1