5 Concepts in Statistics You Should Know | Data Science Interview

แชร์
ฝัง
  • เผยแพร่เมื่อ 1 มิ.ย. 2024
  • 🚀 Land your dream data job using datainterview.com/.
    ====== ✅ Details ======
    Dan, formerly a data scientist at Google and PayPal, reviews 5 fundamental topics candidates need to review in preparation for data science interviews. These are topics that are asked in business-case, statistics, and statistical-coding rounds. For more prep content, check out datainterview.com/
    👍 Make sure to subscribe, like and share!
    ====== ⏱️ Timestamps ======
    0:00 Intro
    00:51 Central Tendency
    05:05 Dispersion
    06:17 Correlation
    10:42 Normal Distribution
    12:53 Hypothesis Testing
    20:00 Other Concepts to Know
    20:41 Conclusion
    ====== 📚 Other Useful Contents ======
    1. Principles and Frameworks of Product Metrics | TH-cam Case Study
    Link: / principles-and-framewo...
    2. How to Crack the Data Scientist Case Interview
    Link: / crack-the-data-scienti...
    3. How to Crack the Amazon Data Scientist Interview
    Link: / crack-the-amazon-data-...
    ====== Connect ======
    📗 LinkedIn - / danleedata
    📘 Medium - / datainterview
  • วิทยาศาสตร์และเทคโนโลยี

ความคิดเห็น • 28

  • @WebsterLincoln
    @WebsterLincoln 2 ปีที่แล้ว +7

    I would describe that as a positively skewed normal distribution, not an exponential distribution. Also, it's the 68-95-99.7 rule

  • @abdallahelmoctar7635
    @abdallahelmoctar7635 ปีที่แล้ว +3

    Such a simple and straight forward refresher. I'm grateful for your work

  • @AllieZhao
    @AllieZhao ปีที่แล้ว

    These are crucial concepts. Thanks

  • @mahmutozmen1261
    @mahmutozmen1261 2 ปีที่แล้ว +2

    Thanks for such a great content and your effort. Would you mind explaining further why you think that mode = median? Since this graph seems like a positively skewed graph, I though mode is around 3, median 4 or 5 and mean between 6 and 10.

  • @basmaelkhamlichi8223
    @basmaelkhamlichi8223 2 ปีที่แล้ว +8

    Hypothesis testing and P value nicely explained, thank you!

  • @shir0tei
    @shir0tei 2 ปีที่แล้ว +1

    Thanks for the video! I The correlation formula is wrong though, the covariance is the numerator divided by n.

  • @jcokonkwo
    @jcokonkwo 2 ปีที่แล้ว +4

    I definitely appreciate the explanation then the applied DS examples right after. Thank you!

    • @DataInterview
      @DataInterview  2 ปีที่แล้ว +1

      That's the best way to learn :)

  • @SaramaKamal
    @SaramaKamal ปีที่แล้ว

    Could you mention tools used to design and present your slides thanks!!!

  • @jacksun7999
    @jacksun7999 25 วันที่ผ่านมา

    6:43 should the numerator be cov(X,Y)? Seems there is a 1/(N-1) term missing.

  • @RedShipsofSpainAgain
    @RedShipsofSpainAgain ปีที่แล้ว +5

    11:16. I think you have a typo: The Normal distribution should be 68-95-99.7%, not 65-95-99.7%

  • @Foba_Bett
    @Foba_Bett ปีที่แล้ว +1

    I am binge-watching your channel ! 😎
    In the correlation section - why not just straight up remove the outliers? 🤔

    • @gaboqv
      @gaboqv 11 หลายเดือนก่อน

      that's what he is telling with a fancy name, you will use quartiles to confirm which of the points are outliers

  • @HarryPotter-st2cn
    @HarryPotter-st2cn 2 ปีที่แล้ว

    Great content. Is non-normal distributions listed separately to put emphasis on it? I believe it will be included within the concept of the overall distributions

  • @dreamingaparisdream3178
    @dreamingaparisdream3178 2 ปีที่แล้ว

    Also where is the link for Meta Statistical Interview questions video please?

  • @bandai2
    @bandai2 2 ปีที่แล้ว

    could you also use Spearman Correlation if you have outliers in your data?

    • @eresque7766
      @eresque7766 ปีที่แล้ว

      late but yeah u could

  • @benxneo
    @benxneo 2 ปีที่แล้ว +2

    could you give me ideas for data science projects that deliver value to businesses

  • @dreamingaparisdream3178
    @dreamingaparisdream3178 2 ปีที่แล้ว +4

    For the normal distribution, is it 66-95-99.7 rule or 68-95-99.7?

    • @TheNIK21HIL
      @TheNIK21HIL 2 ปีที่แล้ว +1

      it is 68% within 1 SD. it must be a typo on Dan's end. The graph though does represent it correctly.

    • @ASHISHDHIMAN1610
      @ASHISHDHIMAN1610 2 ปีที่แล้ว

      @@TheNIK21HIL yeah typo

  • @pal999
    @pal999 2 ปีที่แล้ว

    If you're using a real world example, you shouldn't "ASSUME" the SD to be something. Can you find out how it's determined in real world?

  • @anirbansarkar6306
    @anirbansarkar6306 9 หลายเดือนก่อน

    Can you help me understand on what basis have you assumed population standard deviation to be 20?

  • @stanislavdidenko8436
    @stanislavdidenko8436 ปีที่แล้ว

    2pm - poisson distribution

  • @BrianSalamone
    @BrianSalamone 2 หลายเดือนก่อน

    1:08 8 hours a day in Facebook????? What is the X at the bottom?

  • @michaell9804
    @michaell9804 2 ปีที่แล้ว +3

    You failed to mention bayes theorem and binomial distribution which is used here just as heavily as normal distribution particularly when quantifying the probability distribution of the accuracy of unsupervised learning models. This video is not comprehensive at all

    • @Omegageekk
      @Omegageekk 2 ปีที่แล้ว +13

      If you thought a video titled “5 concepts in statistics you should know” would be a comprehensive breakdown of literally every stats concept you need for data science, then I have a bridge to sell you.