5 Concepts in Statistics You Should Know | Data Science Interview

แชร์
ฝัง
  • เผยแพร่เมื่อ 8 ม.ค. 2025

ความคิดเห็น • 35

  • @WebsterLincoln
    @WebsterLincoln 2 ปีที่แล้ว +17

    I would describe that as a positively skewed normal distribution, not an exponential distribution. Also, it's the 68-95-99.7 rule

  • @Victor-yn6lv
    @Victor-yn6lv 10 วันที่ผ่านมา

    Thank you for sharing this excellent video, Dan!

  • @abdallahelmoctar7635
    @abdallahelmoctar7635 ปีที่แล้ว +3

    Such a simple and straight forward refresher. I'm grateful for your work

  • @annizheng5289
    @annizheng5289 5 หลายเดือนก่อน +1

    Nice for having a brush up! Thank you!

  • @YasirWani
    @YasirWani 9 วันที่ผ่านมา

    Great Video! Though CLT wasn't much clear! Thanks!

  • @RedShipsofSpainAgain
    @RedShipsofSpainAgain 2 ปีที่แล้ว +8

    11:16. I think you have a typo: The Normal distribution should be 68-95-99.7%, not 65-95-99.7%

  • @basmaelkhamlichi8223
    @basmaelkhamlichi8223 3 ปีที่แล้ว +8

    Hypothesis testing and P value nicely explained, thank you!

  • @asadhasnainbaqri5455
    @asadhasnainbaqri5455 3 หลายเดือนก่อน

    significance level is the probability of making type 1 error, which is rejecting null hypothesis when it was true

  • @jacksun7999
    @jacksun7999 8 หลายเดือนก่อน

    6:43 should the numerator be cov(X,Y)? Seems there is a 1/(N-1) term missing.

  • @shir0tei
    @shir0tei 3 ปีที่แล้ว +2

    Thanks for the video! I The correlation formula is wrong though, the covariance is the numerator divided by n.

  • @jcokonkwo
    @jcokonkwo 3 ปีที่แล้ว +4

    I definitely appreciate the explanation then the applied DS examples right after. Thank you!

    • @DataInterview
      @DataInterview  3 ปีที่แล้ว +1

      That's the best way to learn :)

  • @mahmutozmen1261
    @mahmutozmen1261 2 ปีที่แล้ว +2

    Thanks for such a great content and your effort. Would you mind explaining further why you think that mode = median? Since this graph seems like a positively skewed graph, I though mode is around 3, median 4 or 5 and mean between 6 and 10.

  • @dreamingaparisdream3178
    @dreamingaparisdream3178 2 ปีที่แล้ว +4

    For the normal distribution, is it 66-95-99.7 rule or 68-95-99.7?

    • @TheNIK21HIL
      @TheNIK21HIL 2 ปีที่แล้ว +1

      it is 68% within 1 SD. it must be a typo on Dan's end. The graph though does represent it correctly.

    • @ASHISHDHIMAN1610
      @ASHISHDHIMAN1610 2 ปีที่แล้ว

      @@TheNIK21HIL yeah typo

  • @ayushmathur5984
    @ayushmathur5984 5 หลายเดือนก่อน +1

    in #2 at 4.30 mins how he told that mean,median and mode value can anyone explain please

  • @ketanverma7839
    @ketanverma7839 23 วันที่ผ่านมา

    in 3-sigma rule isn't it 68 instead of 65 at first standard deviation?

  • @cosystudy55
    @cosystudy55 2 ปีที่แล้ว

    Could you mention tools used to design and present your slides thanks!!!

  • @Foba_Bett
    @Foba_Bett ปีที่แล้ว +1

    I am binge-watching your channel ! 😎
    In the correlation section - why not just straight up remove the outliers? 🤔

    • @gaboqv
      @gaboqv ปีที่แล้ว

      that's what he is telling with a fancy name, you will use quartiles to confirm which of the points are outliers

  • @benxneo
    @benxneo 3 ปีที่แล้ว +2

    could you give me ideas for data science projects that deliver value to businesses

  • @AllieZhao
    @AllieZhao 2 ปีที่แล้ว

    These are crucial concepts. Thanks

  • @pal999
    @pal999 2 ปีที่แล้ว

    If you're using a real world example, you shouldn't "ASSUME" the SD to be something. Can you find out how it's determined in real world?

  • @dreamingaparisdream3178
    @dreamingaparisdream3178 2 ปีที่แล้ว

    Also where is the link for Meta Statistical Interview questions video please?

  • @anirbansarkar6306
    @anirbansarkar6306 ปีที่แล้ว

    Can you help me understand on what basis have you assumed population standard deviation to be 20?

  • @bandai2
    @bandai2 2 ปีที่แล้ว

    could you also use Spearman Correlation if you have outliers in your data?

    • @eresque7766
      @eresque7766 ปีที่แล้ว

      late but yeah u could

  • @stanislavdidenko8436
    @stanislavdidenko8436 2 ปีที่แล้ว

    2pm - poisson distribution

  • @BrianSalamone
    @BrianSalamone 10 หลายเดือนก่อน

    1:08 8 hours a day in Facebook????? What is the X at the bottom?

  • @HarryPotter-st2cn
    @HarryPotter-st2cn 3 ปีที่แล้ว

    Great content. Is non-normal distributions listed separately to put emphasis on it? I believe it will be included within the concept of the overall distributions

  • @michaell9804
    @michaell9804 3 ปีที่แล้ว +4

    You failed to mention bayes theorem and binomial distribution which is used here just as heavily as normal distribution particularly when quantifying the probability distribution of the accuracy of unsupervised learning models. This video is not comprehensive at all

    • @Omegageekk
      @Omegageekk 3 ปีที่แล้ว +13

      If you thought a video titled “5 concepts in statistics you should know” would be a comprehensive breakdown of literally every stats concept you need for data science, then I have a bridge to sell you.