What are Bootstrap and Permutation Tests in Data Science? Easy Explanation for Beginners

แชร์
ฝัง
  • เผยแพร่เมื่อ 16 ก.ค. 2024
  • In this video, we’ll be focusing on the concept of resampling. I’ll introduce you to two particularly useful resampling techniques: bootstrapping and permutation tests. Then, we’ll compare the two.
    🟢Get all my free data science interview resources
    www.emmading.com/resources
    🟡 Product Case Interview Cheatsheet www.emmading.com/product-case...
    🟠 Statistics Interview Cheatsheet www.emmading.com/statistics-i...
    🟣 Behavioral Interview Cheatsheet www.emmading.com/behavioral-i...
    🔵 Data Science Resume Checklist www.emmading.com/data-science...
    ✅ We work with Experienced Data Scientists to help them land their next dream jobs. Apply now: www.emmading.com/coaching
    // Comment
    Got any questions? Something to add?
    Write a comment below to chat.
    // Let's connect on LinkedIn:
    / emmading001
    ====================
    Contents of this video:
    ====================
    00:00 Introduction to Resampling
    00:51 Sample and Sampling
    01:17 Why Is Resampling Useful?
    03:22 Assumptions of Resampling
    03:47 Resampling Methods
    04:01 Bootstrapping
    05:08 Bootstrap Distribution vs. Sampling Distribution
    05:41 How Many Resamples Are Needed?
    05:58 Permutation Tests
    08:33 Bootstrapping vs. Permutation Tests
    09:01 Statistics-Related Resources

ความคิดเห็น • 11

  • @miguelbuenogoogle
    @miguelbuenogoogle 6 หลายเดือนก่อน +1

    Great video Emma! Very digestible and clearly explained.
    Some additional context here for those who are interested; At @8:56, the bootstrap does rely on some assumptions. Ultimately, the consequence of these assumptions is that the standard Bootstrap struggles with extrema statistics like the sample maximum and sample minimum. So, you'll want to keep that in mind during interviews. Also, at risk of adding additional confusion, the Bootstrap is generally considered a non-parametric procedure. Alternatively, there is a parametric Bootstrap. Either of these can be used to construct confidence intervals whose interpretations can be equated to hypothesis tests, and in practice often are.

  • @rohmprakaash1577
    @rohmprakaash1577 ปีที่แล้ว

    Hi, Emma Keep posting data science videos like this, and surely will helpful for beginner like me and will motivating for many .

  • @Undertheauroraborealis
    @Undertheauroraborealis ปีที่แล้ว

    If I had to follow one channel on TH-cam it would be this! I can't say enough of how invaluable the content has been for me. Thanks Emma!

    • @emma_ding
      @emma_ding  ปีที่แล้ว

      Wow, thank you very much for the kind compliment! It makes me so happy to know you're benefitting from my content, which is the whole reason I make it in the first place! Thanks for watching! 💛

  • @rezarafieirad
    @rezarafieirad ปีที่แล้ว

    amazing and easy to understand

  • @hubertnguyen8855
    @hubertnguyen8855 ปีที่แล้ว

    Good video and clear explanation. thanks

    • @emma_ding
      @emma_ding  ปีที่แล้ว

      My pleasure, Hubert! Thanks for watching. 😊

  • @KrishnaMurthy-wh4vh
    @KrishnaMurthy-wh4vh ปีที่แล้ว +1

    Hi, I recently came across a question. What is the probability of getting all unique samples when bootstrap sampled n times in dataset containing n datapoints.

  • @marioluoni7588
    @marioluoni7588 ปีที่แล้ว

    Imagine you have an unknown population of positive real vectors (ai, bi, ci, di). You are given a sample from that population. You don't know what sampling algorithm was used, but you know that it was a biased algorithm in that vectors with small first component 'ai' were more likely to be sampled. Based on this biased sample, is there a way to a) make an educated guess at what the sampling algorithm was, and b) find a sampling algorithm that correlates even more with small first components?

  • @yashnikhare2210
    @yashnikhare2210 ปีที่แล้ว

    Hi Emma , Could you please share the presentation slides?

    • @emma_ding
      @emma_ding  ปีที่แล้ว

      Hi Yash, thanks for your comment! We don’t currently release our video content in the form of slides. However, if you’d like to know about some of the other services we *do* offer, feel free to check out our website at datainterviewpro.com to learn more! 💛