Python for Data Analysis: Hypothesis Testing and T-Tests

แชร์
ฝัง
  • เผยแพร่เมื่อ 14 ต.ค. 2024
  • This video covers the basics of statistical hypothesis testing and t-tests in Python. This video explains the basics of statistical hypothesis testing and shows how to run one-way, two-way and paired t-tests in Python.
    Subscribe:
    ► www.youtube.co...
    This is lesson 24 of a 30-part introduction to the Python programming language for data analysis and predictive modeling. Link to the code notebook below:
    Python for Data Analysis: Hypothesis Testing and T-Tests
    www.kaggle.com...
    This guide does not assume any prior exposure to Python, programming or data science. It is intended for beginners with an interest in data science and those who might know other programming languages and would like to learn Python.
    I will create the videos for this guide such that you should be able to learn a lot just watching on TH-cam, but to get the most out of the guide, it is recommended that you create a Kaggle account so that you can copy and edit each lesson so that you can follow along and run code yourself.
    Introduction to Python Playlist:
    • Python for Data Analysis
    Link to the Python for Data Analysis written guide index page:
    www.kaggle.com... .
    ⭐ Kite is a free AI-powered coding assistant that integrates with popular editors and IDEs to give you smart code completions and docs while you’re typing. It is a cool application of machine learning that can also help you code faster! Check it out here: www.kite.com/g...

ความคิดเห็น • 26

  • @martyzeenyc1210
    @martyzeenyc1210 3 ปีที่แล้ว +18

    I want to thank you for these videos as I'm struggling in my college data science course. This has helped me massively!

  • @grainofsalt2113
    @grainofsalt2113 2 ปีที่แล้ว +5

    you explained this concept in the simplest way i have ever seen

  • @florenciaortega6543
    @florenciaortega6543 3 ปีที่แล้ว +2

    Thank you very much!!! I couldn't find anywhere else this test as well explained as you did it. So accurated explanation. Thank you! A+ !

  • @michaelolubode6168
    @michaelolubode6168 2 ปีที่แล้ว

    Thanks for this video. I knew the theory of hypothesis testing and can do it on paper but it wasn't easy to do the same in python until I saw your video. Thanks for the simplicity.

  • @arashkashefian1735
    @arashkashefian1735 8 หลายเดือนก่อน

    thank you very useful video. just wondering for two sample or paired tests, is there a way to test if null hypothesis is not just 0 but some none-zero value. For example if S1 is the first sample and S2 is the second sample, then how do we test the hypothesis that S1 - S2 > 1

  • @marinastolet7799
    @marinastolet7799 2 ปีที่แล้ว +1

    This is exactly what I needed, thank you.

  • @kits1111
    @kits1111 ปีที่แล้ว

    you are so awesome !! you explained so well ...

  • @neelroy3
    @neelroy3 2 ปีที่แล้ว +1

    which statistical test can be used to find difference between two groups' percentage values?

  • @jongcheulkim7284
    @jongcheulkim7284 2 ปีที่แล้ว +1

    Thank you so much. This is very helpful.

  • @atom8926
    @atom8926 ปีที่แล้ว

    How you have learn statistics? Please mention some good resource to learn

  • @valda313
    @valda313 3 ปีที่แล้ว +4

    Shouldn't normality testing be done before performing ttests?
    (Otherwise, great video, thanks 👍🏻)

    • @DataDaft
      @DataDaft  3 ปีที่แล้ว +4

      That is a good point Valda. The distributions should be normal for the t-test, which can be checked through informal means like inspecting a histogram or normal qqplot or more formally with a test like scipy.stats.shapiro(). If the sample data is large enough though, say 50+, that might be adequate for the sample data due to the normality of the sampling distribution via the central limit theorem, but I'm not sure there's a good hard-and-fast rule as to when things are "not normal enough." It is probably a good idea to also run a non-parametric test like the Mann-Whitney test for independent samples and Wilcoxon Signed Rank test for paired samples if normality questionable.

    • @valda313
      @valda313 3 ปีที่แล้ว +1

      @@DataDaft thanks for response! When I compute these types of statsicial testing, I always firstly do normality test (scipy.stats.shapiro). Based on the result of normality testing I choose either from parametric ttests or nonparametric tests (like Wilcoxon or Mann-Whitney).

    • @DataDaft
      @DataDaft  3 ปีที่แล้ว +2

      @@valda313 Thanks for the input! it is helpful to have knowledge viewers fill in gaps (or make me aware of errors). Helps everyone learn.

  • @svitirur1665
    @svitirur1665 3 ปีที่แล้ว

    Do I need hypothesis testing in machine learning modeling,? or lets say when should i do hypotheses testing in dataset , as a data scientist

    • @DataDaft
      @DataDaft  3 ปีที่แล้ว +3

      Hypothesis testing is a core statistical idea that plays a role in many other concepts in data science and machine learning. Basically any time you have a situation where you want to investigate whether one sample of data differs another (or from a population), hypothesis testing is something to consider. For example, it is at the core of A/B testing which is used to choose between two different options, like which version of an ad or website attracts more clicks.

  • @durgabhavanikonamarthi6448
    @durgabhavanikonamarthi6448 3 ปีที่แล้ว

    what do we do to our model if we accept an alternative hypothesis?

  • @janabark6415
    @janabark6415 3 ปีที่แล้ว

    Please, what if I have a different number of records for each of the testing group.
    For instance 2000 records for control and 2050 for test group. Can I use python function :
    t_stat, p_val= ss.ttest_ind(df_cnt.exp_rev,df_trt.exp_rev)?
    I got result:
    T-score = 0.16434444604672976
    # There is 16 % deviation from H0 mean
    # p-value = 0.8694662602367074
    # p-value is > than significance level i.e. 0.05
    # Therefore I am rejecting H1 the treatment did not performed better than the control
    Can I interpret it like this? Thank you very mucho in advance.

  • @kartiksharma-yw7qf
    @kartiksharma-yw7qf 3 ปีที่แล้ว +1

    You are damnnn good m loving it to study with you.

  • @johnnybastos3390
    @johnnybastos3390 2 ปีที่แล้ว

    can i say that a p-value = false positive probability?

  • @ayush9psycho
    @ayush9psycho 3 ปีที่แล้ว

    quality material!!

  • @atom8926
    @atom8926 ปีที่แล้ว

    Awesome

  • @iddymanhunter1
    @iddymanhunter1 3 ปีที่แล้ว

    Amazing!!!

  • @forbesavila8006
    @forbesavila8006 ปีที่แล้ว

    Why do you set degree of freedom to 49?