Tutorial 32- All About P Value,T test,Chi Square Test, Anova Test and When to Use What?

แชร์
ฝัง
  • เผยแพร่เมื่อ 9 ม.ค. 2020
  • Please join as a member in my channel to get additional benefits like materials in Data Science, live streaming for Members and many more
    / @krishnaik06
    Please do subscribe my other channel too
    / @krishnaikhindi
    ----------------------------------------------------------------------------------------------------------------------------------------------------------------
    Many people has asked me about various products and tech gadgets that I use for recording purpose. Here I have put up all the products. Definitely check it out :)
    💻Primary Laptop : amzn.to/3AuKL8z
    💻Secondary Laptop: amzn.to/3wrznrb
    🎧 Headphones for streaming: amzn.to/3hdbzCS
    🎧 Wireless Earbuds: amzn.to/3wg6aiD
    💺 Chair for Studio: amzn.to/2TuJ0Ia
    🖥️ Monitors : amzn.to/3AvQkDK,amzn....
    📷 HD Webcam: amzn.to/3hvqBmm
    📋 Pen Tablet For Writing: amzn.to/3xdkGsN
    🧸 Soft-Toy: amzn.to/3hDUyk8
    📷 DSLR Camera: amzn.to/3dKnsxR
    Connect with me here:
    Twitter: / krishnaik06
    Facebook: / krishnaik06
    instagram: / krishnaik06

ความคิดเห็น • 306

  • @tanvirshakir
    @tanvirshakir 3 ปีที่แล้ว +18

    Amazing, its like my 5-6 hour online class video merged into a 12 minute video.

  • @abhishektyagi101
    @abhishektyagi101 4 ปีที่แล้ว +8

    Amazing video sir...
    It has cleared my doubt on one of contradictory topic.
    Thank you very much for this teaching........

  • @BiancaAguglia
    @BiancaAguglia 4 ปีที่แล้ว +58

    This was a good overview of the different hypothesis tests. Looking forward to seeing more videos from you in this series. 😊

    • @anamikalahiri2031
      @anamikalahiri2031 3 ปีที่แล้ว +1

      Thanks SAR

    • @ammar46
      @ammar46 2 ปีที่แล้ว

      You cannot take up any test, like if you want to use a binomial test, then your question should follow that binomial distribution.

  • @jibinjosemathewjibinjosema7617
    @jibinjosemathewjibinjosema7617 3 ปีที่แล้ว +2

    Thank you so much Sir...now i learned and understand the difference in between the T test, correlation, ANOVA.. P value significance ...etc

  • @ogahgodwin2390
    @ogahgodwin2390 3 ปีที่แล้ว +4

    You're doing a great job, sir. Understanding these concepts is as important as knowing how to code.

  • @rajatchaturvedi9332
    @rajatchaturvedi9332 4 ปีที่แล้ว +8

    Watching you hustle...i push my limits 🙏
    Thanks you so much Sir.

    • @arshadaafaqd8636
      @arshadaafaqd8636 ปีที่แล้ว

      Hey its three year's, what were you hustling, did you achieve that?

  • @abhinavraghunandankannan3546
    @abhinavraghunandankannan3546 4 ปีที่แล้ว +98

    Good job !! Some parts of the explanation can be improved, especially your point about ANOVA test when a categorical variable has more than 2 possible values. Consider slowly down and collecting your thoughts together and your videos will be even more effective.

    • @tehminakakar8753
      @tehminakakar8753 7 หลายเดือนก่อน

      Yeah, I have the question, when he takes Gender and Age Group, then he used Chi-Square test, but later said when a category containing multiple values (not binary) then we use ANOVA.

    • @neetaszone774
      @neetaszone774 4 หลายเดือนก่อน

      Sir I wish to watch all your vedios ..I subscribed it.. pl send all liks regarding Excell,data types, hypothesis testing,

  • @anandacharya9919
    @anandacharya9919 4 ปีที่แล้ว +1

    Super and Great, This was what I was waiting for long time, Thank you again 🙏

  • @abidhussainwani3028
    @abidhussainwani3028 3 ปีที่แล้ว

    It's the best overview of tests I have seen on TH-cam.
    Awesome dear sir.... Thank you.

  • @zarmeenakhan899
    @zarmeenakhan899 6 หลายเดือนก่อน

    best channel for learning statistics i've found so far. Great job

  • @tulasikrishna5429
    @tulasikrishna5429 3 ปีที่แล้ว +42

    Typically you reject Null Hypothesis or You Fail to reject Null Hypothesis. "Accepting" H0 or Ha term is typically not used..

    • @RANJEETSINGH-tr4ko
      @RANJEETSINGH-tr4ko 2 ปีที่แล้ว +1

      I had the same point, either we reject Null hypothesis or we fail to reject it.

    • @utkarshvikas7983
      @utkarshvikas7983 9 หลายเดือนก่อน

      Take null - *they are independent*and then proceed.

    • @Tusharchitrakar
      @Tusharchitrakar 6 หลายเดือนก่อน

      Exactly. You don't accept either alternate or null hypothesis.

  • @priyankasaklani7703
    @priyankasaklani7703 3 ปีที่แล้ว +1

    Wonderful explanation, thank you very much for making it so easy and interesting

  • @yugoshreesarkar3440
    @yugoshreesarkar3440 2 ปีที่แล้ว +1

    Thank you so much for putting it all together in this concise video.

  • @abhaypratap5311
    @abhaypratap5311 4 ปีที่แล้ว +1

    Hi I m trying to build a entropy based filter which could categorize benign and malicious domain..so I need a threshold value, got some value between 3 to 4 above 4 value it laying in malicious category so the optimum value would be between 3 to 4 so how 2 optimize it ?

  • @VVV-wx3ui
    @VVV-wx3ui 4 ปีที่แล้ว +4

    I think the starting point of Data Science is the Analysis of Data and these tests determine the Algorithm and the Regularization method to implement to minimize the cost function (RSS).
    Read recently that
    1) Co-variance and Multi-Collinearity would have impact on the Coefficients and NO impact on predictions
    2) There are L1 and L2 Norm regularization methods. A study (Mark Schmidt CS542B Project Report December 2005) says that L1 with Optimizing Least Squares is better than L2. Reason being that L2 does not address Parsimony (sparsity) of the model and Interpretability of the coefficients values and all it aims is Shrinking the Coefficients. L1 regularization has many benefits of the L2 and yet, sparsity and interpreting coefficients is easy.
    While above two are understandable in English but not as Statistics. May I request you to cover these, if possible, in your next session.
    Its so nice to see "whys" and "whens" in this video, which I think is the matter for Data Scientist. Great Work Krish. Please keep it going with more Whys and Whens.

  • @peacefulvibes1089
    @peacefulvibes1089 10 หลายเดือนก่อน

    Great explanation, much better than the education I received in the last three months combined.

  • @nikhilpriyanka
    @nikhilpriyanka 4 ปีที่แล้ว +1

    I am grateful for the brief information for the various test in the hypo & null hypo. helpful

  • @adilzia
    @adilzia 3 ปีที่แล้ว +1

    Hello doctor. If I want to see the change in sales before and after the covid19. Then should I use paired sample t test or any other test. Can you please suggest.

  • @sadhnasingh877
    @sadhnasingh877 4 ปีที่แล้ว +5

    Hi Krish, thanks for this amazing video. Could you explain this using python with the sample data set.

  • @kkilua6956
    @kkilua6956 2 ปีที่แล้ว

    Thank you so much. I love your method and pace of teaching.

  • @debjeetdas1882
    @debjeetdas1882 4 ปีที่แล้ว +2

    Hello,
    Can you please add a video implementing the pipelining technique for ensembling more than two different algorithms together.

  • @MrMultiSuprime
    @MrMultiSuprime 3 ปีที่แล้ว

    You sir are amazing! Thank you for this video!

  • @betanapallisandeepra
    @betanapallisandeepra 3 ปีที่แล้ว

    thank you for doing this video. it is a very useful and good explanation with a simple example.

  • @ZEA_TATA
    @ZEA_TATA 3 ปีที่แล้ว +2

    This video clear 80% of our Hypothesis testing concepts. It's a very good explanation.

    • @sanjeetsingh-iz1rb
      @sanjeetsingh-iz1rb 3 ปีที่แล้ว +1

      What about the remaining 20% of the concepts

    • @___DannyBoyka
      @___DannyBoyka 2 ปีที่แล้ว

      @@sanjeetsingh-iz1rb significance level is 20% in this case

  • @ranga-bashyam
    @ranga-bashyam ปีที่แล้ว +1

    sir! really blessed to watch your videos!! ur passion towords it make me feel enlightned 💯🙏

  • @bhargavpotluri5147
    @bhargavpotluri5147 4 ปีที่แล้ว +3

    Thanks for the compacted video & all the tests at one place. I don't think so there is any other video on you tube explaining all the tests in such short & meaningful way. Nice video.
    Also, just got a doubt what test do we need when there is a categorical & numeric variable combination?

    • @Mangkuisingsit
      @Mangkuisingsit ปีที่แล้ว

      If I'm not mistaken, acc to what he say if there are combination of categorical and numerical where both categorical and numerical variables has more than two distinct sets of value or group then Anova test should be apply.

  • @anikethdeshpande8336
    @anikethdeshpande8336 4 ปีที่แล้ว

    Thank You, very clear explanation

  • @vijayabharathi7239
    @vijayabharathi7239 2 ปีที่แล้ว

    Super krish naik jeee crystal clear explanation …..preparing for PhD
    It’s helping me a lot thank you once again

  • @Agrima_Art_World
    @Agrima_Art_World 4 ปีที่แล้ว

    How can we apply ANOVA test on mix of Categorical and Continuous Variables ?

  • @johnokech4232
    @johnokech4232 6 หลายเดือนก่อน

    Amazing this has given me a clear understanding.

  • @vijaypalmanit
    @vijaypalmanit 4 ปีที่แล้ว +2

    We can only reject null hypothesis but never accept alternate hypothesis. Based on test we can only conclude that we either have evidence in favor of null hypothesis or not.

  • @surekhagaikwad2801
    @surekhagaikwad2801 3 ปีที่แล้ว

    Sir, how we will do statistical test for increasing the presence of certain object (like ships/solar panels) in the images instead of using entire dataset.

  • @roshr9620
    @roshr9620 2 ปีที่แล้ว

    Dear Sir Thanks for the informative video.
    I have a query about using regression in longitudinal data (I have survey data collected at 4 different times).
    Now how to analyze this data?
    (Say a relationship Satisfaction--> Loyalty. Now I have four Beta values at 4 times. How can I use this data to help formulate a longitudinal relationship between Satisfaction and Loyalty?)
    Thanks

  • @silmoonislam9537
    @silmoonislam9537 2 ปีที่แล้ว

    thank you so much!! you make things easier!!

  • @Srikakolapu
    @Srikakolapu 3 ปีที่แล้ว

    Is there any relation between choosing null hypothesis and p value computation ?

  • @ShachisAcademy
    @ShachisAcademy 3 ปีที่แล้ว

    Its great to seea good video on hypothesis testing.... good going..

  • @shivambhayre5056
    @shivambhayre5056 4 ปีที่แล้ว +5

    Krish we understand the concept but don't know how to implement it in real dataset on python or R please make video on that by doing in jupyter notebook or rstudio.

  • @bharteshtandon5095
    @bharteshtandon5095 4 ปีที่แล้ว +1

    i am not able to join your channel please help I need your air quality index tutorial

  • @user-en5yv3iu2p
    @user-en5yv3iu2p 3 ปีที่แล้ว

    Very good explaining sir. Thank u ❤

  • @orangemegamind
    @orangemegamind 17 วันที่ผ่านมา

    Very informative video!!😃

  • @gajendrap.s.raghava6421
    @gajendrap.s.raghava6421 3 ปีที่แล้ว

    Excellent video, describe concept clearly

  • @cinemascope8847
    @cinemascope8847 4 ปีที่แล้ว

    Can anybody explain what to do if null hypothesis is rejected ? Should we keep that feature or remove that feature?

  • @sagarkumarbudihal3026
    @sagarkumarbudihal3026 4 ปีที่แล้ว +1

    Thank you very much, Krish. Tomorrow I have a mock interview on Machine Learning. a lot of thanks to you.

  • @sandipansarkar9211
    @sandipansarkar9211 3 ปีที่แล้ว

    Thanks Krishh for the awesome video.

  • @sameergoilkar9956
    @sameergoilkar9956 ปีที่แล้ว +1

    best playlist i have seen ever

  • @RevenueRocketeers
    @RevenueRocketeers 2 ปีที่แล้ว

    Thanks for the lucid explanation.

  • @adityapatnaik6079
    @adityapatnaik6079 4 ปีที่แล้ว +3

    THIS IS YOUR BEST VIDEO SO FAR !

  • @chiragagrawal7104
    @chiragagrawal7104 3 ปีที่แล้ว

    When we should consider that this will be null hypothesis and this will be alternate hypothesis?

  • @alishbashahid5736
    @alishbashahid5736 3 ปีที่แล้ว

    Having a majorcard would give information that person is self employed or not, what type of statistical technique will apply?

  • @aishwaryadey8713
    @aishwaryadey8713 ปีที่แล้ว +1

    You speak very fast! thank you for explaining so well

  • @Tungse98
    @Tungse98 3 ปีที่แล้ว

    Easy to understand.. You have enlightened me :D

  • @nilupulperera
    @nilupulperera 4 ปีที่แล้ว

    Very good video again as earlier. The way of connecting different concepts together is the difficult part for beginners and students. Your approach to answering the above issues are excellent Krish. Thank you very much. Please continue your good job for this world.

  • @kamran_desu
    @kamran_desu 3 ปีที่แล้ว

    Very nice explanation. Linked to these types of tests, when do we use the F-test?

  • @anynegi7456
    @anynegi7456 2 ปีที่แล้ว

    Thank u so much sir it really helped me a lot to understand this concept

  • @thirupathireddy6149
    @thirupathireddy6149 3 ปีที่แล้ว +3

    krish, I have observed that you mentioned to use T - test for two numerical variables and again you mentioned correlation test.

  • @prashanthshetkar2350
    @prashanthshetkar2350 4 ปีที่แล้ว

    thanks alot for this beautiful content

  • @noornajwabintimdamin_2882
    @noornajwabintimdamin_2882 6 หลายเดือนก่อน

    Good job!! I really like and understand your video.

  • @JaydeepSinghTindori
    @JaydeepSinghTindori 3 ปีที่แล้ว

    we can't say that the t-test and chi^2 is used only for the categorical variable. We can use it or analysis of mean, variance etc.

  • @amazingvideoswithyasser9574
    @amazingvideoswithyasser9574 4 ปีที่แล้ว

    sir, I want to find if there is difference in the rate of errors between males and females. which test can i apply?

  • @himaanshusingha
    @himaanshusingha 3 ปีที่แล้ว

    Sir you explained it very well, in a very easy to understand way. The only problem was audio quality. Else everything was perfect.

  • @jijie133
    @jijie133 2 ปีที่แล้ว

    Great video!

  • @NavdeepSingh-bm8or
    @NavdeepSingh-bm8or 3 ปีที่แล้ว

    very nicely explained. Thank you

  • @scientificidol
    @scientificidol ปีที่แล้ว +2

    You need a correction: Rejecting the null hypothesis does not mean that we accept the alternate hypothesis.
    We never accept the alternate hypothesis. We only reject the numm hypothesis or fail to reject. We don't do anything with the alternate hypothesis.

    • @Darklord-uk6yi
      @Darklord-uk6yi 11 หลายเดือนก่อน

      could you point to some more references of what you have said, cause till now even i thought that if we reject H0 we accept H1, if not references then maybe explain a bit more as to why. thank you!

  • @sunnysavita9071
    @sunnysavita9071 4 ปีที่แล้ว

    in the ANOVA test we perform the F-test ?

  • @krishnakanthchowdarybandi9838
    @krishnakanthchowdarybandi9838 4 ปีที่แล้ว

    sir can p value,T test,chi Square test be implemented to CNN
    if so please make a video on it with practical implementation

  • @jigneshjash89
    @jigneshjash89 4 ปีที่แล้ว +1

    Thanks this helps!!

  • @musabtanzeel4030
    @musabtanzeel4030 2 ปีที่แล้ว +1

    Thank You sir... It was very knowledge full

  • @user-qb9qf5mb5s
    @user-qb9qf5mb5s ปีที่แล้ว

    Thanks a lot. Thanks for excellent explaination

  • @sandipansarkar9211
    @sandipansarkar9211 3 ปีที่แล้ว

    watching the video for second time for revision. Thanks

  • @SAS020
    @SAS020 3 ปีที่แล้ว +2

    The p-value is the likelihood of the observed data, given that the null hypothesis is true. The more it is low, the more we are confident to reject H0

  • @soujanyabagam2034
    @soujanyabagam2034 3 ปีที่แล้ว

    sir, you said p value is the probability of alternate hypothesis to happen when we consider null hypothesis is true. then smaller the value, more weight it should add on null hypothesis, why is it other way round?

  • @pallavibub5804
    @pallavibub5804 3 ปีที่แล้ว

    Well explained!
    Thanks

  • @gh504
    @gh504 2 ปีที่แล้ว

    Thank you so much for this nice explanation

  • @vijayabharathi7239
    @vijayabharathi7239 2 ปีที่แล้ว

    Your explanation creating interest to learn statistics

  • @victorcapitano
    @victorcapitano 8 หลายเดือนก่อน

    Thank you for your effort sire

  • @saurabhpaul1602
    @saurabhpaul1602 3 ปีที่แล้ว

    Great video👍👍 really helpful

  • @user-vn9we6eh2b
    @user-vn9we6eh2b 2 หลายเดือนก่อน

    thank you, PLEASEi FOUND IN SMART PLS in structural model results some huge t statistics for example i got three huge values for three hypotheses 14 OR 17 OR 23 is there any problem in my data? thank you

  • @danchou9970
    @danchou9970 3 ปีที่แล้ว

    Why did you take that as alternate hypothesis? Is there a reason or layman rule for choosing which one will be null and which one will be alternate hypothesis?

  • @soujanyabagam2034
    @soujanyabagam2034 3 ปีที่แล้ว

    Hi sir, How can we decide what is null hypothesis and what is alternate hypothesis?

  • @thulasirao9139
    @thulasirao9139 3 ปีที่แล้ว

    Awesome explanation thank you

  • @anirudhashukla1305
    @anirudhashukla1305 ปีที่แล้ว

    for 2 continuous variables sir has said that we use correlation. but correlation is used when two variables have linear dependence. What test do we have to use for non linear continuous variables to find whether the variables are significant to each other or not???

  • @tinamukherjee6605
    @tinamukherjee6605 หลายเดือนก่อน

    Excellent teaching

  • @dennismwangi3573
    @dennismwangi3573 2 ปีที่แล้ว

    Helpful explanation.

  • @maheshsharma6521
    @maheshsharma6521 3 ปีที่แล้ว

    Very well explained Krish

  • @Anwerkhursheedofficial
    @Anwerkhursheedofficial 3 ปีที่แล้ว

    superb well explained appreciated

  • @moncykurien18
    @moncykurien18 4 ปีที่แล้ว

    Hi Krish, Great video. Thank you very much. Can you please do a video on Z-test vs T-test?

  • @pavankumar-fc1pb
    @pavankumar-fc1pb ปีที่แล้ว

    Sir i have work shop and data analysis we have to do null hypothesis, histogram in excel could explain me or any related videos

  • @adarshtiwari6742
    @adarshtiwari6742 4 ปีที่แล้ว +5

    Oh my god Krish got angry 7:02😂😂😂,jokes apart you are gr8 teacher.

  • @sakthivelnathan8525
    @sakthivelnathan8525 ปีที่แล้ว

    Excellent Teaching. Thanks

  • @rameshkannan1075
    @rameshkannan1075 3 ปีที่แล้ว

    Hi is it possible to do Prediction of default in loan for 1000 entries using excel

  • @nniv1986
    @nniv1986 4 ปีที่แล้ว

    Excellent tutorial

  • @malinyamato2291
    @malinyamato2291 ปีที่แล้ว

    love real whiteboard lessons like yours..... my professors are dull and just run powerpoints during lectures half asleep.

  • @rajasekharreddy7624
    @rajasekharreddy7624 3 ปีที่แล้ว +2

    Hello Krish/ Subscribes of Krish ,
    Kris mentioned T test can be used only for continuous variable in the beginning , but in the end at 11:11 Krish is saying it can be used when we have one continuous variable and one categorical variable( i.e only 2 categories), please reply me i am confused after this krish.

  • @user-ih2xc1dg2c
    @user-ih2xc1dg2c 3 ปีที่แล้ว

    Very helpful thanks

  • @srikanthm1908
    @srikanthm1908 4 ปีที่แล้ว +2

    Explanation was very good. I would like to know if my assumptions mentioned below are valid. Hope you acknowledge this.
    1. select k best can be applied on both classification and regression problems
    2. T-Test can be applied on a categorical feature which has only 2 distinct categories and when sample size is < 30
    3. Z-Test is same as T-Test but is applied when sample size > 30
    4. ANOVA Test is applied to categorical feature which has more than 2 distinct categories
    5. T-test, Z-Test & ANOVA tests are applied only when target has continuous values .
    I.e, when we are working on regression model
    6. Pierson Co-relation Co-eff can be applied only on numerical features. It can be applied between a feature & target and also between features
    If we find 2 features that are not co-related, we can remove one of them.
    7. Co-relation matrix can be applied only on numerical features
    8. Chi sqr test can be applied only on categorical features

    • @mooventhc1686
      @mooventhc1686 4 ปีที่แล้ว +2

      2. T-test applied on one or 2 numerical features.
      t-test and ANOVA work on numerical and continuous values.. yet in classification, we are using dummies the dependent feature(target column). Hence it can be applied.

    • @srikanthm1908
      @srikanthm1908 4 ปีที่แล้ว

      @@mooventhc1686 Thanks much. Correct me again please. T-Test, Z-Test & Anova-Test are used when our target column is having continuous values. I agree. But what should be the type of input feature ? Categorical / Numerical ? On which input feature type T test and ANOVA tests are applied ? Thanks in advance

  • @subhamsaha2235
    @subhamsaha2235 3 ปีที่แล้ว +1

    Q- why we use P=0.05 or 5%?
    A- From experience or we can say from previous experiments we have concluded that from a population about 5% outcome is defective or we can say we have to reject that amount of data that falls within or equal to 5%.

  • @sayakpalit3615
    @sayakpalit3615 3 ปีที่แล้ว +1

    Best explanation.. 👍👍

  • @igniraj
    @igniraj 3 ปีที่แล้ว

    Please send me the link for complete playlist of this series. I need from tutorial 1

  • @febryistyanto1611
    @febryistyanto1611 3 ปีที่แล้ว

    super and great video. it's powerful for me

  • @ArunKumar-sg6jf
    @ArunKumar-sg6jf 4 ปีที่แล้ว

    when there is difference what is impact of it in model