Sample is an example set of a data population. This video should have more views! I have read multiple blogs and sources but couldn’t understand things as easy as explained here. Thank you again Aman! 🙏🏻
Very nice video. Learning points of the video: 1. Test on : One continues features , Hypothesis on : mean , Comes under : One Sample Test , Name of Test : One Sample T-Test , Accept & Rejection hypothesis criteria on what scale comparison : p value 2. Test on : One categorical features - Two subclass , Hypothesis on : proportion between two class , Comes under : One Sample Test , Name of Test : One Sample Proportion Test , Accept & Rejection hypothesis criteria on what scale comparison : p value 3. Test on : Two continues features , Hypothesis on : correlation , Comes under : Two Sample Test , Name of Test : Correlation with T-Test , Accept & Rejection hypothesis criteria on what scale comparisons : correlation & p value 4. Test on : Two categorical features , Hypothesis on : proportion between two class based on other class , Comes under : Two Sample Test , Name of Test : Chi-Square Test , Accept & Rejection hypothesis criteria on what scale comparison : p value 5. Test on : One categorical feature - Two subclass & One continues feature , Hypothesis on : Difference of mean between two class(variance) , Comes under : Two Sample Test , Name of Test : Two Sample T-Test , Accept & Rejection hypothesis criteria on what scale comparison : p value 6. Test on : One categorical feature - More than two subclass & One continues feature , Hypothesis on : Difference of mean between more than two class(variance) , Comes under : Two Sample Test , Name of Test : ANOVA , Accept & Rejection hypothesis criteria on what scale comparison : p value
I have'nt come across any video with a simpler explanation of these basic statistical analyses. Kudos to the tutor for making it so simple to understand!
I am speechless. i am doing Phd . and currently doing course work of phd. our faculty did not taught this.i was restless for months, saw numbes of videos on youtube but nothing solved my problem. one simple video of 9 min solved my every doubt and now i am confident that i can do this. thank you so much sir.
Dude you are awesome. I will have my first data science technical interview tmr since I did my career transition from finance. I hope everything will go on well
Very clear explanations. Just to point out that correlation also has p-value for null hypothesis that there is no correlation. I think its through the F-test (not 100% sure). Correlation coefficient gives us the strength and direction while its p-value gives the confidence in the correlation claim. E,g, the correlation between two data points will be 1, since its a perfect fit, however, p-value will be high since there's no confidence given low sample size. Actually that raises another question linked to this video. When do we use F-test? Anova?
Yes absolutely right abt correlation. F-test is more of generic high level test whereas anova is specific type of procedure that produces f statistics.
I am speechless. i am doing Phd . and currently doing course work of phd. our faculty did not taught this.i was restless for months, saw numbes of videos on youtube but nothing solved my problem. one simple video of 9 min solved my every doubt and now i am confident that i can do this. thank you so much sir.
Thank You Very Much, Sir. It Takes A Lot Of Time And Brainstorming To Summarize These Concepts In Videos Like This. I Have Been Looking For This Video Since Very Long. This Video Removes All Of My Doubts.
Best one among all I watched. Cleared all my doubts. Such a great one, simplified to the core. One small doubt: My data is nominal Vs Nominal 2 sample But more than 2 categories in outcome: Example: Type of placenta Vs Baby weight (took it as nominal) Variable: Type of Placenta ( Normal / abnormal) Outcome variable Birth weight ( Low / Normal / high) I want to see the association of Type of placenta with category of birth weight Which test I can do for it Thank you sir
Perfect summary of the tests! Thanks again Aman. Now sampling : " this is the process of taking a sample of data from the actual population" and answer to your Q) "if we have numbers 1 to 100 can 1 to 10 a sample? A) generally NO because a sample should consist a min of 30 data points.
Thank you so much for these amazing tutorials. I am working on the ANSUR dataset which contains anthropometric measurements. there are two sets of data one for males and one for females. For females, we have around 2000 samples and for males, 4000 samples. There are about 93 features that are the same for both genders. Features such as weight, height, waist circumference and etc. I want to concatenate these two datasets and do some clustering analysis on them means looking at them as just humans and ignoring the gender. I would like to know if I can do that or I need to do some analysis before that to find if there is a significant difference between each feature for different genders. Like comparing waist circumference for males dataset with waist circumference for females dataset. Which test should I apply? and what other things should I consider? Regards
Both ways possible, either u combine data first then analyze or vice versa. Coming to tests, many test can be performed depending on my variable type. For example hypothesis related to waist circumference can be done using anova
Man you just explained it sooooooo well wow… best explanation on utube i spent so many time looking and wasting time not getting it but you just killed it ❤
Very well explained sir, just 1 confusion difference between cat.l to cont. And cont to cont. I.e. when to apply two sample t test and logistics regression
GOD BLESS YOU..I think I elder to you sir.... excellent.... explanation.... great..you resembles saint RAMANIJACHARYA... who told all secrets to the world inspite of restrictions from the almighty..
Thank you. Easy to understand. Quick question: how would you categorize data that is an answer to a question "how many times a week?". The answer is always a number 1-7. Categorical?
Sir for logistic regression you mentioned, continous as independent, categorical as 'target'? What do you mean by target? What do you mean by independent in this case?
Sampling is taking a subset of individuals,objects or observations selected from a population to estimate characteristics and make inferences for the entire poplulation
If there is two continents variable then we do co-relation coefficient as you said ..can we do t test for two independent continuous variable.... Or is it necessary the continuation variable should be dependent.
Sampling is a technique of selecting subset of population to meke statistical inferences from them and to estimate characteristics of the total population
Access Hindi, English courses here- www.unfolddatascience.com/s/store
Greatest teachers actually teach on humble white boards ❤️❤️❤️❤️thank you sir for this awesome explanation
Sample is an example set of a data population.
This video should have more views! I have read multiple blogs and sources but couldn’t understand things as easy as explained here.
Thank you again Aman! 🙏🏻
Glad it was helpful Yashpal.
Very nice video.
Learning points of the video:
1. Test on : One continues features , Hypothesis on : mean , Comes under : One Sample Test , Name of Test : One Sample T-Test , Accept & Rejection hypothesis criteria on what scale comparison : p value
2. Test on : One categorical features - Two subclass , Hypothesis on : proportion between two class , Comes under : One Sample Test , Name of Test : One Sample Proportion Test , Accept & Rejection hypothesis criteria on what scale comparison : p value
3. Test on : Two continues features , Hypothesis on : correlation , Comes under : Two Sample Test , Name of Test : Correlation with T-Test , Accept & Rejection hypothesis criteria on what scale comparisons : correlation & p value
4. Test on : Two categorical features , Hypothesis on : proportion between two class based on other class , Comes under : Two Sample Test , Name of Test : Chi-Square Test , Accept & Rejection hypothesis criteria on what scale comparison : p value
5. Test on : One categorical feature - Two subclass & One continues feature , Hypothesis on : Difference of mean between two class(variance) , Comes under : Two Sample Test , Name of Test : Two Sample T-Test , Accept & Rejection hypothesis criteria on what scale comparison : p value
6. Test on : One categorical feature - More than two subclass & One continues feature , Hypothesis on : Difference of mean between more than two class(variance) , Comes under : Two Sample Test , Name of Test : ANOVA , Accept & Rejection hypothesis criteria on what scale comparison : p value
Amazing. Thanks a lot for making me believe that it was understood. Keep learning.
I have'nt come across any video with a simpler explanation of these basic statistical analyses. Kudos to the tutor for making it so simple to understand!
Glad it was helpful Iktej.
I am speechless. i am doing Phd . and currently doing course work of phd. our faculty did not taught this.i was restless for months, saw numbes of videos on youtube but nothing solved my problem. one simple video of 9 min solved my every doubt and now i am confident that i can do this. thank you so much sir.
Dude you are awesome. I will have my first data science technical interview tmr since I did my career transition from finance. I hope everything will go on well
You just saved my life with this assignment! Thank you!
Thanks a lot.
The people who reacted dislikes came here for Gym videos for abs. Awesome explanation bhai.
Thanks Soumya :)
Absolutely right 😂
Nice explanation sir
I have seen many videos but your explanation is simple and easy to understand. Thank you very much.
Welcome Ramu.
This is first time I got to know which test is applicable in which situation.
Thank you so much sir.
Please make more videos on statistics
Like PCA
Keep watching
Very clear explanations.
Just to point out that correlation also has p-value for null hypothesis that there is no correlation.
I think its through the F-test (not 100% sure).
Correlation coefficient gives us the strength and direction while its p-value gives the confidence in the correlation claim.
E,g, the correlation between two data points will be 1, since its a perfect fit, however, p-value will be high since there's no confidence given low sample size.
Actually that raises another question linked to this video.
When do we use F-test? Anova?
Yes absolutely right abt correlation. F-test is more of generic high level test whereas anova is specific type of procedure that produces f statistics.
Beautiful... Only one word... Can't thank you enough brother... I thank god that I found you...
Wow, thank you!
I am speechless. i am doing Phd . and currently doing course work of phd. our faculty did not taught this.i was restless for months, saw numbes of videos on youtube but nothing solved my problem. one simple video of 9 min solved my every doubt and now i am confident that i can do this. thank you so much sir.
Thanks Saloni. I put your comment on my LinkedIn. Your comments are precious.
@@UnfoldDataScience thank you sir
Most simplest explanation of these topics I have ever studied. Thnx so much
Thank You Very Much, Sir.
It Takes A Lot Of Time And Brainstorming To Summarize These Concepts In Videos Like This. I Have Been Looking For This Video Since Very Long. This Video Removes All Of My Doubts.
Best one among all I watched. Cleared all my doubts. Such a great one, simplified to the core.
One small doubt:
My data is nominal Vs Nominal
2 sample
But more than 2 categories in outcome:
Example:
Type of placenta Vs Baby weight (took it as nominal)
Variable: Type of Placenta ( Normal / abnormal)
Outcome variable
Birth weight ( Low / Normal / high)
I want to see the association of Type of placenta with category of birth weight
Which test I can do for it
Thank you sir
Chi square
Very very good video. Excellent explanation. As a Data Scientist, I can only admire this explanation.
Thanks for your kind words 😊
Excellent lecture sir. Helpful for Ph.d scholars. Very helpful. Easy to understand
Thank you
very well explained even a layman can understand why these test is used for. Thanks a Lot brother.
All the years in my undergrad...it finally makes sense, thank you!
studying for media data science subject from korea university here :') thank you so much for your video!!
Thank you so much bhai..... Specially for explaining so clearly that what is the difference between one sample and two sample..
Welcome Ravi.
Perfect summary of the tests! Thanks again Aman. Now sampling : " this is the process of taking a sample of data from the actual population" and answer to your Q) "if we have numbers 1 to 100 can 1 to 10 a sample? A) generally NO because a sample should consist a min of 30 data points.
Thanks Santhosh. my follow up question - If I tell you to take 30 sample from above data, can u take number 1 to 30? :)
@@UnfoldDataScience no aman, these 30 should be randomly selected from the population.
Very precise and clear explanation, thank you sir, would love to watch more videos on subject....Beautiful presentation
Thanks a lot.
one of the best videos covering these topics.
Sample/samples are a part of researching masses which is an ideal representation of the masses
Thank you so much for these amazing tutorials. I am working on the ANSUR dataset which contains anthropometric measurements. there are two sets of data one for males and one for females. For females, we have around 2000 samples and for males, 4000 samples. There are about 93 features that are the same for both genders. Features such as weight, height, waist circumference and etc. I want to concatenate these two datasets and do some clustering analysis on them means looking at them as just humans and ignoring the gender. I would like to know if I can do that or I need to do some analysis before that to find if there is a significant difference between each feature for different genders. Like comparing waist circumference for males dataset with waist circumference for females dataset. Which test should I apply? and what other things should I consider? Regards
Both ways possible, either u combine data first then analyze or vice versa.
Coming to tests, many test can be performed depending on my variable type. For example hypothesis related to waist circumference can be done using anova
Brilliant explaining techniques bro. God blessed you.
Man you just explained it sooooooo well wow… best explanation on utube i spent so many time looking and wasting time not getting it but you just killed it ❤
Thank you 😇
Thanks so much. I need more from you ....... on ordinal, multinominal and poison logistic regression...
BEST EXPLANATION EVER .. THANK YOU MAN
My pleasure, please share with friends as well.
You are explaining so nicely which many people are not doing. Keep it up
Thanks Ram. Hope you are doing well
Best statistics video on TH-cam 🔥
Appreciate that comment Nikhil
This was so helpful! Thank you so much!
very good and simple explanations. Thanks a lot
Thank you
Very simple and effective Aman
Wonderful and simplified explanation, i needed this..
Wow! What a great explanation. I Understood everything. Thank you.
Thanks Carol.
A VERY NICE VIDEO WITH SUCH AN EASY EXPLANAION
Thanks Raj.
Best explanation I have found, thank you so much
Very simply and nicely explained.
Thank you very much for this clear and crisp explanation. It has been extremely helpful to understand the concept :)
Glad it was helpful Sithara. Thanks for watching.
Awesome explanation. Got what I came for. ❤
This is a world class video. Thank you so much.
Glad you found it helpful 😊
Thank you so so so much sir!
Amazing to see your method of teaching. Tomorrow is my exam and this is indeed helpful
All the best Shruti. Hope exam was good.
You simplified everything about all tests...Thank you so much🙂
Welcome
Thanks for giving all such good informations
Welcome.
Amazing , now gets clear
Very well explained sir, just 1 confusion difference between cat.l to cont. And cont to cont.
I.e. when to apply two sample t test and logistics regression
crisp and clear explanation !!!!!!!!!
Glad you liked it Akshay.
GOD BLESS YOU..I think I elder to you sir.... excellent.... explanation.... great..you resembles saint RAMANIJACHARYA... who told all secrets to the world inspite of restrictions from the almighty..
So nice of you. Thanks a lot.
Sir, your way of clearing concept is another level 🙏
Thanks Rohit, plz share with others as well
thank you for your videos.
I have a difficulty in which statistical test is best for new food product development project.
Glad it was helpful!
superb man , very easily understood. appreciate it
Thank you so much Aman . You videos are very clear and able to understand very easily.
So nice of you Lokesh.
Awesome explanation...Thank you a lot for all these videos...
Thanks Sandipa.
Thanks sir amazing .. simple and to the point explanation 🔥🔥
Thanks Aditya.
You are a good teacher
Very well explained and to the point. Thank you so much.
Very wonderful discussion. Thank you so much.
Great Video . Thanks
Amazing explanation! Well done!
Glad you liked it Tanmay
I have been enlightened, thank you
Thank you so much. best video on subject
Thanks Temiye.
This is great! So clear and precise !
Thanks a lot :)
Right
Great explanation
Thanks Aman
Thik is awesome outstanding
Thanks Annu.
Thank u greatly. You really cleared all my confusion.
Sir can we use one way anova for one categorical having two classes instead t test??
Thanks Rita. YEs we can.
Very well explained
Thanks Mousami.
Excellent explanation in simple words
Thanks Monika.
Thank you. Easy to understand. Quick question: how would you categorize data that is an answer to a question "how many times a week?". The answer is always a number 1-7. Categorical?
Yes Categorical , I think
Should be treated numerical I think, because its a count data, What do u guys think?
Sir for logistic regression you mentioned, continous as independent, categorical as 'target'? What do you mean by target? What do you mean by independent in this case?
Hi Akshay,
independent variable is also known as "features or predictors"
dependent variable is also known as "target or response"
Very nice explanation...I hope you will upload the next video very soon..
Welcome Susmit. Sure. Happy Learning.
To the point. Hats off.
Thanks Ankur.
excellent explanation sir
Thanks a lot.
Great overview brother ❤️ Really great 👍
Thanks a ton
nicely explain ,thanks a lot sir
Welcome Nishant.
Best tutorial sirji
Explained very well Sir
You are a blessing 🙌
Thank you for this simple explanation. Clears all my questions now. I have subscribed and will follow up with your videos. You rock 💪👏
Glad to be helpful. Thanks for your positive feedback 😊.
This is super informative 👏
Thanks Samira.
Thank you...can you please make video on difference in correlation and regression
Thanks Neha, I have videos on correlation, covariance and regression, pls search it in my channel u will get ur doubts clarified.
Thank you so much. Very well explained. This was very helpful 😊
Nice explanation
Thanks Akshita.
Thank you so much, you really unfolded it
Glad you found it helpful 😊
Good explanation brother...thanks a lot ...
Welcome.
please make videos on A/B testing.
nice explanation sir what is difference between t test and student t test and how to justify their use
Thanks Jawed.
awesome clarity!!!
Glad it was helpful! pls share with friends.
Sampling is taking a subset of individuals,objects or observations selected from a population to estimate characteristics and make inferences for the entire poplulation
If there is two continents variable then we do co-relation coefficient as you said ..can we do t test for two independent continuous variable.... Or is it necessary the continuation variable should be dependent.
Thanks for the simplified explanation, keep making more such content
Thank you Vineet, I will
I m really impressed...I will be waiting for more of ur videos.... awesome explaination ❤️❤️
Thank you so much 😀
Nice explanation...
Thanks Sumesh.
Wow nice video
Thanks Subhash, happy learning. Tc
Welll explained sir
Thank you! This is very helpful.
Welcome Archana
Thank you for your explanation
Sampling is a technique of selecting subset of population to meke statistical inferences from them and to estimate characteristics of the total population