"How normal distribution is used in data analysis?" - All I needed was this answer. N it took me 1.5 months to get a proper answer. I got bombarded everywhere with theories no one explained how to use it.. Thanks man. I loved your teaching style. This is the problem, we r using this to solve this problem. Now let's talk about what this is.. this should be the way to teach.
Every student deserves a teacher like you and we are all fortunate to learn from you...the way you explain concepts with practical applications is just awesome 👏 Thank you 🙏
Dear Dhaval, I wish you will be finding the person same like you for all your problems. Such a brilliant explanation, simply the best so far, love you man!!!
Learnt for the first time how normal distribution can be used for outlier removal. Thanks so much for explaining the concept and why and how it's used. I like your simple python code to explain the usage of the concept. Great teaching skills!
Great content!! thank you!! If the playlist is in below format it will be more useful. Descriptive Statistics: Mean, Median, Mode: Understanding central tendency. Range, Variance, Standard Deviation: Measuring data dispersion. Percentiles, Quartiles: Analyzing data distribution. Inferential Statistics: Hypothesis Testing: Formulating and testing hypotheses about population parameters based on sample data. Confidence Intervals: Estimating the range within which a population parameter is likely to fall. Significance Levels: Understanding and setting significance levels for hypothesis tests. Probability Distributions: Normal Distribution: Understanding and working with the normal distribution. Binomial Distribution: Relevant for analyzing binary outcomes. Poisson Distribution: Used for counting events over a fixed interval. Correlation and Causation: Correlation Coefficient: Measuring the strength and direction of a linear relationship between two variables. Causation Awareness: Understanding that correlation does not imply causation. Statistical Tests: T-Tests: Comparing means of two groups. Chi-Square Test: Analyzing associations between categorical variables. ANOVA (Analysis of Variance): Comparing means of more than two groups.
A Great Explanation of this concept , eventhough there are thousands of videos in youtube , this is very unique and clear video for explaining statistics and Standard deviation on data Science... Thanks A Lot !!!!
I have watched many videos to understand Z-score in real time data, no one explained this way. I never forget the concept in lifetime. Subscribed the channel and never miss your updates. Hats off to you!
- Gracias por estos vídeos, son píldoras de información valiosa para los que hemos comenzado en este mundo, ya sea por la estadística o el código aplicado. - Thanks for these videos, they are pills of valuable information for those of us who have started in this world, either because of the statistics or the applied code.
In this example we have calculated mean with outliers part of calculation. Outliers at first affect any measure of central tendency. Normal distribution is used to find the zo score and the associated area under curve as a Probablity.
Really good explanation but when i tried to perform the exercise, i was blank because of the approach for getting 1st question. I kept on following how it was did in video but it was totally different. All i can is thank you providing such exercises, it really helps to learn new things from errors.
Thank You very much sir! I am following your data analysis roadmap and your videos are really helpful to learn MAD, SD , Bell Curve , Z score such interesting topics. Now I can feel that ya Learning Statistics is really fun
Please make more videos in which theory + practical both should available with more explanation....& thanks a lot for making these type of videos...Really it's very helpful for us.
The very idea of creating a stat and math series is brilliant. Coding without any understanding of the underlying concept is a bit meaningless and frankly not reliable. And you explain well.
Thank you sir for creating so informative videos that not only are inspiring us to learn theoretical concepts but also do hands on practice by using your notebooks :) I tried using histplot but was getting error, updated my seaborn library to version 0.11.1 but still faced the same error. I was able to plot the graph by using distplot.
hello, @codebasics.. this is amazing stats series as you do always, pls keep this series continuous and update videos as early as possible i'm eagerly waiting for the next videos
sir, first, how r u? I hope well. I commented many times in different videos but never had any feedback! anyway, I hope to have a response this time. my comments r: 1. Thank you for the great explanation, you really make life easy 2. regarding the most interesting part (EXERCISES), why do you solve them in different methods rather than the ones you taught us with? and without explanations! e.g. in the teaching video, u used sn.histplot while in the exercise solution u used plt.hist with many parameters like bins=20, rwidth=0.8, density=True, all without telling us why you chose to use them!!! this is an example of many new codes that I found in the solution that were not been used in your tutorial. 3. regarding the 3 methods that u used to remove the outliers, I don't know if it's a mistake or if u meant to do it(again I don't know because you don't explain to us what u did or y u did it), but u removed the outlier first time with percentile and created a new dataframe called df2, which u used it to remove the outlier once again using the method 4 standard deviations!!! (2) Now remove outliers using 4 standard deviation max_limit = df2.price_per_sqft.mean() + 4*df2.price_per_sqft.std() min_limit = df2.price_per_sqft.mean() - 4*df2.price_per_sqft.std() max_limit, min_limit again you used df2 to remove the outleir using Z score: (3) Now remove outliers using z score. Use z score of 4 as your thresold df2['zscore'] = (df2.price_per_sqft-df2.price_per_sqft.mean())/df2.price_per_sqft.std() df2.sample(10) I think u supposed u use the original df and not df2, right?
Yahia, many times when I don't have time my viewers help me with solution exercises. They create a pull request which I accept on my GitHub after a review. You can also contribute if you wish to and feel that a better solution is available. As for your specific questions, I need to check it out. I have more than 500 videos and don't remember what I said in a specific video. I am very busy at the moment but if I find time I will look into this. Hope you understand my situation
Hi , exolaination is very nice , am having 3 yrs experience in java domain and now am learning data science , can you recommend me any projects so that i can add that to my resume.
It's a really nice explanation. You have shifted to maths after 7 videos in this playlist. That's all the stats we need for data science? OR you are covering only for beginners? I am a beginner and I want to know if I need to know more than your playlist. Thanks.
Hello Sir, thank you for your very good videos. I had a bit of trouble understanding where the 0.61 Standard Deviation came from when calculating Z score. So I tried to calculate it myself and i found 0.57, I'm sure I followed the steps correctly, please let me know if not the case. I did each data point minus the average (5.25) then squared it. Then i found the mean of all those by adding them up together and dividing them by 9 (because there are 9 data points). I found 0.335 for this. Then I found the square root which was 0.57 This is how you calculate standard deviation right? I am confused why I did not find 0.61 like you did.
Thank you , you are an amazing teacher. QQ: If standard deviation and Z-score are similar; why do we even use a Z score? What problem does a Z-score solve that standard deviation doesn't?
"How normal distribution is used in data analysis?" - All I needed was this answer. N it took me 1.5 months to get a proper answer. I got bombarded everywhere with theories no one explained how to use it.. Thanks man. I loved your teaching style. This is the problem, we r using this to solve this problem. Now let's talk about what this is.. this should be the way to teach.
true
Blown away by how easily you explained such concepts, which I scared me earlier.
I am happy this was helpful to you.
You left Dunder Mifflin for data science ?
@@shobhitsadwal7972 They kept calling me assistant "to" manager, so had to quit :(
@@dwightschrute1588 You should have joined Athlead then :)
I just because I never seen a explanation like this
Your dedication and impact as a teacher deserve global recognition.
Every student deserves a teacher like you and we are all fortunate to learn from you...the way you explain concepts with practical applications is just awesome 👏 Thank you 🙏
It's my pleasure, I am happy this was helpful to you.
@@codebasics Looking forward to see ur videos 😁
Really, he is very good.
@@codebasics sir , data set not uploaded here
The way you made understood z-score concept by showing the graphic at first and then explaining formula made concept too easy. 👏
Dear Dhaval, I wish you will be finding the person same like you for all your problems. Such a brilliant explanation, simply the best so far, love you man!!!
Learnt for the first time how normal distribution can be used for outlier removal. Thanks so much for explaining the concept and why and how it's used. I like your simple python code to explain the usage of the concept. Great teaching skills!
Glad it was helpful!
Great content!! thank you!!
If the playlist is in below format it will be more useful.
Descriptive Statistics:
Mean, Median, Mode: Understanding central tendency.
Range, Variance, Standard Deviation: Measuring data dispersion.
Percentiles, Quartiles: Analyzing data distribution.
Inferential Statistics:
Hypothesis Testing: Formulating and testing hypotheses about population parameters based on sample data.
Confidence Intervals: Estimating the range within which a population parameter is likely to fall.
Significance Levels: Understanding and setting significance levels for hypothesis tests.
Probability Distributions:
Normal Distribution: Understanding and working with the normal distribution.
Binomial Distribution: Relevant for analyzing binary outcomes.
Poisson Distribution: Used for counting events over a fixed interval.
Correlation and Causation:
Correlation Coefficient: Measuring the strength and direction of a linear relationship between two variables.
Causation Awareness: Understanding that correlation does not imply causation.
Statistical Tests:
T-Tests: Comparing means of two groups.
Chi-Square Test: Analyzing associations between categorical variables.
ANOVA (Analysis of Variance): Comparing means of more than two groups.
Was stuck with something similar for past 10 days , u solved it
Hats off guruji
A Great Explanation of this concept , eventhough there are thousands of videos in youtube , this is very unique and clear video for explaining statistics and Standard deviation on data Science... Thanks A Lot !!!!
Thank you !
not even professors could take time in explaining concepts in practical way
thanks for your time !
I have watched many videos to understand Z-score in real time data, no one explained this way. I never forget the concept in lifetime. Subscribed the channel and never miss your updates. Hats off to you!
Thanks for the feedback Deepthi 👍
What a great teacher... not only easy to follow by his guidance but he shows you other ways of doing things along the way which is great for learning!
glad you liked it :)
You really make complex topics so simple and easy to understand. You are a great mentor 🙏
👍🙏🙏
- Gracias por estos vídeos, son píldoras de información valiosa para los que hemos comenzado en este mundo, ya sea por la estadística o el código aplicado.
- Thanks for these videos, they are pills of valuable information for those of us who have started in this world, either because of the statistics or the applied code.
Couldn't figure out the use of normal distribution in data analysis.. finally got my answer.. thank you so much
In this example we have calculated mean with outliers part of calculation. Outliers at first affect any measure of central tendency. Normal distribution is used to find the zo score and the associated area under curve as a Probablity.
Excellent first I tried to unstd the nd in Google in many websites couldn't unstd but your video tells inch by inch info about nd.....thank u man
Thanks for the excellent explanation ❤❤🎉
Your explanation of the topics is good.
This video made my day, you made this so simple now I understand the concept of STD Deviation & Normal Distribution. Thankyou Dhaval Sir:):)
Thanks Vaibhav.
Really good explanation but when i tried to perform the exercise, i was blank because of the approach for getting 1st question. I kept on following how it was did in video but it was totally different. All i can is thank you providing such exercises, it really helps to learn new things from errors.
Glad it was helpful!
Love how articulated are the contents. Waiting eagerly for the upcoming videos on this topic.
Very soon!
@@codebasics thank you so much
Z score concept easily understood. With your examples. I am using R for analysis. To learn the statistics concept ony i watch your video.
I am happy this was helpful to you.
Very clear to understood the concept and how and where to apply in Data Science , Thanks a Lot
Glad it was helpful!
Thanks for explaining in a simple manner and with a real time example. Got a clear understanding about statistics concepts .
Glad it was helpful!
@@codebasics Yes sir
What ah explanation man really👌👌. now i got confidence...that i can also start my career in data science
This video is so usefull for me i m bad in stat but the way of teaching u did that was awesome , thank you so much
Glad it helped
I couldn't wait for further videos. How neatly you are explaining 🥳
More to come!
Excellent explanation. Simple & Easy understanding.
Glad you liked it
The way you explain is just amazing
Glad it was helpful!
This is really helpful for my statistics course 🙌🏼
Glad it was helpful!
Sir, you are so good at learning and teaching. Please make video about your learning methods/strategies. 🙏
Very informative video with nice explanation ❤
You have really explanied in a very unique way, loved your teaching!!
Thank You very much sir! I am following your data analysis roadmap and your videos are really helpful to learn MAD, SD , Bell Curve , Z score such interesting topics. Now I can feel that ya Learning Statistics is really fun
Well done sir! Super helpful, A+ teaching ability!
Glad it was helpful!
All i can say is thank you so much for the hands on.
Thanku so much sir, you are my mentor my datascince path
Explanation of the concept is so simple that any one can easily understand.waiting for the next video 👍
I am happy this was helpful to you.
You are one fantastic teacher
Amazing video for normal and z-score, thanks
Thank you very much for the simple explanation on complex topics. Really help people looking to upgrade skill !!!
Very nice explanation with practice and theory.. Really great one
Glad it was helpful!
Please make more videos in which theory + practical both should available with more explanation....& thanks a lot for making these type of videos...Really it's very helpful for us.
I will try my best. I am happy this was helpful to you.
your class is just awsome
Sir, you are amazing please make a tutorial on the hypothesis testing
Yes please!!!! that is also very confusing for me too...
The very idea of creating a stat and math series is brilliant. Coding without any understanding of the underlying concept is a bit meaningless and frankly not reliable. And you explain well.
Thank you sir for creating so informative videos that not only are inspiring us to learn theoretical concepts but also do hands on practice by using your notebooks :)
I tried using histplot but was getting error, updated my seaborn library to version 0.11.1 but still faced the same error.
I was able to plot the graph by using distplot.
I am following all your videos!! Thanks for being such a nice teacher!!
Happy to hear that!
hello, @codebasics..
this is amazing stats series as you do always,
pls keep this series continuous and update videos as early as possible
i'm eagerly waiting for the next videos
Man you are a genius 🙏
sir, You are the BEST!
Best Explanation, Thank You
Excellent explanation again!!
Thank you very much sir for your amazing videos
Simple and Clear .. thanks a lot !!
Glad you liked it!
Thank you for sharing this content sir
Nice Explanations!
superb explanation...
Thank you for your amazing work
Nicely explained sir.
Glad it was helpful!
These videos are so helpful
Glad it was helpful!
Love your videos!! Very educational and well done. Thanks!
Glad it was helpful!
Nice explanation of topic. Thank you
Glad you liked it
very very helpful and easily explain . please sir make a full playlist of mathematics and statistics required in data science
Glad it was helpful!
Awesome explanation
Glad it was helpful!
Sir nice video.very helpful
amazing teacher air
Glad it was helpful!
Ur the Master (can u do vedio life of working in day (data scientist vs data analyst)
Excellent Videos.
Glad you like them!
Tq so much sir... Hoping more videos on metrics and math in algorithms.
Keep watching
@@codebasics sure sir.
Very Informative.. Great explanations.. when is the next video.
Very soon, Glad it was helpful!
sir,
first, how r u? I hope well.
I commented many times in different videos but never had any feedback!
anyway, I hope to have a response this time.
my comments r:
1. Thank you for the great explanation, you really make life easy
2. regarding the most interesting part (EXERCISES), why do you solve them in different methods rather than the ones you taught us with? and without explanations! e.g. in the teaching video, u used sn.histplot while in the exercise solution u used plt.hist with many parameters like bins=20, rwidth=0.8, density=True, all without telling us why you chose to use them!!! this is an example of many new codes that I found in the solution that were not been used in your tutorial.
3. regarding the 3 methods that u used to remove the outliers, I don't know if it's a mistake or if u meant to do it(again I don't know because you don't explain to us what u did or y u did it), but u removed the outlier first time with percentile and created a new dataframe called df2, which u used it to remove the outlier once again using the method 4 standard deviations!!!
(2) Now remove outliers using 4 standard deviation
max_limit = df2.price_per_sqft.mean() + 4*df2.price_per_sqft.std()
min_limit = df2.price_per_sqft.mean() - 4*df2.price_per_sqft.std()
max_limit, min_limit
again you used df2 to remove the outleir using Z score:
(3) Now remove outliers using z score. Use z score of 4 as your thresold
df2['zscore'] = (df2.price_per_sqft-df2.price_per_sqft.mean())/df2.price_per_sqft.std()
df2.sample(10)
I think u supposed u use the original df and not df2, right?
Yahia, many times when I don't have time my viewers help me with solution exercises. They create a pull request which I accept on my GitHub after a review. You can also contribute if you wish to and feel that a better solution is available. As for your specific questions, I need to check it out. I have more than 500 videos and don't remember what I said in a specific video. I am very busy at the moment but if I find time I will look into this. Hope you understand my situation
@@codebasics
Thank u for ur reply.
I totally understand ur situation.
Good luck and thanks again
Thanks u sir keep make more video related to these topics , Keep it up 😎😎
Keep watching
Great explanations
Glad it was helpful!
Sir, please provide a video on function approximation. Just love your lectures.
Very informative.. thank you
Glad it was helpful!
Hi , exolaination is very nice , am having 3 yrs experience in java domain and now am learning data science , can you recommend me any projects so that i can add that to my resume.
thank you very much, sir. very valuable content.
I am happy this was helpful to you.
Good explanation
Thank you sir ❤
It's a really nice explanation. You have shifted to maths after 7 videos in this playlist. That's all the stats we need for data science? OR you are covering only for beginners? I am a beginner and I want to know if I need to know more than your playlist. Thanks.
Pls. let us know what are lower and upper bound percentiles. Kindly explain the usage of the quantile function used. Thanks.
Got to know about it from the Median, Mean, Mode, and Percentile tutorial video. Thanks so much!
Thank you so much for your amazing contents ❤️
Saras Sir 👌
Thank you
So well explained
Thank you so much for this. Sir, could you please upload some more videos on Statistics? Please
Hero ho tum hero mere liye
Is there any course of you sir
plz add sequence number in every video bcoz that is bit confusing which topic should I learn first... plz do this in the upcoming videos..
if multiple columns are there please tell me how to do apply the normal distribution for removing outliers
Hello Sir, thank you for your very good videos.
I had a bit of trouble understanding where the 0.61 Standard Deviation came from when calculating Z score. So I tried to calculate it myself and i found 0.57, I'm sure I followed the steps correctly, please let me know if not the case.
I did each data point minus the average (5.25) then squared it.
Then i found the mean of all those by adding them up together and dividing them by 9 (because there are 9 data points).
I found 0.335 for this.
Then I found the square root which was 0.57
This is how you calculate standard deviation right? I am confused why I did not find 0.61 like you did.
Print('Thank you ')
Slowly I feel that I can become data analyst
Lv u 3000 brother 😎
god will bless you sir
Thank you , you are an amazing teacher.
QQ: If standard deviation and Z-score are similar; why do we even use a Z score? What problem does a Z-score solve that standard deviation doesn't?
Which normalisation is best when data has positive and negative values..
can't we do a bell curve with matplotlib ?? i just want to learn one data visualisation tool
if u want to learn visualisation learn seaborn