Complete Statistics For Data Science In 6 hours By Krish Naik

แชร์
ฝัง
  • เผยแพร่เมื่อ 8 ก.พ. 2025

ความคิดเห็น • 639

  • @AmanNazare
    @AmanNazare 8 หลายเดือนก่อน +279

    00:08 Covering basic and advanced topics related to data science positions
    02:32 Understanding Statistical Analysis
    07:00 Descriptive stats is the organizing and summarizing of data.
    09:17 Understanding inferential statistics and the difference between population and sample.
    13:37 Different sampling techniques and their importance
    15:56 Stratified sampling is a technique where the population is split into non-overlapping groups.
    19:49 There are two sampling techniques: random sampling and convenient sampling.
    21:41 Sampling techniques may vary based on the use case.
    25:24 There are two types of variables: quantitative and qualitative.
    27:25 Classification of variables based on characteristics, such as IQ and t-shirt size
    31:21 There are four types of measurement wells that include nominal, ordinal, interval, and ratio related data.
    33:15 Ordinal, Interval, and Ratio data types explained
    36:55 Bar graphs and pie charts can be used to represent discrete variables.
    38:52 Histograms are used to represent continuous values through bins.
    42:48 Arithmetic mean is the average of a specific distribution.
    44:43 Mean, Median, and Mode are the three main measures of central tendency.
    48:30 Outliers have a major impact on data distribution
    50:22 The median is a measure of central tendency that is not affected by outliers
    54:19 Mode is used to handle missing values and find the most frequent element.
    56:19 Suitable measure for ages of students
    1:00:18 The calculated value is 1.81
    1:02:20 Variance measures the spread of data.
    1:06:24 Standard deviation and variance are important in understanding data spread.
    1:08:18 Finding outliers and understanding percentiles
    1:12:34 80% of the distribution is less than 10
    1:14:35 The five number summary is used to analyze data and remove outliers.
    1:18:35 Compute the lower fence and the higher fence values.
    1:20:41 The 5 number summary for the given data is: 1, 3, phi, 7, 9.
    1:24:50 Summary of Statistical Distributions
    1:26:55 Distributions are a way to visualize continuous data.
    1:30:50 The normal distribution is important for deriving conclusions.
    1:32:47 Empirical formula helps in understanding the distribution of data
    1:36:45 4.75 falls 0.75 standard deviation to the right of the mean
    1:38:47 The z-score helps calculate standard deviations and their positioning on a bell curve.
    1:42:51 Convert data into standard normal distribution using z-score
    1:44:58 Standardization and normalization are two different processes used for data conversion.
    1:49:03 The average score of Rishabh Pant in the series was 260.
    1:51:02 The average score of the series is -1.25
    1:55:29 The standard deviation of the scores indicates the distribution pattern.
    1:57:29 The main question is the percentage of scores that fall above 4.25.
    2:01:19 Z table shows area to the right of the curve
    2:03:27 The left and right areas can be calculated by subtracting from the mean and standard deviation.
    2:07:26 Compute the z score and find the area under the curve.
    2:09:21 Understanding body area symmetry and how to compute mean
    2:13:18 The data distribution does not follow a Gaussian distribution.
    2:15:40 Discussing topics on IQR, probability, permutation and combination, confidence intervals, p-value, and hypothesis testing.
    2:19:30 Implementing the outliers detection function using the z-score formula in Python.
    2:21:26 Finding outliers using z-score formula
    2:25:19 Outliers can be identified using z-score and interquartile range (IQR).
    2:27:18 Find the lower and upper fence using q1 and q3 respectively.
    2:31:17 Box plot creation and importance of probability in machine learning
    2:33:05 Probability can be defined as the number of ways an event can occur divided by the number of possible outcomes.
    2:36:48 Multiple events can occur at the same time, such as drawing a king or a queen from a deck of cards.
    2:38:42 Probability of mutually exclusive and non-mutually exclusive events
    2:42:38 Understanding probability concepts: addition rule and multiplication rule
    2:44:35 Events are independent and do not impact each other
    2:48:22 Probability of drawing a queen and an asus from a deck of cards
    2:50:18 Conditional probability helps in biased theorem
    2:54:09 Permutation and Combination in Mathematics
    2:56:04 P-Value
    3:00:05 The coin is fair.
    3:02:07 Hypothesis testing involves four steps: proof, fairness of coin, alternative hypothesis, and experiment
    3:06:17 The significance value of 0.05 is used to determine if a coin is fair or not.
    3:08:18 The coin is not fair.
    3:12:04 Type 1 and Type 2 errors in hypothesis testing
    3:13:58 Rejecting the null hypothesis can be a good or bad decision depending on whether the alternate hypothesis is true or false
    3:17:52 Outcome four is accepting the null hypothesis when it is true.
    3:19:44 One-tailed and two-tailed test explained.
    3:23:38 The experiment is conducting a two-tailed test on the placement rate of a college.
    3:25:30 Confidence interval is important in statistical analysis
    3:29:14 Confidence intervals help determine the range around the population mean
    3:31:17 Construct a 95% confidence interval about the mean 520.
    3:35:16 The upper bound of the confidence interval is 12947.52
    3:37:18 The confidence interval for the average size of sharks throughout the world is 520 with a lower bound of 480.8 and an upper bound of 559.2.
    3:41:24 Population standard deviation is not given, so we use t test.
    3:43:33 Compute the lower bound and upper bound using the sample variance problem and t-table.
    3:47:45 Researchers want to test a new medication to see its effect on intelligence.
    3:49:45 The hypothesis test is a two-tailed test with a confidence interval of 95%.
    3:53:38 Standard error is calculated by dividing the standard deviation by the square root of the sample size.
    3:55:31 The mean is not equal to 100.
    3:59:44 T-test is used to compare means of two groups
    4:01:51 The t value is greater than the decision rule, indicating the rejection of the null hypothesis.
    4:06:22 Chi square test is a non-parametric test performed on categorical or ordinal data.
    4:08:30 In 2010, the distribution of ages in a small town has changed compared to 2000 census.
    4:12:34 The observed distribution of the population is less than 18: 20%, 18 to 35: 30%, and greater than 35: 50%.
    4:14:40 There is a huge difference between the observed data and the expected distribution based on the 500 samples.
    4:18:37 The chi square test statistic is 232.94, which is greater than 5.99.
    4:20:48 Performing z test using Python to determine the significance of a new drug on IQ level
    4:24:34 Covariance and significance value
    4:26:37 The significance level determines whether to accept or reject the null hypothesis.
    4:30:29 Positive correlation between x and y when x is increasing y is also increasing, negative correlation when x is decreasing y is also increasing, no relationship between x and y when covariance is 0
    4:32:17 Covariance and Pearson correlation restrict values between -1 and +1
    4:36:00 Covariance and correlation capture the linear properties, but Spearman rank correlation also captures non-linear properties.
    4:37:51 The formula for calculating the Spearman rank correlation
    4:41:48 Performing a one-sample t-test with a sample size of 10 to determine if the mean is close to the population mean.
    4:43:52 The example demonstrates the changes in values based on different scenarios.
    4:47:42 The results should not be rejected as the p-value is extremely low.
    4:49:50 Reject the null hypothesis if p value is less than or equal to 0.05.
    4:53:55 Discussing various distributions and their significance
    4:55:57 The mean weight of 36 individuals is 169.5 pounds.
    5:00:03 The area under the curve is 0.99 triple 1
    5:02:02 The calculated p-value is 0.0089.
    5:06:11 The z-score is 2.30, indicating rejection of the null hypothesis.
    5:08:28 The average age of a college is 24 years with a standard deviation of 1.5.
    5:12:29 The p-value is significantly smaller than the significance value, indicating rejection of the null hypothesis.
    5:14:35 Bernoulli distribution is a probability distribution with two outcomes: 0 or 1.
    5:18:10 Probability Mass Function (PMF) explained for categorical variables
    5:19:57 Binomial and Pareto distributions are important in statistics.
    5:23:49 Log-normal distribution and its relationship with power law distribution
    5:25:39 The distribution follows a Pareto distribution and can be converted to a normal distribution using the central limit theorem.

    • @AnnuMad-si5kw
      @AnnuMad-si5kw 4 หลายเดือนก่อน +1

      Thanks buddy

    • @av5431
      @av5431 4 หลายเดือนก่อน +6

      Bro you crazy.Thanks a lot

    • @itzmyshorts9192
      @itzmyshorts9192 4 หลายเดือนก่อน

      tqsm

    • @adnanmalik3877
      @adnanmalik3877 2 หลายเดือนก่อน

      Thanks

    • @LorenFoister
      @LorenFoister หลายเดือนก่อน

      Excuse me, could you lend a hand with my problem? My USDT TRX20 is in a wallet, secured with the phrase (clean party soccer advance audit clean evil finish tonight involve whip action). How can I move it to OKX?

  • @ShauriePvs
    @ShauriePvs 2 ปีที่แล้ว +518

    Sir not only took pain to remove unnecessary parts, he also sped up video a little to save students' time...Hats off

    • @lazydamsel
      @lazydamsel ปีที่แล้ว +1

      Is this good or bad?

    • @ShauriePvs
      @ShauriePvs ปีที่แล้ว +5

      @@lazydamsel obviously good

    • @lazydamsel
      @lazydamsel ปีที่แล้ว +1

      @@ShauriePvs cool cool. I'll watch. Thanks!

    • @dhawalgore9338
      @dhawalgore9338 ปีที่แล้ว

      how can I get the notes. It states unable to render code on Github.@@ShauriePvs

  • @156_____11
    @156_____11 ปีที่แล้ว +63

    Best statistics course ever. I was looking for statistics courses for ML that explained concepts in a way that didn't drag on, and gave examples easy for high schoolers to understand. Thank you sir!

    • @LorenFoister
      @LorenFoister หลายเดือนก่อน

      Hello, could you kindly help me out with this situation? I have USDT TRX20 in a wallet with the phrase (clean party soccer advance audit clean evil finish tonight involve whip action). Could you explain how to move it to OKX?

  • @apudas6946
    @apudas6946 10 หลายเดือนก่อน +27

    Till now I have completed 3 hrs of this video & it is extremely helpful.
    Krish Sir, I must say that you know the art of explaining complex thing in simplest way. Thank for making this kind of helpful videos.

    • @LorenFoister
      @LorenFoister หลายเดือนก่อน

      Hi, would it be possible for you to assist me with this problem? my OKX wallet holds USDT TRX20, and my phrase is (clean party soccer advance audit clean evil finish tonight involve whip action). How can I move it to OKX?

  • @nikeshthorat1613
    @nikeshthorat1613 ปีที่แล้ว +43

    4:26:50 : Key point to note, if P-value

    • @AmarSharma60436
      @AmarSharma60436 ปีที่แล้ว +2

      No,
      If the p-value is greater than alpha, you accept the null hypothesis. If it is less than alpha, you reject the null hypothesis.

    • @sajankumarkar8237
      @sajankumarkar8237 ปีที่แล้ว +13

      Exactly this, he got it very confused in the video. For other folks who are confused:
      at 95% confidence, alpha = 0.05,
      at 90% confidence, alpha = 0.1, this is within the confidence interval if alpha = 0.05
      Looking at this, if alpha is 0.05, then a value > 0.05 (cuz 0.1 > 0.05) will fall within the confidence interval.
      So, p>alpha implies that it lies within the confidence interval, so we accept the null hypotheses.
      p

    • @aryansinha557
      @aryansinha557 ปีที่แล้ว +3

      @@sajankumarkar8237 yeah you are right he got it confused

    • @IvaZinga
      @IvaZinga ปีที่แล้ว +2

      he corrects it later in the video at 4:49:53

    • @Nandhakumar_Arumugam8
      @Nandhakumar_Arumugam8 6 หลายเดือนก่อน

      You don't accept null

  • @nikeshthorat1613
    @nikeshthorat1613 ปีที่แล้ว +18

    3:58:00 : if sample mean = 110, then Z = 3.65 & it's not in our calculated UB & LB range (-1.96 to +1.96) So, we reject the Null Hypothesis & it improves the Intelligence as Z > UB.

  • @nikeshthorat1734
    @nikeshthorat1734 ปีที่แล้ว +28

    1:24:00 = Why sample variance is divided by n-1?
    th-cam.com/video/vGsRwB3TsiE/w-d-xo.html
    Summary : Researchers found that using the denominator (n-1) in sample variance/standard deviation calculations provides estimates closer to the population variance/standard deviation in various types of sample data distributions (positively/negatively skewed). This correction is also known as Bessel's correction or degrees of freedom.

    • @Userh5-s3k
      @Userh5-s3k 4 หลายเดือนก่อน

      why because sample is coming out from population for e.g. if population is "n" and from n we are taking some values which are less than "n" ,so that sample can be taken as n-1 as denominator

    • @shama-_-
      @shama-_- 2 หลายเดือนก่อน

      @@Userh5-s3k can u explain more

    • @1999theskullcrack
      @1999theskullcrack หลายเดือนก่อน

      @@shama-_- think of it as a way to remove bias. Should be called Bessel's correction. Do correct me if I am wrong.

  • @amoghpathak9224
    @amoghpathak9224 ปีที่แล้ว +8

    Absolutely Magnificent !! Just gone through the full video and I must say it's very informative. All the best I hope you get more success and happiness! Thank you so much!

  • @siddharthpatel2193
    @siddharthpatel2193 2 ปีที่แล้ว +78

    My Statistics Revision: Completed video in a day, amazing content, everything covered! Thanks, Krish sir and team.

    • @abcdabcd8605
      @abcdabcd8605 2 ปีที่แล้ว +11

      is this amount of statistics enough for a entry level data scientist?

    • @bapupatil9354
      @bapupatil9354 2 ปีที่แล้ว +2

      Yeah, In my opinion.

    • @abcdabcd8605
      @abcdabcd8605 2 ปีที่แล้ว

      @@bapupatil9354 okay

    • @abcdabcd8605
      @abcdabcd8605 ปีที่แล้ว +1

      @connecttechrockstar4474 😂😅i just created this channel to comment on videos. While creating the account I couldn't think of any other name, and so I kept this. 😅

  • @sachindeshpande8923
    @sachindeshpande8923 2 ปีที่แล้ว +62

    Thank you for putting this all in one summary. What you do here and ineuron is off the charts (I am a proud subscriber)

    • @AmirKhan-vg4br
      @AmirKhan-vg4br ปีที่แล้ว

      Bhai apni video ky metereils download keye hai agar to muja b sand kardo plz mai download karni ky koshish ke lekin download nahi hoty

  • @romansozonov9312
    @romansozonov9312 ปีที่แล้ว +19

    You are the best teacher i have in my life! The planet needs more people like you! Thank you alot, because of you i understand so much!

    • @dhawalgore9338
      @dhawalgore9338 ปีที่แล้ว

      how can I get the notes. It states unable to render code on Github

    • @aswaniyaramala5833
      @aswaniyaramala5833 11 หลายเดือนก่อน

      @@dhawalgore9338 try to download it and then open.

  • @abhijithsjeevan2279
    @abhijithsjeevan2279 ปีที่แล้ว +14

    Awesome Summary this is one and only channel where I see the clear packet of necessary data outlines .. Hats off 😊

    • @itz_satya_3
      @itz_satya_3 ปีที่แล้ว +5

      Bro learning fully this video enough for statistics in data science?? Plz give reply 🙂 for this question bro❤

    • @mindofmagnet3373
      @mindofmagnet3373 ปีที่แล้ว +2

      Pretty much enough bro

  • @radhekrashna2148
    @radhekrashna2148 ปีที่แล้ว +26

    Thank you for uploading all basic statistics in one video
    You really explained all concepts in a single video

  • @dharilpatel2072
    @dharilpatel2072 2 ปีที่แล้ว +378

    Watching Your Videos is better than Watching Netflix. Thank you sir.

    • @loujon191
      @loujon191 ปีที่แล้ว +12

      Does Netflix have commercials every 5 seconds

    • @bappirahman3294
      @bappirahman3294 ปีที่แล้ว

      ​@@loujon191Do you pay to watch yt?

    • @saivishnu2246
      @saivishnu2246 ปีที่แล้ว

      @@loujon191 Does Netflix provide it's content for free ??

    • @divyanshutripathy3484
      @divyanshutripathy3484 ปีที่แล้ว

      @@loujon191You pay for netflix, if you pay for youtube premium tou won’t have them

    • @saviour1998
      @saviour1998 11 หลายเดือนก่อน

      @@loujon191but you have to pay for subscription mate !

  • @puttupurajeswari4061
    @puttupurajeswari4061 2 ปีที่แล้ว +11

    Really, your explanation is too good. When I read the topics, I understood them in one way, but after watching your videos, I could think in a more practical way and see when we could apply them.

  • @palakbindal1804
    @palakbindal1804 ปีที่แล้ว +10

    Thank you for making stat so easy to understand. Awesome all in one content.

  • @yuvi2085
    @yuvi2085 6 หลายเดือนก่อน +4

    02:24:31 In my case it will show empty list of outliers because 100, 110, and 115 are not extreme enough to be classified as outliers with a threshold of 3. So i change the threshold 3 to 2 this way it will work. My dataset is : dataset = [50, 52, 48, 47, 51, 49, 53, 45, 46, 54, 55, 50, 49, 52, 47, 48, 50, 51, 46, 53,100,110,115]

    • @Mudaseer44
      @Mudaseer44 หลายเดือนก่อน

      🤭🤭🤭

  • @DeepRelaxation9922
    @DeepRelaxation9922 หลายเดือนก่อน

    Your way of teaching is amazing. I really wish I had someone like you as my statistics teacher back at university. 👍👍

  • @bapupatil9354
    @bapupatil9354 2 ปีที่แล้ว +11

    This helped me to clear my interview today. Thanks a ton for the statistics crash course.

    • @arunabhkumar6501
      @arunabhkumar6501 ปีที่แล้ว

      What company and what role bro?

    • @jameel25
      @jameel25 ปีที่แล้ว

      Yeah pls update will be useful for us

    • @bapupatil9354
      @bapupatil9354 ปีที่แล้ว

      @@arunabhkumar6501 It's for the Data analyst role wayfair company, Question was how to handle the missing values & outliers in dimension and measure.

    • @Dineshhhh131
      @Dineshhhh131 ปีที่แล้ว

      ​@@bapupatil9354hi did you have any prior experience, if not kindly share how did you apply for the job because I am fresher & trying to apply for job but not getting any calls as well as emails kindly support

    • @Eswar.
      @Eswar. ปีที่แล้ว

      @@Dineshhhh131 where are you applying
      have you got the job

  • @GetafixDruid
    @GetafixDruid 11 หลายเดือนก่อน +6

    Super sir. What I never understood @ school for 4 years. You have taught me in 6 hours. You are amazing. Thank you.

  • @anshumankumar1946
    @anshumankumar1946 ปีที่แล้ว +30

    Hello, for those who are looking for notes, you can go to one of his original live videos and from the description box you can go to the course dashboard and from there you can get the notes for each day separately in the resources section.

    • @HarleenKaur-fg4qu
      @HarleenKaur-fg4qu ปีที่แล้ว +2

      Thankyou so much

    • @anupamabalanmenon4771
      @anupamabalanmenon4771 ปีที่แล้ว +1

      The dashboard has crashed though

    • @anshumankumar1946
      @anshumankumar1946 ปีที่แล้ว

      @@anupamabalanmenon4771 not this one's description, the original live one's

    • @shubhamkamboj9407
      @shubhamkamboj9407 ปีที่แล้ว

      Please share the link here.

    • @gourav2985
      @gourav2985 ปีที่แล้ว

      HELLO Anshuman
      I tried finding it but didn't got any
      can you please help me with the link
      Thanks!

  • @HKNAGPAL7
    @HKNAGPAL7 6 หลายเดือนก่อน +2

    Watched the video in a day and a half, to revise stats that I had last studied in engineering days, helped me a lot cracking 3 quant interviews ending up in a 50% higher pay quant researcher role. Thanks brother. ❤

  • @simonjak100
    @simonjak100 ปีที่แล้ว +2

    Hey Man. I've been watching many of your videos and they are super helpful. Thank you sincerely.

  • @MrTshering9
    @MrTshering9 2 ปีที่แล้ว +6

    Thank you for the video. Just want to point out on P-value less than alpha , we reject the null hypothesis. 4:24:10 Kris fixed it later in the video.

  • @iVector
    @iVector 23 วันที่ผ่านมา +16

    Man you look like atul subhash💀💀🌚
    Great course btw!!👏🏻

  • @GaviniLok
    @GaviniLok 2 ปีที่แล้ว +4

    5hr:3min, Each tail will have 0.00889 and the middle region will have 0.9822, Z value gives the region below 2.307 which also includes area less than -2.307. So we need to subtract that tail value to get the middle region.

    • @itz_satya_3
      @itz_satya_3 ปีที่แล้ว

      Bro learning fully this video enough for statistics in data science?? Plz give reply 🙂 for this question bro❤

    • @hrithiksaxena3727
      @hrithiksaxena3727 ปีที่แล้ว

      I agree with with u I also think the same way bcoz while finding out the z score we r getting the area from +2.307 till -ve end not till -2.307, so i guess it should be each tail as 0.00889 which means P-value is 0.00889*2 and the middle region will be 1-(0.00889*2). But with all due respect, hats-off to u Krish sir for making stats soo simple to under. Because of u only we are able to spot even these minor things

  • @sanchitshrivastava2229
    @sanchitshrivastava2229 ปีที่แล้ว +3

    Sir i cant thankyou enough , I greatly appreciate your way of teaching. My interview is close and i pretty much covered everything i wanted from your channel alone. Thankyou Krish

  • @arvarc7028
    @arvarc7028 2 ปีที่แล้ว +11

    There are multiple types of distribution we have to learn some of them are bionimial distribution, gaussian distribution, Geometric distribution , exponential distribution , gamma distribution , beta distribution, Poisson distribution,weibull distribution,cauchy distribution

  • @Consciousness382
    @Consciousness382 2 ปีที่แล้ว +6

    Thanks Krish Sir, your videos cleared my concepts too much.... 🥰🥰🥰🥰

  • @adekunleokunade9027
    @adekunleokunade9027 11 หลายเดือนก่อน

    You are a Genius and I love the way you teach.
    Most of the topiccs I never really understood during my 5 years in school, it became so easy and simple for me.
    Thank you so much. You are indeed a blessing.

  • @Online_store_finds
    @Online_store_finds 3 หลายเดือนก่อน +2

    🎯 Key points for quick navigation:
    00:16 *📊 This video covers statistics concepts relevant to data science roles such as data scientist and data analyst, including descriptive and inferential statistics.*
    00:42 *📈 Descriptive statistics will include topics like measures of central tendency and dispersion, histograms, box plots, and cumulative distribution functions (CDFs).*
    01:25 *🎲 Probability and distributions covered include Gaussian, log-normal, binomial, Bernoulli’s, Pareto, and standard normal distributions.*
    02:22 *🔍 Statistical tests such as Z-test, T-test, ANOVA, and Chi-square will be demonstrated in Python.*
    03:30 *🧪 The section on inferential statistics focuses on hypothesis testing, confidence intervals, and critical statistical tables like Z-table and T-table.*
    05:07 *📉 Statistics help in the collection, organization, and analysis of data to improve decision-making processes.*
    05:35 *🧮 Data is defined as facts or measurable pieces of information, with examples like IQ scores and student ages.*
    07:00 *🎓 Descriptive statistics organize and summarize data, whereas inferential statistics use measured data to form conclusions.*
    08:49 *📐 Descriptive statistics can calculate measures like average, standard deviation, and mode from a dataset.*
    09:44 *🎒 Inferential statistics is about forming conclusions from sample data, such as determining if classroom marks are representative of the whole college.*
    11:22 *🌍 Population refers to the entire set, while a sample is a subset used for study, with notations represented by capital and small "n," respectively.*
    13:03 *🗳️ Different sampling techniques including simple random sampling which gives all members an equal chance of selection.*
    15:51 *🧮 Stratified sampling splits populations into non-overlapping groups, useful for demographics like gender or age.*
    18:38 *📋 Systematic sampling involves selecting every nth individual from a list or queue, and can be seen in scenarios like exit polls outside malls.*
    20:12 *🎯 Convenience sampling focuses on participants having a specific interest or expertise, often used for domain-specific surveys.*
    23:28 *💊 When testing drugs or conducting sensitive surveys, the use case determines the appropriate sampling technique to ensure reliable data.*
    24:38 *📊 Variables can take any value; examples include height and weight, which are quantitative.*
    25:22 *🔢 Two main types of variables: quantitative (numerical) and qualitative (categorical).*
    26:04 *🧮 Quantitative variables are measurable and allow mathematical operations.*
    26:32 *👥 Qualitative variables, like gender, are categorized based on characteristics.*
    27:08 *🔄 Qualitative data includes IQ categories and t-shirt sizes; cannot perform math operations.*
    28:06 *🤔 Quantitative data divides into discrete (whole numbers) and continuous (any value).*
    28:21 *🏠 Discrete example: number of bank accounts or children in a family.*
    29:49 *🏞️ Continuous example: height, weight, rainfall amount.*
    31:08 *🔢 Measurement types: nominal, ordinal, interval, and ratio variables.*
    31:36 *🌈 Nominal data are categorical without numerical significance, like color or type of flower.*
    32:33 *🏆 Ordinal data concerns order, not value, e.g., ranking of students by marks.*
    33:45 *🌡️ Interval data includes ordered values without a true zero, like Fahrenheit temperatures.*
    35:23 *📈 Frequency distribution visualizes data like flower types using tables.*
    36:19 *📊 Cumulative frequency shows the running total of frequencies.*
    37:13 *📊 Bar charts for discrete variables; histograms for continuous.*
    38:25 *📊 Examples include bar charts for discrete data and histograms for continuous data sets.*
    41:01 *📊 PDF (probability density function) smooths histograms for continuous datasets.*
    42:17 *📚 Moving to intermediate stat topics: including central tendency and Gaussian distribution.*
    43:11 *📏 Arithmetic mean (average) for population and sample; population denoted by capital N.*
    45:11 *🧮 Central measure of tendency involves mean, median, and mode to find data center.*
    46:34 *📊 Mean can be impacted by outliers, exemplified by adding a large number to a dataset.*
    05:14:53 *🔄 Bernoulli Distributions: Only two outcomes are possible, defined by probabilities p and q, with q being 1 minus p.*
    05:16:46 *🎲 Probability Mass Function: Used for categorical variables, not continuous, and shown using graphs.*
    05:19:17 *📊 Binomial Distribution: Involves multiple Bernoulli trials, represented by number of trials (n) and probability of success (p).*
    05:20:53 *📈 Pareto Distribution: Known as the 80/20 rule or power law distribution, often used to illustrate skewed distributions like wealth concentration.*
    05:23:42 *🔍 Log Normal Distribution: Related to Pareto distribution, known for its right-skewed data representation.*
    05:24:09 *🔄 Data Transformation: Techniques such as Box-Cox transformation can convert distributions to a normal distribution.*
    Made with HARPA AI

  • @kapilvirat4443
    @kapilvirat4443 2 ปีที่แล้ว +3

    Thanks from my bottom of the heart

  • @anirbaniitgn8407
    @anirbaniitgn8407 2 ปีที่แล้ว +25

    sir with due respect you have made a major mistake in P-value and significance value Hypothesis conclusion 4:48:39 -
    A p-value less than 0.05 (typically ≤ 0.05) is statistically significant. It indicates strong evidence against the null hypothesis, as there is less than a 5% probability the null is correct (and the results are random). Therefore, we reject the null hypothesis, and accept the alternative hypothesis.
    You did the opposite.
    Overall the course was good but minor mistakes here and there. Thank you
    Though you corrected it later.. But best is when editing you could just add * and add comment on video. Because while studying and taking notes with lecture it becomes a very bad experience to go back and correct all the wrong. The thought process needs to changed fully to understand it again..

    • @avinashajmera2775
      @avinashajmera2775 2 ปีที่แล้ว +2

      true

    • @mirroring_2035
      @mirroring_2035 2 ปีที่แล้ว

      true, that was annoying

    • @itz_satya_3
      @itz_satya_3 ปีที่แล้ว +1

      Bro learning fully this video enough for statistics in data science?? Plz give reply 🙂 for this question bro❤

    • @nakulmehta797
      @nakulmehta797 9 หลายเดือนก่อน

      True i was also confused and stuck for half an hour because he only contradict himself .

  • @saringurung9776
    @saringurung9776 ปีที่แล้ว +2

    Thank you Krish!! Learned a lot from this session.

  • @Skill_builder
    @Skill_builder ปีที่แล้ว +8

    Much needed one
    If you could do the same for SQL and python would be very beneficial 😊

  • @Brightbuiii
    @Brightbuiii 2 หลายเดือนก่อน

    The best stats course in my life many thanks!!

  • @javeedtech
    @javeedtech 2 ปีที่แล้ว +3

    this is better than FSDA live classes.

  • @amitattafe
    @amitattafe 2 ปีที่แล้ว +4

    First of all thanks for producing such a useful and insightful video on Statistics.
    Now my question is about exit poll results (almost all are failing). What I infer from that is:
    1) Samples are biased- As they claim of random sampling but the samples are biased (gender biased, community biased, wrong answering biased).
    2) Sample size- owing to humongous population of our country its quite impossible to collect even considerable sample data from all types of populations.
    3) Biasing in result predictions- as can be seen all analysis of exit poll is agenda driven that is party specific. (Human bias)
    4) Collection techniques- as technology progresses still these companies rely on old conventional way of inferring the results. Most of them rely on structured data or on survey reports but in todays Data driven world unstructured data can predict better results which all companies are lagging.

  • @FakeGuy-hd7jj
    @FakeGuy-hd7jj 7 หลายเดือนก่อน +1

    Bhaiya...U r my saviour 🫶🫰
    Like tomorrow is my end sem. paper and right now I am actually getting my whole syllabus in a single video with this much clearance in every topic. As now I got some good overview of every topic so I have to put less effort than before for covering the whole syllabus. Feel like now I have build up some confidence for tomorrow.
    I really enjoyed the video and your method of teaching in simple language.
    Thanks a lot bhaiya 🙏🙏
    PS :- Never studied statistics in my class throughout the whole semester. As we are the backbenchers who only cover the syllabus one night before paper day.😅

  • @arabindamcs
    @arabindamcs 5 หลายเดือนก่อน

    Excellent tutorial. Reall appreciate this !!! You are doing a great social service ! God bless you

  • @manojtaurian
    @manojtaurian 2 ปีที่แล้ว +13

    Hi Krish Sir,
    Thanks for the amazing informative video. In type 1 and type 2 error while explaining confusion matrix TN should be type 2 error. Earlier you wrote correctly, but later you marked FN as type 2 error which is incorrect.

    • @jameel25
      @jameel25 ปีที่แล้ว

      Ya, why no one pointed it out

    • @_jahidulislam-iy3ju
      @_jahidulislam-iy3ju ปีที่แล้ว +1

      I got confused. Chatgpt says FN is type-2 error

    • @prateekjoshi9169
      @prateekjoshi9169 ปีที่แล้ว

      FP is type 1 and FN is type 2 error

  • @raghavverma120
    @raghavverma120 2 ปีที่แล้ว +5

    In z-score section.. u can add right table + 0.5 and then subtract it from 1.. it gives the same thing

  • @animeshgarg1288
    @animeshgarg1288 2 ปีที่แล้ว +5

    What a great resource!

  • @ginuxtech
    @ginuxtech ปีที่แล้ว +1

    Waoh! this has been my best tutorial on Statistic so far... Thank you so much for your explanation

  • @ankitavishwakarma4851
    @ankitavishwakarma4851 หลายเดือนก่อน

    Thankyou sir finally I found a video that helps me to learn statistics in practical ways.

  • @hrshlgunjal-1627
    @hrshlgunjal-1627 8 หลายเดือนก่อน

    Amazing video, thanks for cutting the video and making it more seamless. Your efforts are really appreciated. Thanks for this video.

  • @rishibakshi2004
    @rishibakshi2004 2 ปีที่แล้ว +4

    thankyou so much sir for this wonderful course statistics video...i came here as a newbie but now after completing this i have much knowledge about data and also my approach has changed in seeing data and treating it...blessed to find your channel thankyou sir❤❤

  • @maheshkumbhar5548
    @maheshkumbhar5548 2 ปีที่แล้ว +11

    no words to say. Legendary content!!!

  • @suvopal3234
    @suvopal3234 11 หลายเดือนก่อน +3

    A p-value less than 0.05 is typically considered to be statistically significant, in which case the null hypothesis should be rejected but you keep on saying should not be rejected - is there something i am missing here ? time stamp - 4:48:15

  • @anushruthikae839
    @anushruthikae839 8 หลายเดือนก่อน +1

    2:08:07 , we can consider this approach , where the area between range 100 till 145 (right part) to be 50 % as the guassian distribution is generally a symmetric one.
    and we also know , area between 85 to 115 is 68 % as it the between -1 SD to +1SD as per 68% 95% 99.7% rule , there fore area between 85 to 100 would 34 (68/2) due to its symmetry.
    now on adding area between (85-100 and 100-115), we get 50+34 = 84 .
    84% represents the region of people having IQ more than 85.
    to find less than 85 we need to subract 100 from it, i.e. 100-84 = 16%

    • @invinciblegirl4386
      @invinciblegirl4386 6 หลายเดือนก่อน

      Yea. It just gives a approximate value..But to get a precise value we should use the table I guess

  • @sagarpandya7865
    @sagarpandya7865 11 หลายเดือนก่อน

    This is great Stats Playlist . Thanks for making this Krish

  • @syedarbaz2060
    @syedarbaz2060 2 ปีที่แล้ว +3

    Thank you sir for this awesome tutorial and your teaching skills is just amazing

  • @arghadeepmisra7865
    @arghadeepmisra7865 2 ปีที่แล้ว +19

    This is so needed 1.Consise 2.Detailed 3.No idiot is asking unnecessary questions
    GREAT

  • @sapnatare
    @sapnatare ปีที่แล้ว +5

    Nice summary -well explained. !

    • @itz_satya_3
      @itz_satya_3 ปีที่แล้ว +1

      Mam learning fully this video enough for statistics in data science??😊

    • @syedsajjadali4220
      @syedsajjadali4220 11 หลายเดือนก่อน

      ​@@itz_satya_3yes i believe

  • @rajamritmohapatra83
    @rajamritmohapatra83 10 หลายเดือนก่อน +1

    wonderful great job sir,just my finding @ 5:00:00 the area 0.99111 is area from left end to till that z value, so 1-0.99111 = 0.00889 is the remaining area corresponding to alpha 0.025 so 0.00889*2 = 0.01778 is the p value for alpha 0.05 which you explained correct ,later you changed it which is not correct, but anyway it is less than 0.05 so Ho is rejected

  • @aiueo8962
    @aiueo8962 7 หลายเดือนก่อน

    Thanks, one of the best course i've ever watch

  • @sauravgupta2926
    @sauravgupta2926 2 ปีที่แล้ว +9

    4:48:49 if p-values is less than alpha, then we need to reject the null hypothesis. This needs to be corrected in the video. Let me know if I missed something. It was a great explanation overall. Thanks
    Later in the video, correction has been made. Thanks

  • @headoverheels6647
    @headoverheels6647 หลายเดือนก่อน

    Thank you so much for sharing the concepts in easier way!

  • @vyankateshkongari5128
    @vyankateshkongari5128 ปีที่แล้ว +2

    very informative and helpful video for revising all concept and also for interview purpose thank you krish sir for making for us this like beautiful video

  • @JesúsCastillo-n1h
    @JesúsCastillo-n1h 9 หลายเดือนก่อน +1

    Thanks Mr. Krish! Greetings from Chile!

  • @Salik-w7h
    @Salik-w7h 3 หลายเดือนก่อน

    Thanks, I enjoyed watching it over these three days. It is a good refresh for my statistic to give a good exam tomorrow, i hope!

  • @shohebshaikh9128
    @shohebshaikh9128 ปีที่แล้ว +1

    1st Assignment Answer :
    Ratio data = where "0" is treated as start point/ origin in measurement.
    e.g. income, experience, height, weight, etc.
    Please correct if I am wrong. Thanks

  • @rajeebm1
    @rajeebm1 ปีที่แล้ว +3

    I always knew that if p_value is less than significance lavelm(0.05), H0 is rejected. But here we have seen the opposite. Please check the video Krish

  • @abhishekbhardwaj923
    @abhishekbhardwaj923 ปีที่แล้ว

    Hi Krish ! I think in the p value concept ,you reversed the case .ref(lecture no 6 and 7)
    when p>alpha (we accept the null hypothesis)
    when p

    • @abhishekbhardwaj923
      @abhishekbhardwaj923 ปีที่แล้ว

      lecture 7 also there is a mistake in calculating p value

  • @lekhamotani
    @lekhamotani ปีที่แล้ว +1

    You are a great teacher!

  • @lilyfullery4779
    @lilyfullery4779 ปีที่แล้ว +4

    i have found this tutorial to be the best so far , A big thank you

  • @sahibsinghnagi683
    @sahibsinghnagi683 2 ปีที่แล้ว +2

    God Bless you! 😀

  • @akhandshahi3337
    @akhandshahi3337 2 ปีที่แล้ว +19

    If p_value is less than 0.5 then we should reject the null hypothesis(Ho).

  • @OrtegaTalks
    @OrtegaTalks ปีที่แล้ว +5

    On the measures of central tendency, I noticed, you kept referring to the median as the mode. Those are two different measures. Luckily you corrected it down the line. But overall, perfect summary of statistics.

  • @lecturesfromleeds614
    @lecturesfromleeds614 9 หลายเดือนก่อน +1

    You the man Krish!!

  • @paneercheeseparatha
    @paneercheeseparatha 3 หลายเดือนก่อน +1

    Great video. But there are a few errors which might lead to misunderstanding. In the p-value calculation part at 05:01:00, after obtaining the cumulative probability from the z-score table for +2.304, lets say its \alpha, then the p-value should be 2*(1-\alpha). This is because 1-\alpha gives the area to the right of z=+2.304 point and twice of that area should be the p value.
    Would highly appreaciate if you could add a few comments in the video to resolve this error. Thanks for the amazing video again.

    • @paneercheeseparatha
      @paneercheeseparatha 3 หลายเดือนก่อน

      Also it would be very helpful, if you could share the updated notes as well.

  • @rohanmalik2910
    @rohanmalik2910 5 หลายเดือนก่อน

    Most easy to understand course about math 🤩
    Thank you so much sir 🙏

  • @balakrishnay07
    @balakrishnay07 ปีที่แล้ว

    Superb,Thank you Krish for amazing content.👏

  • @sfk21
    @sfk21 ปีที่แล้ว +1

    Thank you so much Krish today I just have finished your lectures

  • @avronilbanerjee5302
    @avronilbanerjee5302 ปีที่แล้ว +25

    I bought the Data Science and Machine Learning course from GFG, in the statistics section the faculty suddenly started speaking alien language, and now I am here enjoying the best free content.

    • @JyotsnaK-c7l
      @JyotsnaK-c7l 10 หลายเดือนก่อน

      Please tell whether that course is worth to do or not or simply can we update our skills??!!

    • @avronilbanerjee5302
      @avronilbanerjee5302 10 หลายเดือนก่อน

      @@JyotsnaK-c7l stay away from gfg data science and machine learning course

    • @bollywooddairy4795
      @bollywooddairy4795 4 หลายเดือนก่อน

      Brother notes vgara h kya iske smj me nhi aa rha thoda aa rha h bss

  • @Terpene1
    @Terpene1 10 หลายเดือนก่อน

    Sir you are an absolute legend. I dont have an interveiw but this was useful to understand the concepts. Thank you!

  • @naveenkumarjadi2915
    @naveenkumarjadi2915 7 หลายเดือนก่อน +9

    Exactly this, he got it very confused in the video. For other folks who are confused:
    at 95% confidence, alpha = 0.05,
    at 90% confidence, alpha = 0.1, this is within the confidence interval if alpha = 0.05
    Looking at this, if alpha is 0.05, then a value > 0.05 (cuz 0.1 > 0.05) will fall within the confidence interval.
    So, p>alpha implies that it lies within the confidence interval, so we accept the null hypotheses.
    p REJECT THE NULL HYPOTHESIS
    p > alpha (domain expert will tell this values) ---> ACCEPT THE NULL HYPOTHESIS

  • @syedmahadurrayyan
    @syedmahadurrayyan หลายเดือนก่อน

    for p value you should multiply by 2 instead of divide by 2 because from z table we find out value of 2.3 it give us the submission of all value till 2.3 including -2.3 and after subtracting that value by 1 we will get only value which is greater than 2.3 but we also need value of less than 2.3 for this we have to multiply the value of greater than 2.3 by 2

  • @tbedaniel6387
    @tbedaniel6387 11 หลายเดือนก่อน

    I appreciate you Sir, best Statistics course ever!

  • @ArnabNayak-gl2xx
    @ArnabNayak-gl2xx 7 หลายเดือนก่อน +1

    Sir, I am going to do bstat in isi but I had not statistics at 10+ 2 level. Your video has helped me to start my journey with statistics by making every concept crystal like clear.

    • @dileepreddy7609
      @dileepreddy7609 4 หลายเดือนก่อน

      Cong bro. Im at isi too. CS

    • @ArnabNayak-gl2xx
      @ArnabNayak-gl2xx 3 หลายเดือนก่อน

      @@dileepreddy7609 in which isi? Kolkata or Bangalore?

    • @dileepreddy7609
      @dileepreddy7609 3 หลายเดือนก่อน

      @@ArnabNayak-gl2xx kolkata

    • @ArnabNayak-gl2xx
      @ArnabNayak-gl2xx 3 หลายเดือนก่อน

      @@dileepreddy7609 I am also in Kolkata

  • @adarshkashyap7402
    @adarshkashyap7402 10 หลายเดือนก่อน

    Guru ji mst..... Teaching and depth

  • @ShubhamKumar-sh8qy
    @ShubhamKumar-sh8qy 11 หลายเดือนก่อน +2

    thank you for the video. I have a subject named Introduction to Data Science using python and has statistic in my syllabus, as it was only introduction, i have completed the video till t - test and i have basic idea about covarience and correlation. My exam is tomorrow let's see if i pass.

  • @priyankchaudhari6412
    @priyankchaudhari6412 2 ปีที่แล้ว +8

    Sir while calculating p-value for 2 tail you don't have to divide by 2 5:02:03 ... you're making this mistake coz while considering CI area u have to again subtract 0.0089 coz area from z chart is from left to z value

    • @Justme_and_mysubscribers
      @Justme_and_mysubscribers ปีที่แล้ว

      Yes I was thinking the same

    • @jiyabyju565
      @jiyabyju565 ปีที่แล้ว

      i want to say the same...area from the z table giving entire area towards the left

  • @osikoyaadeola2530
    @osikoyaadeola2530 2 ปีที่แล้ว

    I am learning a lot from your Chanel. Thank you so much.

  • @onlaughlens
    @onlaughlens 2 ปีที่แล้ว +12

    Thank you sir for this awesome tutorial and your teaching skills is just amazing 😍

  • @tienesmalaliento
    @tienesmalaliento 6 หลายเดือนก่อน

    thanks, one of the only people who explain all of this clearly haha!

  • @TracyOyikowo
    @TracyOyikowo 6 หลายเดือนก่อน +1

    This is very nice thank you Mr Krish

  • @vinaypritwani
    @vinaypritwani ปีที่แล้ว +3

    this is pure gold for DS students

    • @itz_satya_3
      @itz_satya_3 ปีที่แล้ว

      Bro learning fully this video enough for statistics in data science?? Plz give reply 🙂 for this question bro❤

  • @devseal1215
    @devseal1215 ปีที่แล้ว +3

    Hey Krish! Very much impressed with your videos. Currently i'm going through DS science course. and these videos are the life saving
    for me.

  • @rahulugale45
    @rahulugale45 6 หลายเดือนก่อน

    fantastics course...Hats off sir

  • @sajjaduddin8188
    @sajjaduddin8188 6 หลายเดือนก่อน

    Best video. Thank you sir!

  • @Laxmi-l8h
    @Laxmi-l8h หลายเดือนก่อน

    Amazing course sir,thanks a lot

  • @MuhammadWahab-jt6ly
    @MuhammadWahab-jt6ly ปีที่แล้ว +1

    world's beast video ever on TH-cam about statistics

  • @maheshmishra7490
    @maheshmishra7490 24 วันที่ผ่านมา

    Thanky you sir ❤❤ it's a goldmine🎉

  • @sapnewapne
    @sapnewapne 3 หลายเดือนก่อน +2

    18.10.2024 : I finished this video from start to finish. Thank you Sir!

  • @ashutoshthokare2127
    @ashutoshthokare2127 ปีที่แล้ว +1

    thank you so much sir, you explained everything very well

  • @suryapn-hi6ru
    @suryapn-hi6ru 6 หลายเดือนก่อน +1

    Mind blowing video brilliant teaching 🫡

  • @saiyan5592
    @saiyan5592 9 หลายเดือนก่อน

    Hi sir recently i ve seen this video !! So amazing and helpful for my data science carrer !!
    Now i saw in other videos that in hypothesis testing ,
    the type 1 error is false positive and type 2 error is false negative

  • @priyanshigoel6710
    @priyanshigoel6710 ปีที่แล้ว +1

    amazing lecture!

  • @yasararafath9266
    @yasararafath9266 3 หลายเดือนก่อน +1

    20-10-2024 : i finished this video from start to finish, with handwritten notes and solving them
    Thank you Sir!

    • @asfiyahm-cx5kd
      @asfiyahm-cx5kd 3 หลายเดือนก่อน

      where is the handwritten notes can you share link pls ......coz im couldnt find here

    • @mdshahanawajansari7241
      @mdshahanawajansari7241 3 หลายเดือนก่อน

      @@asfiyahm-cx5kd All notes are present in this link . Class notes as well as interview Question notes

    • @asfiyahm-cx5kd
      @asfiyahm-cx5kd 3 หลายเดือนก่อน

      @@mdshahanawajansari7241 tq❤️