Goggins and Travis Scott explain Sample Variance

แชร์
ฝัง
  • เผยแพร่เมื่อ 29 ธ.ค. 2024

ความคิดเห็น • 60

  • @ootansup
    @ootansup 7 หลายเดือนก่อน +73

    This actually gave me knawledge

    • @pirate1619
      @pirate1619 7 หลายเดือนก่อน +3

      Yes 🥹😭😭

    • @bignose420
      @bignose420 6 หลายเดือนก่อน +2

      Well in math only at least......

  • @prithvidhyani2002
    @prithvidhyani2002 7 หลายเดือนก่อน +24

    Nice video, few additional comments:
    1) we don't square the deviations to keep them positive(we could use absolute value for that), we do this because it is has nice mathematical properties and quite frankly that's just how the second moment is defined(shifted by first moment).
    2) we divide by n-1 when calculating the sample stdev but n when calculating the population stdev, why? Because the calculation of stdev requires us to know the mean. Theoretically, if we knew the mean of the population, we wouldn't need to divide by n-1 to find the stdev of a sample. In practice however, in order to find the average squared deviations, we use the sample mean to estimate the population mean. In do doing so, we expend one degree of freedom because estimating the population mean with the sample mean forces the choice of the last data point which means only n-1 data points are free to vary.

    • @prithvidhyani2002
      @prithvidhyani2002 7 หลายเดือนก่อน +5

      read about Bessel's correction to understand the second point

  • @Shian077
    @Shian077 7 หลายเดือนก่อน +41

    So I have studied this chapter but today I learned this formula to some more extinct

    • @that1cr33p
      @that1cr33p 7 หลายเดือนก่อน +11

      extent*

  • @adnanmaruf282
    @adnanmaruf282 7 หลายเดือนก่อน +21

    Satying Hard-David Goggins

  • @thevikvids
    @thevikvids 7 หลายเดือนก่อน +14

    Better than my math teacher

  • @thebrownfrog
    @thebrownfrog 7 หลายเดือนก่อน +31

    The actual reason they are squared is different. By this video's logic you could just remove the negative sign. The actual reason has to do with Euclidean distance. If you treat the data as point in big-dimensional space, you're just taking the distance of the data and the average of the data

    • @ignasa007
      @ignasa007 7 หลายเดือนก่อน +15

      That's not the reason, but just an interpretation. Their is no reason; this is simply the definition of population variance being the second central moment, and sample variance is the same thing just scaled to make it an unbiased estimator.

    • @thebrownfrog
      @thebrownfrog 7 หลายเดือนก่อน +4

      @@ignasa007 in fact it is the reason, because you've misinterpreted my intended meaning of the "reason" in this context

    • @139-b7j
      @139-b7j 7 หลายเดือนก่อน +12

      The reason is because it is algebraically easier to work with. In fact, absolute value distance is a superior measure of statistical distance since it is far more robust.

    • @nikos4677
      @nikos4677 7 หลายเดือนก่อน +1

      yes but why wouldnt you use absolute for dostance? Your explanation fails to explain this.

    • @alma-lz7cq
      @alma-lz7cq 6 หลายเดือนก่อน +1

      Yeah if that was the only reason you coul have just get the root of those bumbers

  • @BYAMUKAMAKENNETHKENNETH
    @BYAMUKAMAKENNETHKENNETH 5 หลายเดือนก่อน

    This channel is really helping us with us with math problems

  • @nanamacapagal8342
    @nanamacapagal8342 7 หลายเดือนก่อน +1

    NOTE!
    If you're wondering why variance is labelled s^2, it's because we squared our measurements earlier. If you wanted just s, that would be "standard deviation", and to get that you literally just square root the variance. that's it.

  • @fr1gus
    @fr1gus 7 หลายเดือนก่อน +7

    Commenting for algorithm, love this type of content!!!

  • @pietronegretto
    @pietronegretto 6 หลายเดือนก่อน

    You square the difference so that if a value is way out from the average it is gonna highly influence the variance

  • @brockobama257
    @brockobama257 7 หลายเดือนก่อน +4

    “We just square it to make sure it’s positive”
    Why not take the absolute value instead then?

    • @hypronmaster6756
      @hypronmaster6756 7 หลายเดือนก่อน

      because its like using the distance formula

    • @geojoyson3747
      @geojoyson3747 7 หลายเดือนก่อน +2

      Its more complicated to take absolute. When u generalise this to continuous values, analysing absolute is difficult.

  • @NXT_LVL_DVL
    @NXT_LVL_DVL 7 หลายเดือนก่อน +2

    Why (n-1) ?
    Why don't we just take the mod

  • @joelroy9221
    @joelroy9221 2 หลายเดือนก่อน

    Travis Scott when he's not making brain rot mumble rap:

  • @notdaycrucial5179
    @notdaycrucial5179 7 หลายเดือนก่อน +3

    if the square is to make everything positive why can't we just take the absolute value of each term?

    • @139-b7j
      @139-b7j 7 หลายเดือนก่อน +1

      That is what is used in robust statistics.

    • @sentheaS
      @sentheaS 7 หลายเดือนก่อน

      You could, it just wouldn’t give the variance.

    • @notdaycrucial5179
      @notdaycrucial5179 7 หลายเดือนก่อน

      @@sentheaS I thought s^2 is just the variance squared. Is s^2 then just what the variance is

    • @MrTeen-ul7yc
      @MrTeen-ul7yc 7 หลายเดือนก่อน

      @@notdaycrucial5179 Yes s^2 is the variance. It has units that are square units of whatever you are measuring. To bring it back to s you take the square root and that's what you'd call the standard deviation. It now has the same units as the thing you're measuring.

    • @notdaycrucial5179
      @notdaycrucial5179 7 หลายเดือนก่อน

      @@MrTeen-ul7yc oh ok thanks

  • @ragnadrok7
    @ragnadrok7 6 หลายเดือนก่อน

    A/B Hypothesis testing please

  • @darondsouza600
    @darondsouza600 7 หลายเดือนก่อน +1

    I smell underated.

  • @yusufahmed2233
    @yusufahmed2233 7 หลายเดือนก่อน

    Please do the population and sample standard deviation with the degree of freedom thing

  • @Zepheray
    @Zepheray 7 หลายเดือนก่อน

    I'd love to see one about relativity, quantum, or other things in space. I really want to do this but the link is broken and I can't try it either

  • @djstorey
    @djstorey 7 หลายเดือนก่อน

    Damn. AI is insane insane

  • @RKAravind-eu7wj
    @RKAravind-eu7wj 7 หลายเดือนก่อน

    Thanks bro i never understood statistics i just straight up slap the formulas

  • @abe.vs.ape.
    @abe.vs.ape. 7 หลายเดือนก่อน

    Great content

  • @abe.vs.ape.
    @abe.vs.ape. 7 หลายเดือนก่อน +2

    Stay hard

  • @sweetsilver8016
    @sweetsilver8016 หลายเดือนก่อน

    How the hell an ai channel teaching better then a paid teacher.

  • @sergio_henrique
    @sergio_henrique 7 หลายเดือนก่อน +1

    Why divide by N-1 instead of just N ?

    • @alazrabed
      @alazrabed 7 หลายเดือนก่อน +2

      It's a bit convoluted but in small samples, just dividing by N actually gives a wrong estimator (we say that it is biased). Diving by N-1 fixes that. Notice that in sufficiently large sample N-1 just tends to be N so it wouldn't matter too much which one you use.

    • @sergio_henrique
      @sergio_henrique 7 หลายเดือนก่อน +1

      @@alazrabed I wonder if there's a mathematically constructed argument for choosing "N-1" to "make it work". Otherwise it kinda seems like some sort of "quick fix" to a problem, without much reasoning behind it. But I am just a layman.

    • @alazrabed
      @alazrabed 7 หลายเดือนก่อน +1

      @@sergio_henrique Oh yes, there is one, and it's actually quite subtle and profound. The idea is that, given your sample mean, you only need to know all of the sample value, except one: you can just deduce the last one if you know everything else.
      For instance let's say the mean is 5 from a population of three. If I know two values from my sample, let's say 4 and 5, then I can deduce the final one is 6 -- it has to be as such.
      This implies a loss of _degrees of freedom_ for the sample. Just one, and only one. And this is the "minus one" that you find in the denominator.

    • @sergio_henrique
      @sergio_henrique 7 หลายเดือนก่อน

      @@alazrabed Nice! Makes a lot more sense now with this intuition. Thanks for the clear and succinct explanation!👌

  • @Albert_Pecanio
    @Albert_Pecanio 2 หลายเดือนก่อน

    cm squared as final units?

  • @remsammo
    @remsammo 7 หลายเดือนก่อน

    Beautiful

  • @Armstrong1781
    @Armstrong1781 2 หลายเดือนก่อน

    why 'n minus one' below ?

  • @djstorey
    @djstorey 7 หลายเดือนก่อน

    Insane

  • @renr4502
    @renr4502 7 หลายเดือนก่อน

    So hey, why does the Standard Deviation use the absolute bracket instead??

    • @alazrabed
      @alazrabed 7 หลายเดือนก่อน +1

      Don't know nothing about absolute brackets. Standard deviation is just the square root of the variance. Really useful because it is homogeneous in dimension with the sample measurements you made.

    • @renr4502
      @renr4502 7 หลายเดือนก่อน

      @@alazrabed I meant "mean deviation", not standard deviation, sorry.

    • @alazrabed
      @alazrabed 7 หลายเดือนก่อน +2

      @@renr4502 Oh okay. So then the absolute function allows you, here again, to get rid of the negative values. But it won't measure the same thing. Because with the variance, as we square the difference from the mean, values that are really far from the mean will weight a lot in the final result. This is less true for the "mean deviation" -- I'm not sure where you would use that function, though.

    • @renr4502
      @renr4502 7 หลายเดือนก่อน +1

      @@alazrabed Yes, exactly, thank you for the explanation. I think I could grasp the understanding of both variance and standard deviation.
      But back a year ago, I learned statistic in my last year on high school, and they teached me about the mean deviation.
      I searched up on google and they said that both standard deviation and mean deviation are used to find volatility (Frequently used on finding risk on investmen since higher volatility gives a higher fluctuation). They also said that mean deviation is an alternate to the standard deviation, but are used less frequently. But the standard deviation is preferably used when there are large outliers because they can register higher levels of dispersion (or deviation from the center) than mean absolute deviation. So, I guess, standard deviation is more superior.
      I'm not really sure why they teached us the mean deviation in the first place, maybe because it's more praticable without calculator.

  • @parthgoswami3022
    @parthgoswami3022 7 หลายเดือนก่อน

    This is what I invest my dollars in Nvidia for

  • @knfczae
    @knfczae 7 หลายเดือนก่อน +1

    fuck this is genius