Shapiro-Wilk test

แชร์
ฝัง
  • เผยแพร่เมื่อ 20 ธ.ค. 2024

ความคิดเห็น • 16

  • @logantyner2214
    @logantyner2214 3 ปีที่แล้ว +6

    The way you broke this down makes it incredibly easy to understand. thank you!

  • @langleymcentyre2754
    @langleymcentyre2754 9 หลายเดือนก่อน

    Thank you for making this video it really clarified the concepts for me

  • @mobileentertainment212
    @mobileentertainment212 2 ปีที่แล้ว +1

    could we get more explanation on the numerator? on the actual derivation of the variable a

  • @keerthanavivin450
    @keerthanavivin450 3 ปีที่แล้ว

    Great video! Understood the concept perfectly. A big thanks.

  • @krahoc
    @krahoc 3 ปีที่แล้ว +7

    Remember Shapiro-Wilk's test should not be used for populations greater than 50, so basically it's quite useful for small populations even if it's a bit sensitive.
    Anyway, great video, maybe you should develop a bit the p-value part explaining how you could interpret it as a confidence indicator about H0.

    • @Николай-щ2ц7з
      @Николай-щ2ц7з 3 ปีที่แล้ว +1

      Why such a limitation? Which test is best for a population greater than 50?

    • @krahoc
      @krahoc 3 ปีที่แล้ว +3

      @@Николай-щ2ц7з the limitation is inheritant to the test method. In their paper, Shapiro and Wilk use a table to compare the W value to a tabulated critical W, but the table goes only from 3 to 50.
      There is a way to "extend" the test by tabulating yourself the critical values with Monte-Carlo simulation, but that's beyond the Shapiro-Wilk's paper. ;)
      I don't really know what's behind the scenes of the R calculations but I think that's what is done here.

    • @diazjubairy1729
      @diazjubairy1729 3 ปีที่แล้ว

      @@krahoc but i've read recent paper talking about shapiro-wilk test is good enough up until 1000 datas

    • @krahoc
      @krahoc 3 ปีที่แล้ว +1

      @@diazjubairy1729 if you compute the critical W values by Monte-Carlo simulation, you may use it with any amount of values but the more data you test the less discriminant it will be, just like every normality test. That's the curse of big samples :')

    • @waytolegacy
      @waytolegacy 4 หลายเดือนก่อน

      Just wondering, if it is not good for more than 50 samples, then why shouldn't we just simply use the skewness to asses the normality. I mean If -0.5 < skewness < 0.5 then Yes else No.

  • @tomp4925
    @tomp4925 2 ปีที่แล้ว

    Is there a formula to calculate "a"?

  • @aliakbarsafdari4084
    @aliakbarsafdari4084 3 ปีที่แล้ว

    This was an excellent video, thanks.
    So, my dataset has columns with very varying counts of values (from 8 to >250).
    Can I use this test as a rough estimation of normality for each column? Or should I prepare Q-Q plots for each?
    Also, if I wish to seek the statistical significance between these columns (they measure the same parameter), what test would you recommend?
    Thanks again.

  • @HuzMusFinanceData
    @HuzMusFinanceData 3 ปีที่แล้ว

    Thank you for this Video.. In simple words, explain if the P-value is < 0.05 so Data is normal or Abnormal?

    • @swastikpatro6436
      @swastikpatro6436 3 ปีที่แล้ว +2

      Normal

    • @HuzMusFinanceData
      @HuzMusFinanceData 3 ปีที่แล้ว

      @@swastikpatro6436 Thank you. ❤

    • @vikramjoshi3452
      @vikramjoshi3452 11 หลายเดือนก่อน

      If p < 0.05, the data are not normally distributed. Remember, the null hypothesis for Shapiro.test() is that the data are normally distributed.