How to read a box plot (a.k.a. a box-and-whisker plot) - Nick Desbarats

แชร์
ฝัง
  • เผยแพร่เมื่อ 19 ธ.ค. 2024

ความคิดเห็น •

  • @koushikrafsan5820
    @koushikrafsan5820 2 หลายเดือนก่อน +3

    Best video on Box plot, I found so far. It is extremely helpful for me.

  • @valerioharvey7289
    @valerioharvey7289 หลายเดือนก่อน +2

    finally an explanation that I can at least grasp

  • @thejohnringo
    @thejohnringo 2 ปีที่แล้ว +5

    Your explanations were exceptionally clear.

  • @jjsambac
    @jjsambac 3 ปีที่แล้ว +13

    great video! straightforward and informative. Loved that you also offered an alternative option

  • @PrathamGupta-g6b
    @PrathamGupta-g6b หลายเดือนก่อน +1

    Perfect Video. The only video one need

  • @anima8450
    @anima8450 ปีที่แล้ว +3

    As someone who's studying to become a psychologist this was very useful thank you!

  • @matthewkarasoulis5680
    @matthewkarasoulis5680 2 หลายเดือนก่อน +1

    Amazing video, really helped me out alot!

  • @nilamdhatrak6346
    @nilamdhatrak6346 3 ปีที่แล้ว +3

    Most easy and most informative!

  • @dianndp4957
    @dianndp4957 ปีที่แล้ว +2

    It was very helpful and easy to understand, thanks for the hard work

  • @chrismalingshu
    @chrismalingshu 3 ปีที่แล้ว +3

    Informative & easy to understand! Thanks for the explanation!

  • @Dergicetea
    @Dergicetea ปีที่แล้ว +1

    Mr., I have got a question. In which tool did you design the Frequency Heatmap? It was very stylish and clear to interpret data.

    • @practicalreporting
      @practicalreporting  ปีที่แล้ว +1

      It was actually created in Excel, using conditional formatting (and making the numbers in the cells invisible, see support.microsoft.com/en-au/office/hide-or-display-cell-values-c94b3493-7762-4a53-8461-fb5cd9f05c33 )

    • @Pulvy10
      @Pulvy10 6 หลายเดือนก่อน

      @@practicalreportingthis will only work if the value we want to represents are inside the table. What if we wanted to see the frequency of that value and represent that in the heatmap?
      in this vid that would be for example 20 people with >120k salary. Our data would have 20 different names with >120k salary. To represent that 20 people we need a table of countif salary>120k and we need to do this for each salary band. Then we do the conditional formatting..
      CMIIW. Good vid!!!

    • @practicalreporting
      @practicalreporting  5 หลายเดือนก่อน +1

      @marior6662 To make a distribution heatmap, yes, each cell has to contain (or be associated with) the number (or %) of values that fall within that cell. That value then determines the color of that cell. The example that I showed was created in Excel using conditional formatting, and the numbers in the cells were made invisible using this trick: support.microsoft.com/en-us/office/hide-or-display-cell-values-c94b3493-7762-4a53-8461-fb5cd9f05c33#:~:text=Hide%20cell%20values&text=On%20the%20Home%20tab%2C%20click,Type%20%3B%3B%3B%20(three%20semicolons).

  • @MrX-wd8cm
    @MrX-wd8cm ปีที่แล้ว

    Underrated, pretty good explanantion

  • @stephanie_ong
    @stephanie_ong 3 ปีที่แล้ว +1

    Hi Nick, thank you so much for your very informative video. In my work it is not very common to display box and whisker plots to management. It is more common to show long-term average values, monthly average (I run scenarios in a model and do comparative analysis)

    • @practicalreporting
      @practicalreporting  3 ปีที่แล้ว

      Thanks, Nicole! There is definitely a risk in only showing averages and not the "shapes" of distributions, though. For example, different data sets can have very different distributions but the same average. Also, I no longer use box plots at all now, opting for other distribution chart types such as strip plots and distribution heatmaps instead; see nightingaledvs.com/ive-stopped-using-box-plots-should-you/

  • @agermoune
    @agermoune 3 ปีที่แล้ว +1

    Thx Nick for sharing your knowledge. What I like about ur teaching is how reasonable your arguments are and inspire me immediately when to apply what I learn from you.
    Subscribed immediately.
    Was looking for your Beyond dashboards book, but couldn't find it!

    • @practicalreporting
      @practicalreporting  3 ปีที่แล้ว +1

      Thanks, Abderrahim. The book has been delayed but will be out next year (2022) and it's been renamed "Practical Dashboards": www.practicalreporting.com/practical-dashboards-book-summary

    • @agermoune
      @agermoune 3 ปีที่แล้ว +1

      @@practicalreporting thanks for the heads up. Please keep your videos coming- your content and subjects are way different and solid compared to what is already published in the site.

  • @SweetPeachannel
    @SweetPeachannel 5 หลายเดือนก่อน +1

    very useful, thank you and great explanation/

  • @user-eco2023
    @user-eco2023 29 วันที่ผ่านมา +1

    Your explanation is just like strawberry - so easy to grasp and very delicious! Thank you very very much!

  • @xunnygujjar2094
    @xunnygujjar2094 3 ปีที่แล้ว +1

    what an explanation. I Appreciate. Thank You.

  • @XoCortanaXo
    @XoCortanaXo 4 หลายเดือนก่อน

    Super helpful! Thank you

  • @helenarc5790
    @helenarc5790 2 ปีที่แล้ว

    this explanation helped me, thank youuuu!!!

  • @t199589
    @t199589 10 หลายเดือนก่อน

    Oh my God thank you so much I finally understand it

  • @donakarunaratne6012
    @donakarunaratne6012 2 ปีที่แล้ว +1

    Amazing video! Thank you!

  • @o_O29866
    @o_O29866 9 หลายเดือนก่อน

    great job! thank you!

    • @practicalreporting
      @practicalreporting  9 หลายเดือนก่อน

      Glad you found the video to be useful!

  • @BS33875
    @BS33875 2 ปีที่แล้ว +1

    really nice, thank you.

  • @elviscalvinowusu385
    @elviscalvinowusu385 9 หลายเดือนก่อน

    precise and concise

  • @aleziafrimpong2828
    @aleziafrimpong2828 2 ปีที่แล้ว +1

    Please how will you report - 1.113 skewness

    • @practicalreporting
      @practicalreporting  2 ปีที่แล้ว

      Well, skewness in general (not just 1.113) will appear in a box plot as the "whisker" and/or "box" sections at one end of the box and whisker shape being shorter than the box and whisker shapes at the other end.
      In a distribution heatmap, skewness appears as colored cells at one extremity of a column of cells being darker than cells at the other extremity.
      Kind of hard to explain without visual aids...

    • @aleziafrimpong2828
      @aleziafrimpong2828 2 ปีที่แล้ว

      Okay

  • @Arqueovader
    @Arqueovader 2 ปีที่แล้ว +1

    Nice video, but how do you locate extreme values?

    • @practicalreporting
      @practicalreporting  2 ปีที่แล้ว

      I didn't cover it in the video, but there's a widely used convention for determining what's an outlier and showing them in a box plot. This article describes it: www.real-statistics.com/excel-capabilities/creating-box-plot-outliers-manually/

  • @spilledgraphics
    @spilledgraphics 4 ปีที่แล้ว +1

    Hi Nick, congrats on this exemplary explanation about box plot. Curious to know what´s your opinion on why people don´t find this chart very intuitive? (minute 04:20) .... are there any like sociological or maybe anthropological reasons to explain why people have a hard time understanding a very informative plot? Please if you have any links to refer me, I would greatly appreciate it. Lastly, do you make those charts on the video with Excel or Tableau? thanks, mate, greetings from Perú.

    • @practicalreporting
      @practicalreporting  4 ปีที่แล้ว +3

      Glad that you found the video to be useful. My comment about many people finding box plots to be unintuitive is based on my experiences explaining them to thousands of workshop participants, and the fact that they require an understanding of the abstract notion of quartiles (which very few people possess in most organizations) in order to be interpreted. I suspect that many people find frequency heatmaps to be more intuitive since we pre-attentively associate higher color intensity with higher quantities (in this case, higher concentrations of values), whereas people have weaker pre-attentive associations for box and whisker shapes. Frequency heatmaps also only require an understanding of bins, which are easier to grasp than the concept of quartiles.
      If you have a statistical background, you may find these reasons to be kind of silly (i.e., "quartiles aren't that hard to understand..."), but most people don't have statistical backgrounds ;-)

  • @user-sw5tx3pr7m
    @user-sw5tx3pr7m 9 หลายเดือนก่อน

    Thank you for this !!! :)

  • @adityaagrawal1636
    @adityaagrawal1636 3 ปีที่แล้ว +1

    unfortunate that such a relevant channel has 118 subscribers only given that it has been more than an year.

    • @practicalreporting
      @practicalreporting  3 ปีที่แล้ว

      Thanks, Aditya. This channel is mostly for supplementary information for my training workshop participants, so I don't actively promote it.

  • @joyprokash4013
    @joyprokash4013 2 ปีที่แล้ว +1

    Excellent sir.

  • @Popup-hr4wm
    @Popup-hr4wm 6 หลายเดือนก่อน

    Bro is LEGEND

  • @Must23
    @Must23 2 ปีที่แล้ว +1

    This is veery useful ty~

  • @shihhuiwang
    @shihhuiwang 3 ปีที่แล้ว

    Thanks for the video! just one question🙋 Why does the heatmap use 0%, 30%, and 60% instead of 0%, 25%, 50%, and 75%?

    • @practicalreporting
      @practicalreporting  3 ปีที่แล้ว +3

      Thanks, Janet. This is a bit of a coin toss, to be honest. In general, scales with intervals based on 1, 2 or 5 (integers that are naturally divisible into 10) make it easiest for people to perceive values in a chart so, yes, intervals of 30% (i.e., based on 3) aren't ideal. Intervals of 25% might be better than 30%, but I tend to avoid intervals based on 2.5 (not an integer that's naturally divisible into 10). The question, then is whether intervals of 5%, 10%, 20% or 50% would work better. Of these, only 20% would "fit" in a small scale like the one in the frequency heatmap. There would be a lot of stops on that small scale, though (0%, 20%, 40%, 60%), so it might be crowded-looking. Like I said, a bit of a coin toss between 30% (less intuitive but cleaner-looking) and 20% (more intuitive but more cluttered-looking).

  • @monicaeskander6147
    @monicaeskander6147 3 ปีที่แล้ว

    which software you are using to creat frequency heatmaps

    • @practicalreporting
      @practicalreporting  3 ปีที่แล้ว +1

      The sample frequency heatmap in this video was mocked up using Excel's conditional formatting feature, with the median lines added manually. I'm sure more clever people than I could figure out a less hacky way to do it in various other applications, probably using stacked bars.

  • @arandhir
    @arandhir 2 ปีที่แล้ว

    Great video 👍

  • @RadfanOjailah
    @RadfanOjailah 2 ปีที่แล้ว

    amazing

  • @ivanvakulenko9830
    @ivanvakulenko9830 11 หลายเดือนก่อน

    thanks

  • @danielpalacios7546
    @danielpalacios7546 10 หลายเดือนก่อน

    Thank U

  • @muhammadzaidhasan1426
    @muhammadzaidhasan1426 2 ปีที่แล้ว +1

    something abouut u says you r canadian

  • @anima8450
    @anima8450 ปีที่แล้ว +1

    AHHHHHHHHHHHH

  • @ebrahimemad5100
    @ebrahimemad5100 ปีที่แล้ว

    aswome