Jake Vanderplas - Statistics for Hackers - PyCon 2016.mp4

แชร์
ฝัง
  • เผยแพร่เมื่อ 16 มิ.ย. 2016
  • Speaker: Jake Vanderplas
    Statistics has the reputation of being difficult to understand, but using some simple Python skills it can be made much more intuitive. This talk will cover several sampling-based approaches to solving statistical problems, and show you that if you can write a for-loop, you can do statistics.
    Slides can be found at: speakerdeck.com/pycon2016 and github.com/PyCon/2016-slides

ความคิดเห็น • 50

  • @dannyholley
    @dannyholley 6 ปีที่แล้ว +6

    I wish any of my statistics teachers had even a fraction of his creativity or legitimate desire to impart knowledge. Bravo.

  • @abdullahabd7677
    @abdullahabd7677 4 ปีที่แล้ว +6

    He has the best book which is freely available online.

  • @fluffmiller1084
    @fluffmiller1084 7 ปีที่แล้ว +14

    A really nice intuitive intro to resampling methods in the space of 30 minutes. So many times, bootstrapping is just bundled into data science education without any attempt to make clear the specifics of what is actually being done, what the purpose is, and why it makes sense or is reasonable to use it.
    Highly recommended!

    • @Johnnyboycurtis
      @Johnnyboycurtis 7 ปีที่แล้ว

      Fluff Miller should probably study statistics instead of "data science"

  • @richardzheng231
    @richardzheng231 4 ปีที่แล้ว +24

    CSC 207 gang is here

  • @sibinh
    @sibinh 6 ปีที่แล้ว +5

    Another great talk from Jake! Thanks for suggesting the resources at 32:38

  • @jukebox54
    @jukebox54 8 ปีที่แล้ว +8

    Oh my god - brilliant talk.

  • @JoHeN1990
    @JoHeN1990 5 ปีที่แล้ว

    This really hits home with me. Especially during grad schools when you are absolutely ingrained with the solutions and methods that by the time you finished, you totally forgot about the real questions you were solving. Good reminder indeed! Thanks!

  • @isabellaghement9442
    @isabellaghement9442 7 ปีที่แล้ว +2

    Clever and creative use of examples from Dr. Seuss makes statistical concepts fun!

  • @user-wr4yl7tx3w
    @user-wr4yl7tx3w ปีที่แล้ว +1

    He’s such a great presenter.

  • @stevenlebeau
    @stevenlebeau 7 ปีที่แล้ว +8

    Having just taken an intro stats course (I passed!), I love this video. I like how he talks about "the right level of abstraction." Actually, I think the whole "right level of abstraction" thing applies not only to stats, but to a lot of math. I know Allen B. Downey has written some books on understanding statistics using python (Think Stats, Think Bayes, etc), and I think the same approach ought to be taken for other branches of mathematics, such as trigonometry and calculus.

  • @kavinadithiya6771
    @kavinadithiya6771 4 ปีที่แล้ว +57

    anyone watching this cause arnold told you to?

    • @acaxu6765
      @acaxu6765 4 ปีที่แล้ว

      That's me

    • @Finity_Dust
      @Finity_Dust 4 ปีที่แล้ว

      Same

    • @hyunbani
      @hyunbani 4 ปีที่แล้ว

      same

    • @jieyizheng6915
      @jieyizheng6915 4 ปีที่แล้ว

      Same

    • @lorian4366
      @lorian4366 4 ปีที่แล้ว +5

      it feels like a pretty random thing to be included in the assignment

  • @DistortedV12
    @DistortedV12 5 ปีที่แล้ว

    Excellent delivery, wish other stats teachers had the same charm, and creativity; would be nice to see a causal inference for hackers.

  • @piggybox
    @piggybox 7 ปีที่แล้ว

    Very clearly presented. Well done!

  • @user-bx7xc2gp7k
    @user-bx7xc2gp7k 6 ปีที่แล้ว +1

    Great! I learn a lot from the video~

  • @michelleaudirac3492
    @michelleaudirac3492 6 ปีที่แล้ว +5

    I loved Jake, his talk, the presentation, examples and the Dr. Seuss theme. Still, even as he tries not to make hard statements... he certainly does. Finding out 'differences between statistics and data science' is a great open discussion. Are they really any different? There is evidently an intuitive explanation to t-statistics, chi-square distributions and degrees of freedom. These terms, didn't come out of the blue...these are just tools developed by humans who were trying to solve or describe real world situations (and these people certainly had problem solving minds).

  • @cnliving
    @cnliving 7 ปีที่แล้ว +1

    Excellent talk~

  • @davidblake8612
    @davidblake8612 7 ปีที่แล้ว

    I enjoyed this talk.

  • @CharlesDibsdale
    @CharlesDibsdale 5 ปีที่แล้ว

    Fabulous - many thanks

  • @thewilfreds
    @thewilfreds 3 ปีที่แล้ว

    Quant_King's post on Hacker Statistics on reddit brought me here.

  • @luisrueda6109
    @luisrueda6109 7 ปีที่แล้ว

    The resampling book mentioned is out of print. Can somebody recommend an alternative?

  • @gopikrishna9121
    @gopikrishna9121 5 ปีที่แล้ว

    Great talk

  • @ChernobylPizza
    @ChernobylPizza 4 ปีที่แล้ว +2

    14:38 This is anti-starbellied Sneech half-truth propaganda. You need a higher Sneech sample size. With your low sample size, it was almost guaranteed that the 6.6 point average difference would not be significant. Sample more Sneeches, and you may just find that average difference becomes quite significant at p =0.05

  • @ibraheemmoosa
    @ibraheemmoosa 2 ปีที่แล้ว

    The caveats at 21:00

  • @forheuristiclifeksh7836
    @forheuristiclifeksh7836 ปีที่แล้ว

    20:44

  • @thomasnn
    @thomasnn 6 ปีที่แล้ว

    Awesome

  • @yash1152
    @yash1152 ปีที่แล้ว

    Coming here via doc py lib random see also documentation

  • @ibraheemmoosa
    @ibraheemmoosa 2 ปีที่แล้ว

    When you know the analytical answer for something, using simulation is dumb. To give an example, we know the equation for the sum of the first n natural numbers. So we can calculate this in O(1). Now if you do not use the equation and just use for loop to calculate the sum, the cost is O(n). Why on earth would you go for O(n) algortihm when you have an O(1) algorithm? Another example would be using bogosort intead of quicksort.