Data Science and Statistics: different worlds?

แชร์
ฝัง
  • เผยแพร่เมื่อ 5 ก.ย. 2024
  • Chris Wiggins (Chief Data Scientist, New York Times)
    David Hand (Emeritus Professor of Mathematics, Imperial College)
    Francine Bennett (Founder, Mastodon-C)
    Patrick Wolfe (Professor of Statistics, UCL / Executive Director, UCL Big Data Institute)
    Zoubin Ghahramani (Professor of Machine Learning, University of Cambridge)
    Chair: Martin Goodson (Vice-President Data Science, Skimlinks)
    Discussant: John Pullinger (UK National Statistician)
    In the last few years data science has become an increasingly popular discipline. Often linked to the use and analysis of ‘big data’, data scientists are seen as the new professionals who can unlock the potential of an increasingly data-rich world, and to generate economic and social benefits from the data revolution.
    However within the world of statistics, the ‘big data’ and ‘data scientist’ developments are sometimes labelled as hypes, and ‘data science’ is seen as a rebranding of what should be statistics. One of the often heard criticisms of big data analytics is that there’s a lack of statistical rigour which can lead to the wrong decisions.
    As with any new discipline there are questions about exactly what data science is. Has the relevance of statistics been diminished because of new types of data or technologies which need a radical new approach? Is data science about ‘getting the job done’, and statistics about the deeper scientific understanding? Are our universities offering students the right skill sets to meet the high demand for data scientists?

ความคิดเห็น • 46

  • @sasha-leighpaules4295
    @sasha-leighpaules4295 7 ปีที่แล้ว +36

    I really enjoyed the discussion. Yes, statistics is difficult. But it needs to be understood to be used correctly. I'd like to hear what this group says now, currently about the combination of programming and statistics. Data science has been included as a degree at my university, however it is still so difficult to understand how the data scientist actually is defined. However, after 3 years of statistics, when you get to the end of all the theoretical learning and finally get into putting some stuff in R, it is really amazing to see how everything comes together and how modelling can finally be done. It is something truly satisfying. And I think the rigorous statistical background is quite necessary to get there and truly enjoy and understand the experience. I think the true data scientist is someone who enjoys the process and beauty of statistics, as well as the processes and beauty of computer science. Not someone who leans to either side, but rather enjoys putting them together.

  • @jamespaz4333
    @jamespaz4333 4 ปีที่แล้ว +10

    We need these series on Netflix right inmediately

  • @user4732_
    @user4732_ 4 ปีที่แล้ว +4

    Absolutely brilliant stuff, with some brilliant minds. Thank you. 5 years on and the debate still stands. It would be great if the same panel got together again in 2021 for a new debate. :)

  • @bennaarsongidi9269
    @bennaarsongidi9269 3 หลายเดือนก่อน

    Intellectual power exhibited by Hand is on another level .

  • @TusharKale9
    @TusharKale9 7 ปีที่แล้ว +8

    I enjoyed the comparison between Statistics and Mathematics by senior staff members both from academia and industry.

    • @mathhelpmadeeasy
      @mathhelpmadeeasy 7 ปีที่แล้ว

      Tushar Kale I agree. It was an cool to hear the correlation

  • @floralingalupo5995
    @floralingalupo5995 9 ปีที่แล้ว +3

    One of the most interesting discussions about this topic. Insightful!

  • @andresrossi9
    @andresrossi9 5 ปีที่แล้ว +8

    It's true that a data scientist is a better statistician than a computer engineer and better computer engineer than a statistician, BUT, it's worse computer engineer than a computer engineer, and worse statistician than a statistician.

    • @michaelpieters1844
      @michaelpieters1844 2 ปีที่แล้ว +2

      I am a statistician and I am a much better programmer than most data scientists. No idea where this old view comes from where people think statisticians can't program. What I also find strange is when some job includes some programming as a tool, it is immediately associated with computer science. While I was modeling astrophysical processes, I was a computational physicist, NOT a software engineer.

  • @bralis2
    @bralis2 7 ปีที่แล้ว +3

    It is interesting that a data scientist should have a background in both statistics and computer science. Both these fields are rigorous. Therefore, I support an idea that division not only of labour but also knowledge is the most effcient way for the data science field how to evolve. Actually these two should not be brought together, but left seperated.

  • @ispinozist7941
    @ispinozist7941 6 ปีที่แล้ว +3

    Giving this a thumbs up for the statistician!! 👏🏻👍🏻

  • @diahidvegi8536
    @diahidvegi8536 4 ปีที่แล้ว +3

    23:00 - 24:50 Such an accurate and interesting observation....and so perfectly said.

  • @victordepedrazaajenjo979
    @victordepedrazaajenjo979 2 ปีที่แล้ว

    I'll prefer being dead than having to listen to this guys for 1 and a half hour

  • @500sf
    @500sf 9 ปีที่แล้ว +1

    From the floor questions, its sounds like the statistics teaching community could take a leaf from the way graduate business schools teach using the case method. Perhaps this might be a new line for HBS?

  • @linchenpal
    @linchenpal 4 ปีที่แล้ว +6

    All subsets of Maths. :)

  • @AshishSharma-bd6mh
    @AshishSharma-bd6mh 4 ปีที่แล้ว

    I thoroughly enjoyed this talk, and great learning too. Thank you RSS 😄

  • @jamespaz4333
    @jamespaz4333 3 ปีที่แล้ว +1

    9:23 he definetely brome the ice

  • @pallaviharishchandre3021
    @pallaviharishchandre3021 3 ปีที่แล้ว +2

    I'm having hard time choosing between msc stats or msc data science after bsc stats

    • @denzokyedravk
      @denzokyedravk ปีที่แล้ว

      Which was the better option?
      Please advise. I am faced with the same conundrum?

    • @pallaviharishchandre3021
      @pallaviharishchandre3021 ปีที่แล้ว +2

      @@denzokyedravk okay I've completed MSc statistics and i would suggest you to go for data science

    • @denzokyedravk
      @denzokyedravk ปีที่แล้ว

      @@pallaviharishchandre3021 Why though? Why would you recommend data science over MSc in Statistics?

    • @pallaviharishchandre3021
      @pallaviharishchandre3021 ปีที่แล้ว +1

      @@denzokyedravk see.. If you want to learn to be a professor or just gain knowledge statistics is fantastic.. But irl if you want a job i would suggest data science

  • @fayazahmad9222
    @fayazahmad9222 8 ปีที่แล้ว +2

    big data also involved data , anything anywhere you want statistics will involved so big data is the small part of statistics

  • @JCResDoc94
    @JCResDoc94 9 ปีที่แล้ว +3

    23:20 UG stats

  • @JCResDoc94
    @JCResDoc94 9 ปีที่แล้ว +4

    51:00 Stats is its own discipline or else math funding issues

  • @eli8069
    @eli8069 5 ปีที่แล้ว

    104:32-113:45 An ideal curriculum for a data scientist and staticians..?

  • @avro549B
    @avro549B 8 ปีที่แล้ว

    "Big Data" means data volumes too large to handle conveniently with current technology. It's been around for a very long time; it's just the orders of magnitude that change. The question is whether there really is a separate trade of "Data Scientist"? Maybe there are already people who can do what is wanted, just using different names for the job?

  • @AO-rw5xg
    @AO-rw5xg 8 ปีที่แล้ว

    Instead of placing all these disciplines into one and calling it data science just create a team of statisticians, analyst, computer scientist and any other field to work on the project at hand

    • @sebastian8538
      @sebastian8538 5 ปีที่แล้ว +2

      Sure, I think in large scale projects that is still the case. Yet the need for a generalist, knowing of all those fields to a solid degree might still be in need to be on the upper end of planning, and managing the processes.

  • @nkristianschmidt
    @nkristianschmidt 3 ปีที่แล้ว

    since people can do data science knowing just some stats, some math, some programming; data science is a subset of none of these.

  • @AlejandroLopez-mg9xr
    @AlejandroLopez-mg9xr ปีที่แล้ว

    vengo de la uji :(

  • @alwizardus
    @alwizardus 7 ปีที่แล้ว +1

    Do you need exceptional hacking skills to be an awesome Data Scientist?

    • @SuperNishant92
      @SuperNishant92 7 ปีที่แล้ว

      I don't think so

    • @drapala97
      @drapala97 5 ปีที่แล้ว

      Hackers are specialists in cyber security. Data scientists are good programmers and understand quite a bit of machine learning.

    • @jamespaz4333
      @jamespaz4333 4 ปีที่แล้ว

      Hacking means to dominate different realms in this context. A data scientist must be the joint of Math/Stats/Machine Learning/Subject Matter

  • @FirstNameLastName-fv4eu
    @FirstNameLastName-fv4eu 8 ปีที่แล้ว +7

    Great panel, but dont think they have ever delivered a solution in production working with crazy, demanding and non-mathematical Banking or any other client. They are great in knowing the theories but real world is totally different than theories. To be a good data scientist, you need only 2 things - 1. "I can do it" attitude and 2. Common sense. Nothing else.

  • @synapss21
    @synapss21 8 ปีที่แล้ว +7

    Data Science is more about linear algebra than statistics !

    • @FirstNameLastName-fv4eu
      @FirstNameLastName-fv4eu 8 ปีที่แล้ว +3

      Looks like you mainly do recommendation system. Statistics comes in translating a business problem into analytical problem, linear algebra helps in solving all the big matrix calculations.

    • @vladimiriurcovschi1657
      @vladimiriurcovschi1657 6 ปีที่แล้ว +2

      Synaps Diallo as a mathematician and data scientist couldn’t disagree more with you

    • @nishantm2087
      @nishantm2087 6 ปีที่แล้ว

      This viewpoint of them versus us does not feel productive. Linear algebra itself is enjoying the fruits of statistical methods via randomized numerical linear algebra algorithms to speed up fundamental operations like matrix multiplication and solving linear systems. Check out e.g. recent work on oblivious subspace embeddings. So, my point is, classical linear algebra has been fundamental in supporting implementations of statistical methods, and modern linear algebra benefits from advanced statistical analyses.

    • @scottsimmons9296
      @scottsimmons9296 5 ปีที่แล้ว

      Good

  • @victordepedraza8460
    @victordepedraza8460 2 ปีที่แล้ว

    I hate this