Data science in Python: pandas, seaborn, scikit-learn

  • Published on 5 Aug 2024
  • In this video, we'll cover the data science pipeline from data ingestion (with pandas) to data visualization (with seaborn) to machine learning (with scikit-learn). We'll learn how to train and interpret a linear regression model, and then compare three possible evaluation metrics for regression problems. Finally, we'll apply the train/test split procedure to decide which features to include in our model.
    Download the notebook: github.com/justmarkham/scikit...
    pandas installation instructions: pandas.pydata.org/pandas-docs/...
    seaborn installation instructions: seaborn.pydata.org/installing....
    Longer linear regression notebook: github.com/justmarkham/DAT5/b...
    Chapter 3 of Introduction to Statistical Learning: www-bcf.usc.edu/~gareth/ISL/
    Videos related to Chapter 3: www.dataschool.io/15-hours-of...
    Quick reference guide to linear regression: www.dataschool.io/applying-an...
    Introduction to linear regression: people.duke.edu/~rnau/regintro...
    pandas Q&A video series: www.dataschool.io/easier-data...
    pandas 3-part tutorial: www.gregreda.com/2013/10/26/in...
    pandas read_csv documentation: pandas.pydata.org/pandas-docs/...
    pandas read_table documentation: pandas.pydata.org/pandas-docs/...
    seaborn tutorial: seaborn.pydata.org/tutorial.html
    seaborn example gallery: seaborn.pydata.org/examples/in...
    WANT TO GET BETTER AT MACHINE LEARNING? HERE ARE YOUR NEXT STEPS:
    1) WATCH my scikit-learn video series:
    • Machine learning in Py...
    2) SUBSCRIBE for more videos:
    th-cam.com/users/dataschool?su...
    3) JOIN "Data School Insiders" to access bonus content:
    / dataschool
    4) ENROLL in my Machine Learning course:
    www.dataschool.io/learn/
    5) LET'S CONNECT!
    - Newsletter: www.dataschool.io/subscribe/
    - Twitter: / justmarkham
    - Facebook: / datascienceschool
    - LinkedIn: / justmarkham
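
The train/test workflow described above can be sketched roughly as follows. This is a minimal illustration, not the notebook from the video: the data is synthetic, the column names (`TV`, `Radio`, `Other`, `Sales`) and the helper `evaluate` are hypothetical, and the three metrics are assumed here to be MAE, MSE, and RMSE.

```python
import numpy as np
import pandas as pd
from sklearn.linear_model import LinearRegression
from sklearn.metrics import mean_absolute_error, mean_squared_error
from sklearn.model_selection import train_test_split

# Synthetic stand-in for an advertising-style dataset (hypothetical values,
# not the data used in the video). "Other" has no effect on Sales by design.
rng = np.random.default_rng(0)
data = pd.DataFrame({
    "TV": rng.uniform(0, 300, 200),
    "Radio": rng.uniform(0, 50, 200),
    "Other": rng.uniform(0, 100, 200),
})
data["Sales"] = 3 + 0.05 * data["TV"] + 0.2 * data["Radio"] + rng.normal(0, 1, 200)


def evaluate(feature_cols):
    """Fit on a training split and return (MAE, MSE, RMSE) on the test split."""
    X = data[feature_cols]
    y = data["Sales"]
    # The same random_state gives the same rows in each split on every call,
    # so different feature sets are compared on identical test data.
    X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=1)
    model = LinearRegression().fit(X_train, y_train)
    y_pred = model.predict(X_test)
    mae = mean_absolute_error(y_test, y_pred)
    mse = mean_squared_error(y_test, y_pred)
    rmse = np.sqrt(mse)  # RMSE is in the same units as the response
    return mae, mse, rmse


with_other = evaluate(["TV", "Radio", "Other"])
without_other = evaluate(["TV", "Radio"])
print("RMSE with Other:   ", round(with_other[2], 3))
print("RMSE without Other:", round(without_other[2], 3))
```

If dropping a feature leaves the test-set error unchanged or lower, that is some evidence the feature is not helping the model generalize.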

Comments • 574

  • @dataschool
    @dataschool  3 years ago +9

    Having problems with the code? I just finished updating the notebooks to use *scikit-learn 0.23* and *Python 3.9* 🎉! You can download the updated notebooks here: github.com/justmarkham/scikit-learn-videos

  • @dataschool
    @dataschool  8 years ago +22

    Want to learn more pandas? I have a new video series about it: th-cam.com/play/PL5-da3qGB5ICCsgW1MxlZ0Hq8LL5U3u9y.html

  • @Emmaizam
    @Emmaizam 5 years ago +22

    These are the best ML tutorials I have ever seen! Thank you very much, Sir.

  • @prachinainawa3055
    @prachinainawa3055 3 years ago +6

    I'm a beginner, but your way of teaching makes me love machine learning; it feels so easy. You even help me understand how the algorithm works behind the scenes. Love from India...

    • @dataschool
      @dataschool  3 years ago +1

      That's awesome to hear! 😊

  • @TheBurningofSolomon
    @TheBurningofSolomon 7 years ago +17

    MANY THANKS!!!
    All other data science tutorials (for beginners) go by way too quickly. Some people may find you going slowly a nuisance, but I found it to be EXTREMELY HELPFUL. THANK YOU! Subbed ^__^

    • @dataschool
      @dataschool  7 years ago +4

      Awesome! That's so great to hear... thanks very much for your comment!

    • @XaccountFr
      @XaccountFr 4 years ago +2

      @dataschool Yes, a very good explanation for a beginner like me.

  • @pratikdhumal3975
    @pratikdhumal3975 7 years ago +11

    I had been searching for good videos on ML for a long time. After following this series, I can say that it is the best I have ever seen. Each and every concept is covered in great detail. The same applies to the study material and links. Thanks, Data School!

    • @dataschool
      @dataschool  7 years ago +1

      That is great to hear, thanks so much for your very kind words!!

  • @LekanMakanju
    @LekanMakanju 1 year ago +2

    This is unreal! I literally abandoned my datacamp machine learning course for this one and no regret at all. I especially like that you taught the underlying mathematical concept of how these codes come to be. You also speak clear and understandable English plus the sound system is top notch. I've taken your Data science course and your and prof Allen's remains my best to date with Hugo's coming in a distant 3rd. And to think you recorded this more than 7 years ago makes you conclude that this is way ahead of its time

    • @dataschool
      @dataschool  1 year ago

      Thank you so much for your kind words, Moruf! 🙏

  • @aegystierone8505
    @aegystierone8505 4 years ago

    Really appreciate that you also explain the algorithms and how to find the coefficient governing the equations. Thank you so much!

  • @lakswin
    @lakswin 4 years ago +5

    Kinda complete one, putting together all at-once! The best, I have watched until now!

  • @dataschool
    @dataschool  6 years ago +20

    *Note:* This video was recorded using Python 2.7 and scikit-learn 0.16. Recently, I updated the code to use Python 3.6 and scikit-learn 0.19.1. You can download the updated code here: github.com/justmarkham/scikit-learn-videos

    • @rael213rd
      @rael213rd 5 years ago +2

      Can we please get a video about ensemble learning (bagging and boosting)

  • @andrewsanchez4349
    @andrewsanchez4349 7 years ago

    Definitely one of the best tutorials I've ever watched. Can't wait to work through the 3 hour presentation at the end of this. Thank you!

    • @dataschool
      @dataschool  7 years ago

      Thanks so much for your very nice comment! You're very welcome! :)

  • @tissues2441
    @tissues2441 6 years ago +1

    You're a way better instructor than my college professors. The syntax is fairly simple and the explanation of the statistical intuition behind the metrics made this enjoyable.

    • @dataschool
      @dataschool  6 years ago

      Thanks very much for your kind words! Really appreciate it!

  • @AashishKumar1
    @AashishKumar1 8 years ago

    This is the best video tutorial series on Machine learning I have seen. You have hooked me up! Thanks for creating the series and you are an amazing teacher. Keep it up!

    • @dataschool
      @dataschool  8 years ago

      +Aashish Kumar You're very welcome, and thanks for your kind words!

  • @Superdooperhero
    @Superdooperhero 7 years ago +3

    I watch way too many training videos, and I would like to say that I wish you were the presenter in all of them. You rule at this training thing!

    • @dataschool
      @dataschool  7 years ago

      Thanks so much! :)

  • @DenzilJoseph
    @DenzilJoseph 6 years ago

    Excellent description of the end-to-end ML flow. Thank you.

  • @mukulkathpalia6924
    @mukulkathpalia6924 7 years ago +2

    These are the best tutorial series on machine learning.

    • @dataschool
      @dataschool  7 years ago

      Wow, thank you so much!

  • @faroukobafemi9496
    @faroukobafemi9496 4 years ago +1

    To be candid, this is the best video I've ever watched on scikit-learn. Thumbs up!!!

    • @dataschool
      @dataschool  4 years ago

      That's awesome to hear... thank you! 🙏

  • @guptaachin
    @guptaachin 7 years ago

    You are undeniably the best tutor I have ever had. Thank you for teaching DS precisely. :)

    • @dataschool
      @dataschool  7 years ago

      Wow, thank you! I'm glad my teaching style works well for you :)

  • @terryhenyo9216
    @terryhenyo9216 4 years ago

    Your video tutorial is outstanding! You can simplify complex concepts in an elegant manner. And unlike other instructors, you don't show off how smart you are. That's why we know that you're really a smart guy :)

    • @dataschool
      @dataschool  4 years ago

      Thank you SO MUCH for this kind comment! I truly appreciate it.

  • @vamsikrishna1131
    @vamsikrishna1131 5 years ago +1

    Lots of great information at the end and links in the description. Very valuable. Really appreciate it!

  • @injypal
    @injypal 5 years ago +1

    Please add more videos to the series. It is really helpful and amazing to watch your videos. You are a great teacher.

    • @dataschool
      @dataschool  5 years ago

      Thanks for your suggestion, and for your kind words!

  • @kennyl7542
    @kennyl7542 8 years ago +16

    Wonderful videos! I would like you to focus on scikit-learn. Your style of teaching, which combines hands-on work with scikit-learn, real examples, and explanations of ML techniques, is very helpful!

    • @dataschool
      @dataschool  8 years ago

      +Kenny L Thanks for your kind comments and your feedback!

    • @nackyding
      @nackyding 7 years ago +1

      Word! I agree with you!

    • @joancolon635
      @joancolon635 6 years ago

      Kenny L i

  • @lubojurciak2525
    @lubojurciak2525 5 years ago +1

    I wish you were my data analysis lecturer... Thank you very much for this.

    • @dataschool
      @dataschool  5 years ago

      Thanks very much for your kind words!

  • @danielandreasen2293
    @danielandreasen2293 9 years ago +1

    As for an answer to your question: I would like to learn more about sklearn. Pandas is amazing, and I'm just starting to learn it, but there are already a lot of nice tutorials out there. Keep up the good job :)

    • @dataschool
      @dataschool  9 years ago

      Daniel Andreasen Good point! There are lots of Pandas tutorials already out there.

  • @JackSimpsonJBS
    @JackSimpsonJBS 9 years ago +3

    Thank-you so much for your explanations of sk-learn, it finally makes sense to me! I'm already pretty familiar with Pandas so I'd love to learn more about sk-learn, because I feel there are so many other machine learning algorithms I'd love to get my head around.

    • @dataschool
      @dataschool  9 years ago

      ***** Nice! I love to hear that my explanations are helping things to "click" for people. Thanks for your comment!

  • @doupanpan7271
    @doupanpan7271 6 years ago

    really thankful for your video series. it is straightforward and easy to understand, highly recommend to other guys who are interested in python, machine learning etc.

    • @dataschool
      @dataschool  6 years ago

      Awesome! Thanks for sharing it with others :)

  • @BluntAmericanHistory
    @BluntAmericanHistory 8 years ago +1

    Your videos are fantastic, for people with random gaps in their knowledge you explain things very clearly.

    • @BluntAmericanHistory
      @BluntAmericanHistory 8 years ago

      +Siddharth Gupta For people who have random chunks of exposure to certain aspects of sklearn/pandas/etc: watch the video at 1.25 or 1.5x speed. You can get through the lesson faster, and the increased speed will actually have a counterintuitive effect of making you focus more. Also when you start losing focus or miss a concept, you will notice right away because you will suddenly be totally lost, so you will know to rewind.

    • @dataschool
      @dataschool  8 years ago

      +Siddharth Gupta Thanks for your kind comments!

  • @priyaponnus8620
    @priyaponnus8620 2 years ago

    Thank you for the awesome videos. I am currently learning machine learning as part of a course. I don't have previous knowledge of Python (I am currently learning an introduction to Python as well), and I am really struggling to understand. This is my midterm break, and I found one of your videos while I was searching. I am one of the fortunate ones to have found your videos. Thanks for your effort.

    • @dataschool
      @dataschool  2 years ago

      You're very welcome! Glad I could help!

  • @dianawilliams9470
    @dianawilliams9470 5 years ago

    Thank you! Your videos are helping to make the concepts click! This is the best resource I have found

    • @dataschool
      @dataschool  5 years ago

      You're very welcome!

  • @umashankarverma3179
    @umashankarverma3179 5 years ago

    Your teaching methodology is the best. Your step-by-step teaching method is very helpful for my understanding. You are the best.

  • @sribastavrajguru304
    @sribastavrajguru304 7 years ago

    Great work, please upload more tutorials like these; they're really helpful for getting started.
    Before watching this tutorial I was not at all aware of ML, but now, after watching 4/5 videos, I've got a good overview. Thank you!

    • @dataschool
      @dataschool  7 years ago

      Great to hear! Thanks for your kind comment.

  • @julians.2597
    @julians.2597 5 years ago

    Wow, one of the best YT tutorials about this topic, thank you!

  • @gauravmitra3683
    @gauravmitra3683 8 years ago +1

    This is one of the best available online resources for an introduction to data science. Thank you for these amazing videos. It's teachers like you who inspire students like me :)

    • @dataschool
      @dataschool  8 years ago

      Wow, what a kind comment! Thank you so much!

    • @arjunbakshi810
      @arjunbakshi810 4 years ago

      Gaurav, I'm having trouble reading advertisemets.csv
      Can you help me?

  • @AntonioAugustoVianaS
    @AntonioAugustoVianaS 9 years ago

    More pandas please! And more Seaborn!
    A large part of Machine Learning is "messing" with the data BEFORE you apply any of the algorithms to it, and pd and sns are really good at that.
    Also, I think it'd be interesting (maybe later in the series) if you could go through an all-out example, like working with the Titanic dataset from Kaggle, giving hints on how to visualize and understand the data and choose the best algorithm for it.
    As a final note, I'm already a bit familiar with the techniques you use, but your comments and clear explanations make everything clearer and help me retain some of these techniques.
    Thank you for that! Excellent series, and keep up the good work.

    • @dataschool
      @dataschool  9 years ago

      Antonio Augusto Santos Thanks for the feedback! I am planning to cover more examples later in the series, probably using a Kaggle competition. And, I appreciate your kind words! I was hoping to reach both users new to machine learning and those with some machine learning familiarity, so it's nice to hear that it's working :)

  • @WillGoesMeta
    @WillGoesMeta 7 years ago

    Thank you so much for having this series!

  • @JoannaChmielewska_uk
    @JoannaChmielewska_uk 8 years ago

    Thank you for making the effort to produce these videos. It's a great resource and your delivery is superb.

    • @dataschool
      @dataschool  8 years ago

      Wow, what a kind compliment, thank you so much!

  • @igorfigueredo5040
    @igorfigueredo5040 6 years ago

    Hi, I'm a beginner in data science and your videos are helping me a lot, thanks.

  • @MrChristian331
    @MrChristian331 5 years ago

    Say one thing....you are an excellent teacher. My teachers at engineering school and on Udemy don't explain things half as well as you do! That should tell you a lot!
    I wish I could hire you personally.

    • @dataschool
      @dataschool  5 years ago

      Thanks so very much for your kind words! You might be interested in joining my membership community: www.patreon.com/dataschool

  • @The2002962
    @The2002962 7 years ago

    Tutorial content is pretty cool. adding humor while explaining will add good experience for learners. :)

  • @flamboyantperson5936
    @flamboyantperson5936 6 years ago

    You are the best teacher in the world. I learned something very important to me in this video. Thank you so much. Please keep the good work going.

    • @dataschool
      @dataschool  6 years ago

      Wow! Thank you so much for the very kind comment! Good luck to you :)

  • @serdarb8995
    @serdarb8995 6 years ago +1

    Hi Kevin, First of all thank you very much for those great videos. If you have a chance to make tutorial regarding deep learning it would be great. You are the best instructor, I've ever seen in this field. You are the best

    • @dataschool
      @dataschool  6 years ago

      Thanks so much for your kind words, and for your suggestion!

  • @robindong3802
    @robindong3802 6 years ago +1

    you made it so easy to learn. you lead me to ML right here. Thank you so much.

    • @dataschool
      @dataschool  6 years ago

      You're very welcome!

  • @sofjakovalevskaya1446
    @sofjakovalevskaya1446 5 years ago +1

    Really perfect explanation and walk through. Thanks a lot!

    • @dataschool
      @dataschool  5 years ago

      You're very welcome!

  • @your_buddy_11
    @your_buddy_11 4 years ago +1

    Thank you very much
    Your teaching methodology is awesome making things crystal clear.

  • @_SoundByte_
    @_SoundByte_ 6 years ago

    Thanks for your lessons :-)
    Clear, detailed and to the point.

    • @dataschool
      @dataschool  6 years ago

      Thanks for your kind comments!

  • @fritz0199
    @fritz0199 7 years ago +1

    This series is amazing, thank you!

    • @dataschool
      @dataschool  7 years ago

      You're welcome! Thanks for your kind words!

  • @AvivProg
    @AvivProg 8 years ago

    Watched all your videos. Your teaching skills are amazing, thank you for compiling those videos.
    I'm looking forward to your next videos about machine learning using sklearn.

    • @dataschool
      @dataschool  8 years ago

      +AvivProg Wow, thank you! You are very welcome -- I enjoyed creating the videos.
      Here is the playlist containing the entire video series: th-cam.com/play/PL5-da3qGB5ICeMbQuqbbCOQWcS6OYBr5A.html

  • @Huhyht
    @Huhyht 7 years ago

    Very good content. I have tried so many video series for data science and this is by far the best! Thanks!

    • @dataschool
      @dataschool  7 years ago

      That's great to hear - thanks so much for your kind comment!

  • @genaugenaugenau
    @genaugenaugenau 7 years ago

    This guy is great at teaching. Much appreciated!

    • @dataschool
      @dataschool  7 years ago

      Thanks for your kind comment!

  • @samkumargupta2536
    @samkumargupta2536 6 years ago +1

    Really awesome tutorials, sir...
    It's very easy to understand... Better than other ML tutorials I have watched...☺☺☺

    • @dataschool
      @dataschool  6 years ago

      Thanks for your kind comment!

  • @edrissemussa371
    @edrissemussa371 4 years ago +1

    Thanks a lot for this great material you've put together. Very very helpful!

  • @suemareverton7756
    @suemareverton7756 8 years ago +1

    These videos helped me a lot! Thank you so much!!

    • @dataschool
      @dataschool  8 years ago

      Great, I'm glad the series is helpful to you!

  • @Dexter01
    @Dexter01 4 years ago

    I am answering your question 5 years later but I would love to see more video tutorials from you about scikit-learn (e.g Neural network models (supervised)) or
    scikit-multilearn if you want!! :) Thnx a lot Kevin!

    • @dataschool
      @dataschool  4 years ago

      Thanks for your suggestions!

  • @eturkoz
    @eturkoz 5 years ago +1

    Your explanations are wonderful. Thank you.

  • @libardomm.trasimaco
    @libardomm.trasimaco 6 years ago

    I absolutely love what you do!. Thank you very very much!

    • @dataschool
      @dataschool  6 years ago +1

      You are very very welcome!

  • @unstatic_electronics
    @unstatic_electronics 9 years ago

    Excellent and straight to the point content again. Thanks a lot for the videos and also the additional references you provide. It's always good to know where to go next :)
    And please continue on with scikit-learn rather than pandas/seaborn.

    • @dataschool
      @dataschool  9 years ago

      Romain Lepert Thanks for the feedback! :)

  • @reassassinator
    @reassassinator 6 years ago

    Your videos really helped me understand the sklearn basics easily. It would be great if you could do a similar video series on SVMs using scikit-learn and its applications. Your explanations and methods are great!
    Thanks a lot!

    • @dataschool
      @dataschool  6 years ago

      Thanks for your suggestion as well as your kind words! I appreciate it :)

  • @sharlene1176
    @sharlene1176 8 years ago

    Thank you very much for this video series!!! This is really helpful!

    • @dataschool
      @dataschool  8 years ago

      +shalin LUO You're very welcome!

  • @datadriven659
    @datadriven659 4 years ago

    thank you! very clear and helpful

  • @RicardoFerrazLeal
    @RicardoFerrazLeal 9 years ago

    Pretty amazing video!
    +1 for sk-learn as next video in this series. I also think that plotting stuff helps a lot. Whenever possible it would be nice to show seaborn in action.
    Great job and looking forward to the next one.

    • @dataschool
      @dataschool  9 years ago

      Ricardo Ferraz Leal Thanks for the feedback!

  • @bogdanjcnd
    @bogdanjcnd 7 years ago

    I totally agree, an excellent guide for data learning, visualisation and machine learning. Great work

    • @dataschool
      @dataschool  7 years ago

      Thanks for your kind comment!

  • @MrMmahesh007
    @MrMmahesh007 7 years ago

    amazing videos. Very streamlined and easy to understand.

  • @raghug2073
    @raghug2073 6 years ago

    A really great way of teaching. I really liked your pace and pronunciation, the possible mistakes you cover, and the explanations. This is a great series and you are a great tutor. I'm a fan and have subscribed. Please make separate series on machine learning (a bit more detailed), deep learning, AI, and data science. I am not sure which one should be learned first, or how. I have decided you are the best guru to get me to a good level in all these skills. Please help.

    • @dataschool
      @dataschool  6 years ago

      Thanks for your suggestions! I'll consider them for the future :)

  • @yffzju3405
    @yffzju3405 7 years ago +1

    Cool video! I just finished your pandas video series, but I thought pandas should be learned before sklearn. Well, anyway, thank you for making such great videos for us.

    • @dataschool
      @dataschool  7 years ago

      Great! I also have a scikit-learn video series: th-cam.com/play/PL5-da3qGB5ICeMbQuqbbCOQWcS6OYBr5A.html

  • @shivbalaji8286
    @shivbalaji8286 6 years ago +2

    You are doing a great job !!!!!! Thank you very much for all your valuable videos !!! They are really helping me !!!! Thanks again :-)

    • @dataschool
      @dataschool  6 years ago

      That's great to hear! I'm glad the videos are helpful to you!

  • @JCRMatos
    @JCRMatos 9 years ago

    Another excellent video.
    Please continue to focus on ML and scikit-learn.

    • @dataschool
      @dataschool  9 years ago

      João Matos Thanks for your feedback, much appreciated!

  • @FarizDarari
    @FarizDarari 5 years ago

    very clear articulation, thanks

  • @elivazquez7582
    @elivazquez7582 6 years ago

    Great videos - all of them! Thanks for doing this.

    • @dataschool
      @dataschool  6 years ago

      Thanks for your kind comment!

  • @nackyding
    @nackyding 7 years ago

    Thanks. Awesome tutorials. I'm learning a lot. Thank you again.

    • @dataschool
      @dataschool  7 years ago

      You're very welcome!

  • @siming07
    @siming07 8 years ago

    Thank you so much for the video, really great introduction to Pandas and SKlearn, I hope you can focus more on the sklearn with pandas dataframe, again, thanks for the great video!

    • @dataschool
      @dataschool  8 years ago

      +Siming Zhao You're very welcome, and thanks for your comment!

  • @sebastianpinedaarango8239
    @sebastianpinedaarango8239 9 years ago

    Great video!! Thanks for that. I'd like to keep learning about Scikit-learn. Although, Pandas is also definitely a powerful Python data analysis toolkit.

    • @dataschool
      @dataschool  9 years ago

      Sebastian Pineda Arango Glad you liked it! Thanks for the feedback.

  • @aracelyssunico8116
    @aracelyssunico8116 6 years ago

    Super helpful! Your explanations are clear and clean :) Thanks

    • @dataschool
      @dataschool  6 years ago

      You're very welcome!

  • @abdelrhmanshokr7546
    @abdelrhmanshokr7546 4 years ago

    dude you're one of the best

  • @21121990jay
    @21121990jay 7 years ago

    Very helpful video!!! Thanks for sharing your knowledge.
    Looking forward to more!!

    • @dataschool
      @dataschool  7 years ago

      You're very welcome! Glad to hear it was helpful to you!

  • @harshrajj9995
    @harshrajj9995 5 years ago +1

    Such great content you provide sir! Thank you so much.

    • @dataschool
      @dataschool  5 years ago

      You're very welcome!

  • @danielkazmi
    @danielkazmi 5 years ago +1

    Absolutely amazing material, thank you Kevin!
    I just wanted to know how would you deal with non-numerical features (i.e Gender, Occupation, Education, etc.) when constructing your ML model? Would you assign them numerical values? If possible, I'd like some guidance or a push in the right direction.
    Again you explain this material much better than most channels do, please keep up the phenomenal work!

    • @dataschool
      @dataschool  5 years ago +1

      Thanks very much for your kind words!
      This might be helpful to you: th-cam.com/video/0s_1IsROgDc/w-d-xo.html
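
For readers following along without the linked video, one common way to handle the non-numerical features asked about above is one-hot encoding. The sketch below is only an illustration of that general technique, not necessarily the approach the video takes; the toy data and its column names are made up.

```python
import pandas as pd
from sklearn.linear_model import LinearRegression

# Hypothetical toy data: the column names and values are invented for illustration.
df = pd.DataFrame({
    "Gender": ["F", "M", "F", "M", "F", "M"],
    "Education": ["HS", "BSc", "MSc", "HS", "BSc", "MSc"],
    "Age": [25, 32, 41, 29, 37, 50],
    "Income": [30, 45, 70, 33, 52, 80],
})

# One-hot encode the categorical columns. drop_first=True drops one level per
# category, because the full set of dummy columns would be perfectly collinear
# with the model's intercept.
X = pd.get_dummies(
    df[["Gender", "Education", "Age"]],
    columns=["Gender", "Education"],
    drop_first=True,
    dtype=int,
)
y = df["Income"]

model = LinearRegression().fit(X, y)
print(list(X.columns))
print(dict(zip(X.columns, model.coef_.round(2))))
```

Assigning arbitrary integers (for example HS=1, BSc=2, MSc=3) would instead impose an ordering and equal spacing on the categories, which is only appropriate when the feature is genuinely ordinal.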

  • @gopalgoel2289
    @gopalgoel2289 7 years ago

    Thank you sooo much these are the best tutorial series :)

    • @dataschool
      @dataschool  7 years ago

      Thanks for your kind comment!

  • @philippe6724
    @philippe6724 4 years ago

    Impressive teacher!

  • @Tony770jr
    @Tony770jr 9 years ago

    Cool stuff, would like to see more pandas integrated with scikit learn.

    • @dataschool
      @dataschool  9 years ago

      Tony770jr Thanks for the suggestion!

  • @musabosman2843
    @musabosman2843 2 years ago +1

    Nicely presented and delivered. Thank you!. I have subscribed to your channel!

  • @transportation-talk
    @transportation-talk 9 years ago +4

    Great video once again. I think the focus of this series should be on ML and Scikit learn. You can explain the relevant pandas code wherever required as you did in this video.
    One question: Is there any algorithm in ML which can select the most relevant / explanatory predictor variables (features) from the data set (instead of user using trial and error approach)? I think this is critical for the data sets with high number of features

    • @dataschool
      @dataschool  9 years ago +1

      umair durrani Great question! There is no "silver bullet" for feature selection, meaning no single strategy that will always tell you which variables to keep in your model. Domain understanding, data exploration, and human intuition are key.
      That being said, the Random Forests model will give you a measure of "variable importance" (on a scale of 0 to 1), and you could use that to guide the selection. As well, regularized linear models will shrink coefficients down to zero as the "penalty term" increases, effectively performing feature selection. Just keep in mind that both need to be tuned to perform properly, and features need to be scaled when performing regularization. scikit-learn has some more guidance on feature selection here: scikit-learn.org/stable/modules/feature_selection.html
      Thanks again for your kind and helpful comments!
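
The two ideas in this reply can be sketched as follows. This is a rough illustration on synthetic data rather than a recommended recipe: the dataset, the `alpha=0.5` penalty, and the number of trees are arbitrary assumptions, and in practice both models would need tuning, as the reply notes.

```python
import numpy as np
from sklearn.ensemble import RandomForestRegressor
from sklearn.linear_model import Lasso
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

rng = np.random.default_rng(0)
n = 300
# Synthetic data (hypothetical): columns 0 and 1 drive y, columns 2-4 are pure noise.
X = rng.normal(size=(n, 5))
y = 4.0 * X[:, 0] + 2.0 * X[:, 1] + rng.normal(scale=0.5, size=n)

# Random forest: feature_importances_ are non-negative and sum to 1.
forest = RandomForestRegressor(n_estimators=200, random_state=0).fit(X, y)
importances = forest.feature_importances_

# Lasso (an L1-regularized linear model): features are standardized first so
# the penalty treats every coefficient on the same scale.
lasso = make_pipeline(StandardScaler(), Lasso(alpha=0.5)).fit(X, y)
coefs = lasso.named_steps["lasso"].coef_

print("forest importances:", importances.round(3))
print("lasso coefficients:", coefs.round(3))
```

Features whose Lasso coefficient is shrunk to zero, or whose forest importance is close to zero, are candidates for removal, to be confirmed with a held-out evaluation.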

    • @darronfuller5297
      @darronfuller5297 9 years ago +1

      umair durrani Umair, there are several useful techniques for feature selection that I recommend you look into. Statistical methods such as forward selection and backward elimination are well suited to determining the most predictive variables in a regression model, and they are easy to understand and implement. Decision trees inherently perform feature selection, in that the variable splits are deemed significant and automatically chosen by the algorithm. A bit more on the complex side are Principal Component Analysis (PCA) and association rules; I believe PCA is in scikit-learn. Good luck! Darron. www.linkedin.com/in/votefordata
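
Forward selection of the kind mentioned here can be sketched with scikit-learn's `SequentialFeatureSelector`, which was added in scikit-learn 0.24 and so postdates this comment. The data below is synthetic and the choice of two features is an assumption made for the example; `direction="backward"` would give backward elimination instead.

```python
import numpy as np
from sklearn.feature_selection import SequentialFeatureSelector
from sklearn.linear_model import LinearRegression

rng = np.random.default_rng(1)
# Synthetic data (hypothetical): only columns 0 and 3 influence y.
X = rng.normal(size=(200, 6))
y = 3.0 * X[:, 0] - 2.0 * X[:, 3] + rng.normal(scale=0.5, size=200)

# Greedily add one feature at a time, keeping the one that most improves the
# cross-validated score (R^2 by default for a regressor).
selector = SequentialFeatureSelector(
    LinearRegression(), n_features_to_select=2, direction="forward", cv=5
)
selector.fit(X, y)

selected = np.flatnonzero(selector.get_support())
print("selected column indices:", selected)
```

Note that this selects among the original features, whereas PCA builds new features as combinations of the originals, so the two are not interchangeable.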

    • @sabr9906
      @sabr9906 8 years ago

      +Data School Could you please advise in another course more about Feature Selection? Which models are more suitable for several cases etc. Like for example, sorting features' scores from RandomizedLasso, or by ranking from RecursiveFeatureElimination, or by selecting K best?

    • @dataschool
      @dataschool  8 years ago

      +Sabr Tasbolatov Thanks for the suggestion! I'll consider it for the future.

    • @dataschool
      @dataschool  5 years ago

      I just released a video about feature selection which might be helpful to you! th-cam.com/video/YaKMeAlHgqQ/w-d-xo.html

  • @ameer9900
    @ameer9900 7 years ago

    wonderful videos for machine learning beginners.

    • @dataschool
      @dataschool  7 years ago

      Thanks! Glad it was helpful to you.

  • @loaiabdallatif4947
    @loaiabdallatif4947 7 years ago

    Very useful video on linear regression
    thanks very much Mr.
    Kevin Markham

    • @dataschool
      @dataschool  7 years ago

      You're very welcome! :)

  • @ebenezerpopoola7860
    @ebenezerpopoola7860 7 years ago

    Wow! this is very clear. You are the best.

    • @dataschool
      @dataschool  7 years ago

      Thanks very much for your kind comment!

  • @saranemohan
    @saranemohan 7 years ago

    It's the most wonderful tutorial I have ever seen on machine learning. I hope to see more videos related to machine learning. If you made a video about optimization techniques for linear regression (like BFGS, etc.), it would be even more beneficial.

    • @dataschool
      @dataschool  7 years ago

      Thanks so much for your kind words! I'll take your suggestion under consideration.

  • @replyanand9
    @replyanand9 6 years ago

    Thanks a lot for excellent Videos.

  • @harveysummers3175
    @harveysummers3175 9 years ago

    These videos are outstanding. I am new to data science, and many of the videos out there are too simple or too hard. You have found the Goldilocks zone of data science. I also like that they are on YouTube, where I can speed them up to 1.5x to match my comprehension rate. Vimeo can't do that.
    I would like you to focus on Scikit, but use Pandas, as most of us will be using both. I think a single lesson on how to use Pandas, as well as how to customize IPython/Jupyter, would also be useful. I'd also like to see a video focused on data sources and on how to approach complex problems (à la Kaggle challenges).
    Improvement suggestions: 1) Focus on technical quality. Use basic stage lighting (diffused above, side, front, with reflector) and a condenser mic to better pick up your voice without echo. 2) Put a whiteboard or a similarly simple background behind you; there's way too much background clutter.
    And I think you are missing an opportunity to end by marketing your courses at Data School, your book, etc. Not that I love ads, but... marketing!

    • @dataschool
      @dataschool  9 years ago

      Harvey Summers Thanks for all of the suggestions, and your kind comments! Very helpful. Building up to more complex problems is definitely on the list. And, it's nice to know that I'm hitting the "sweet spot" in terms of difficulty level.

  • @harininedunuri5848
    @harininedunuri5848 6 years ago

    Love your videos!!! They helped me a lot.

    • @dataschool
      @dataschool  6 years ago

      Thanks! Great to hear!

  • @manujdhariwal
    @manujdhariwal 8 years ago

    does anyone know the Shift+Tab help functionality equivalent, when using python in Sublime?

  • @shawnz9833
    @shawnz9833 7 years ago

    Very Very nice explanation. Thank you Kevin

    • @dataschool
      @dataschool  7 years ago +1

      You're welcome!

  • @magdalenaobrzut8393
    @magdalenaobrzut8393 5 years ago

    Thanks so much, you have helped me lots. Keep up your good work.

  • @itsmillerhere
    @itsmillerhere 9 years ago

    Great video series so far! I have learned tons. I've been trying this out with my own datasets to learn, but I was wondering if there is a good way to plot the coefficient values found in the LinearRegression object after fitting data? I was thinking of maybe a bar graph but not sure. Thanks!

    • @dataschool
      @dataschool  9 years ago

      killermiller1011 Glad it has been helpful to you! Regarding your question, I'm not sure of the value of plotting the coefficients. The interpretation of the coefficients varies depending on the scale of the features, so a plot alone isn't useful for summarization unless it provides additional information (or all features are on the same scale). If the features are all on the same scale, a bar chart might be useful. Hope that helps!
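
The point about feature scale can be made concrete with a small sketch. It assumes synthetic data with two hypothetical spend columns on very different ranges, and uses `StandardScaler` as one possible way to put features on a common scale before comparing coefficients.

```python
import numpy as np
import pandas as pd
from sklearn.linear_model import LinearRegression
from sklearn.preprocessing import StandardScaler

rng = np.random.default_rng(0)
# Hypothetical data: "TV" spans a much wider range than "Radio".
X = pd.DataFrame({
    "TV": rng.uniform(0, 300, 200),
    "Radio": rng.uniform(0, 50, 200),
})
y = 3 + 0.05 * X["TV"] + 0.2 * X["Radio"] + rng.normal(0, 1, 200)

# Raw coefficients: change in y per one unit of each feature.
raw = pd.Series(LinearRegression().fit(X, y).coef_, index=X.columns)

# Standardized coefficients: change in y per one standard deviation of each feature.
X_scaled = StandardScaler().fit_transform(X)
scaled = pd.Series(LinearRegression().fit(X_scaled, y).coef_, index=X.columns)

print(raw.round(3))
print(scaled.round(3))
# scaled.plot(kind="bar") would draw the bar chart (requires matplotlib).
```

In this made-up data the raw coefficient for `Radio` is larger than the one for `TV`, yet after standardizing the ordering reverses, which is why a bar chart of raw coefficients can mislead.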

  • @mirzaburgic
    @mirzaburgic 2 years ago

    Great content, you have an inspiring way of presenting, keep it up!
    I have one question though, why is the TV coefficient smaller than the Radio coefficient, even though from the plots and best fit line it looks like the sales go up faster with more TV ad spending?

  • @MrBigmit37
    @MrBigmit37 7 years ago

    Excellent tutorial.

  • @adsnehi
    @adsnehi 6 years ago

    Great tutorial sir!

  • @BadriNathJK
    @BadriNathJK 8 years ago +5

    You are the best.

    • @dataschool
      @dataschool  8 years ago

      Thank you, I'm glad this content is useful to you!

  • @HossainRabin
    @HossainRabin 6 years ago

    Fantastic tutorial series for PYTHON beginners ...Can you please start teaching us deep learning and neural network?
    I learn PANDAS, Numpy from your tutorial..
    Thanks a lot man

    • @dataschool
      @dataschool  6 years ago

      Thanks for your suggestion!

  • @tareknahool
    @tareknahool 6 years ago

    you are doing well, thanks so much

    • @dataschool
      @dataschool  6 years ago

      You are very welcome! :)

  • @user-bp6vn2ng8m
    @user-bp6vn2ng8m 6 years ago

    Thank you so much for your interesting videos

    • @dataschool
      @dataschool  6 years ago

      You're very welcome!

  • @techcloud9025
    @techcloud9025 7 years ago

    Thank u . These videos are great

    • @dataschool
      @dataschool  7 years ago

      You're very welcome!