Scikit-learn Crash Course - Machine Learning Library for Python

แชร์
ฝัง
  • เผยแพร่เมื่อ 21 ส.ค. 2024

ความคิดเห็น • 220

  • @freecodecamp
    @freecodecamp  3 ปีที่แล้ว +83

    Message from the creator:
    I hope you've all enjoyed this series of videos. It was fun to collaborate with freeCodeCamp!
    If you're interested in more content from me feel free to check out calmcode. Also, I'd like to give a shoutout to my employer, Rasa! We're using scikit-learn (and a whole bunch of other tools) to build open-source chatbot technology for python. If that sounds interesting, definitely check out rasa.com/docs/rasa/.

    • @jadkylan7774
      @jadkylan7774 3 ปีที่แล้ว +2

      i guess I'm kinda randomly asking but do anybody know of a good place to watch newly released tv shows online ?

    • @ariesulises1611
      @ariesulises1611 3 ปีที่แล้ว

      @Jad Kylan Try flixzone. Just search on google for it =)

    • @brodyodin141
      @brodyodin141 3 ปีที่แล้ว

      @Aries Ulises definitely, I've been using flixzone for months myself =)

    • @jadkylan7774
      @jadkylan7774 3 ปีที่แล้ว

      @Aries Ulises thanks, I went there and it seems like a nice service :) I really appreciate it!!

    • @ariesulises1611
      @ariesulises1611 3 ปีที่แล้ว

      @Jad Kylan happy to help =)

  • @buraksenel263
    @buraksenel263 2 ปีที่แล้ว +74

    This is by far the most beginner friendly introduction to sk-learn I've seen

  • @riccello
    @riccello 3 ปีที่แล้ว +57

    This is the way everything should be taught!
    I love that you present concepts in a structured and systematic way, speaking slowly and clearly, using as few words as possible...
    - starting with the concept and talking through drawing a logical diagram (which is so important for developing abstract thinking in terms of high level concepts, which is how we think when we are experienced in something).
    - then writing clean, concise code to implement each part of the concept
    - showing plots that directly demonstrate the effects of the entire iteration
    Too many tutorials make the mistake of talking too much. A lot of videos also either assume too much or too little about the viewer's knowledge.
    This seems to confidently stike the nail on the head!
    Thanks!

    • @manuelcervantes1939
      @manuelcervantes1939 2 ปีที่แล้ว

      Amazing review!

    • @abdulwahab182
      @abdulwahab182 2 ปีที่แล้ว

      Exactly 👍

    • @abdullahshahzad333
      @abdullahshahzad333 ปีที่แล้ว

      Are you serious???
      Instructor didn't even show the dataset. How would anyone understand whats going on like this?

  • @flashbao1922
    @flashbao1922 3 ปีที่แล้ว +16

    This video saved me from a 5K course! Thanks! Loads of Love!

  • @dariuszspiewak5624
    @dariuszspiewak5624 2 ปีที่แล้ว +22

    I must agree with others: this is a great lecture. I mean... REALLY good. Vincent, do you have any more of these? This stuff is not only informative, but also pleasant to watch and listen to. Good, correct, and clear English is rather rare these days. Sadly. This lecture is good because it does not shy away from details. It also goes beyond just showing the API. It tries to build something new from the available "Lego" pieces. Which is great as it shows creativity and also how to dig deeper to understand the data. Very, very good exposition. Many thanks.

    • @tyronefrielinghaus3467
      @tyronefrielinghaus3467 9 หลายเดือนก่อน

      I feel you about clear and well enunciated English. I HATE having to 'interpret' what I'm hearing....too much extraneous Cognitive Load for an already high Intrinsic Load topic.

  • @gabriel1991
    @gabriel1991 3 ปีที่แล้ว +29

    OMG! I love all the contente that Vincent makes! I must watch this video!

    • @universal4334
      @universal4334 3 ปีที่แล้ว +4

      Send me a link to his channel

  • @ThomasKuncewicz
    @ThomasKuncewicz ปีที่แล้ว +5

    The way each dataset complements the associated pitfall you want to bring up at a given moment... wow. What an amazing intro -- it must have taken a lot of forethought and behind the scenes organization to make the flow of this video series seem so effortless. THANK YOU!!

    • @nonSensCave
      @nonSensCave 10 หลายเดือนก่อน

      please bro can you tell me where to find appending for the plot answer ?

  • @imdadood5705
    @imdadood5705 3 ปีที่แล้ว +4

    Just completed the first part of the lecture. I have been using scikit for a couple of months! Dudeee! This is an eye opener!

  • @lVaNeSsA90
    @lVaNeSsA90 3 ปีที่แล้ว +13

    Wow - I need to share this with the rest of the class! Thanks for making this video so understandable.

  • @rajveersinghanand
    @rajveersinghanand 3 ปีที่แล้ว +17

    16:00 pipe
    23:45 grid search
    37:00 standard scaler
    42:00 quantiles better
    46:55
    55:00 fraud ex

  • @develxper7931
    @develxper7931 2 ปีที่แล้ว +2

    I was rewatching the course to make my basics better , there were actually a lot of details man!!!

  • @MrCrunsh
    @MrCrunsh 3 ปีที่แล้ว +81

    Im busy for the next 2h.

  • @navneetTanks
    @navneetTanks 3 ปีที่แล้ว +8

    Thankyou very much, much needed for beginners like me❤️,
    I hope one day when I'll become expert, I will make free courses for others too❤️

  • @codesiddhi
    @codesiddhi 3 ปีที่แล้ว +13

    Just Amazing once again, u guys rock as always...

  • @JoshJetson
    @JoshJetson ปีที่แล้ว +3

    This is an excellent tutorial. Im doing the coursera ibm maachine learning cert and supplementing it with this video. This overall is a much more palatable and easier to understand tutorial of scikit learn and really a machine learning model in general. Awesome work!

  • @cerioscha
    @cerioscha 9 หลายเดือนก่อน +4

    great video series, thanks ! In this video @56:56 i think you meant to say that "there are way more cases without Fraud than with Fraud"

    • @victoran0
      @victoran0 8 หลายเดือนก่อน +3

      exactly why i came to the comments

  • @AcidiFy574
    @AcidiFy574 3 ปีที่แล้ว +13

    Awesome Tutorial,
    I have some suggestions regarding your content:
    1. Tutorial on RUST
    2. Tutorial on JULIA
    3. Tutorial on AWK & SED
    (Especially AWK)
    4. Tutorial on LUA
    What do you guys think????

  • @Treegrower
    @Treegrower 4 หลายเดือนก่อน

    This video is awesome! Your narration style is fantastic.

  • @Duh_Daily
    @Duh_Daily ปีที่แล้ว

    the explanations are well detailed, this really helps with understanding the library and know exactly what to use and where to use it. You have helped a great community of beginners. 🙏🏾🙏🏾🙏🏾🙏🏾🙏🏾

  • @bogoodski
    @bogoodski ปีที่แล้ว

    So amazing. Either this video is especially approachable or I've been exposed to these concepts enough now that they're finally starting to click. Probably both, but the former is definitely a significant factor. Well done

    • @bogoodski
      @bogoodski ปีที่แล้ว

      By the way, im working through the eCornell Python for Machine Learning and certificate in Machine Learning courses and this video is a perfect supplement. This is so helpful. Thank you!

  • @tanb13
    @tanb13 3 ปีที่แล้ว +5

    Does Vincent has his own Channel, I just love his teaching style!!

  • @rodrigo100kk
    @rodrigo100kk 3 ปีที่แล้ว +3

    Great video ! At 1:49:40 you could use ".values" at the end instead of np.array in the beginning.

  • @rouzbehamirazodi3001
    @rouzbehamirazodi3001 7 หลายเดือนก่อน

    Well explained and high quality video and audio. Unlike some other videos out there.

  • @abcdasa1830
    @abcdasa1830 2 หลายเดือนก่อน

    thank you. your video makes me clear about scikit-learn and machine learning. you're my saint

    • @Gh0stiefr
      @Gh0stiefr หลายเดือนก่อน

      does this tutorial worth it to watch like in this year , its 3 year old!!?

  • @dosiedoe
    @dosiedoe 2 ปีที่แล้ว

    it's insane how good this video is

  • @rodiekozlovsky2415
    @rodiekozlovsky2415 3 ปีที่แล้ว +2

    what a great course! thank you for openning the gates..

  • @johnmo1111
    @johnmo1111 ปีที่แล้ว

    Great video. Helped me with multiple sections that I had been fumbling my way through. No hard going over some things I already knew aswell.
    Thanks for this..👍

  • @dilshanchrishantha6548
    @dilshanchrishantha6548 3 ปีที่แล้ว +1

    excellent explanation for a beginner in ML .Thanks for the course.

  • @SK-qj3oj
    @SK-qj3oj 4 หลายเดือนก่อน

    Wow such an awesome course, cant believe this is free

  • @muhammadsahalsaiyed2595
    @muhammadsahalsaiyed2595 3 วันที่ผ่านมา

    Boston House Price Dataset is available on Kaggle for those who are saying scikit learn has removed it.

  • @berdeter
    @berdeter ปีที่แล้ว

    I loved the end chapter that joined machine learning with expert systems I've used 30 years ago...

  • @ShiftKoncepts
    @ShiftKoncepts 10 หลายเดือนก่อน

    thank you so much! I am slowly digesting this stuff and most likely will have to review it 2 or more times.

  • @dilshanchrishantha6548
    @dilshanchrishantha6548 3 ปีที่แล้ว +1

    great series of demo videos. well explained for a beginner to learn from zero.

  • @jakobaljaz705
    @jakobaljaz705 ปีที่แล้ว +2

    i feel i learned so much, great job sir. Thank you :)

  • @locky916
    @locky916 5 หลายเดือนก่อน

    Thanks for this great material about scikit-learn, it is really helpful and understanding is more comfortable with educators beatiful explanations. Huge thanks and keep going...

  • @kateryna_today
    @kateryna_today 3 ปีที่แล้ว

    Just started learning scikit! thank you for the material

  • @pw7225
    @pw7225 2 ปีที่แล้ว +1

    Kudos! Excellent training.

  • @ayanah4821
    @ayanah4821 หลายเดือนก่อน

    awesome! continue at 46:05

  • @JoseRicardoXavier
    @JoseRicardoXavier 3 ปีที่แล้ว +1

    Amazing presentation !!

  • @padmanabhan_s
    @padmanabhan_s 3 ปีที่แล้ว

    Excited!!!

  • @cristhiancasierra8265
    @cristhiancasierra8265 3 ปีที่แล้ว

    PERFECT TIMING!!!

  • @albertog2196
    @albertog2196 3 ปีที่แล้ว +1

    Very good teacher. Thanks for the content I learned a lot.

  • @gustavojuantorena
    @gustavojuantorena 3 ปีที่แล้ว +1

    Awesome! Thank you for sharing!

  • @gisleberge4363
    @gisleberge4363 2 ปีที่แล้ว

    Great introduction to ML, educational and well explained to the core... 🙂

  • @yugosaito9704
    @yugosaito9704 11 หลายเดือนก่อน

    Thank you for uploading this video!

  • @abhijeetkushwaha424
    @abhijeetkushwaha424 3 ปีที่แล้ว +4

    Do you guys like..read minds or something?
    I was working on a django project yesterday, and you released one. I was stuck on ML today, and here's the video. Wicked!

  • @ginopeduto4264
    @ginopeduto4264 18 วันที่ผ่านมา

    so well explained thank you

  • @sonalkudva1839
    @sonalkudva1839 6 หลายเดือนก่อน +4

    i am trying to learn from this course but it says that the boston data set has been removed from scikit learn. what should i do?

    • @juaningo24
      @juaningo24 3 หลายเดือนก่อน

      You can still downgrade your scikit-learn version to 1.0.2 and it should be fine, also if you don't want to, you can use the fetch_california_housing instead

  • @wiktorm9858
    @wiktorm9858 10 หลายเดือนก่อน

    Rime series needed these Polynomial parameters, i think. Cool tutorial though!

  • @Natalie-rl5wz
    @Natalie-rl5wz 5 หลายเดือนก่อน +3

    Hello, I just wanted to say for those who plan to do the videos. The data set 'Boston house prices' has been removed by scikit, therefore this tutorial is not really working anymore unless you change the dataset

  • @memelol1859
    @memelol1859 2 ปีที่แล้ว

    Wow thank u this really clarified my doubts :)

  • @develxper7931
    @develxper7931 2 ปีที่แล้ว

    50:00 count vecotorizer is a really good preprocessor for that too in my opinion

  • @rodionraskolnikov6989
    @rodionraskolnikov6989 ปีที่แล้ว

    truly a great tutorial!

  • @feep1642
    @feep1642 3 ปีที่แล้ว +1

    very nice tutorial watched the whole thing

    • @arnavmehta3669
      @arnavmehta3669 3 ปีที่แล้ว

      How you watched 2 hr video in 27minutes

  • @thomasbates9189
    @thomasbates9189 10 หลายเดือนก่อน

    Very helpful! Thank you!

  • @salivona
    @salivona 3 ปีที่แล้ว

    Beautiful lecture!

  • @ccuny1
    @ccuny1 3 ปีที่แล้ว

    Fantastic. Thank you very much.

  • @sunshadow9704
    @sunshadow9704 2 ปีที่แล้ว

    You are the ONE
    Thank you Sir

  • @louisshengliu
    @louisshengliu 2 ปีที่แล้ว +4

    Could you please explain why the min of recall and precision is lower than both? Could not find appendix.

    • @adrienpyb1611
      @adrienpyb1611 2 ปีที่แล้ว

      +1, anyone knows where to find the appendix?

    • @ANONIM9123
      @ANONIM9123 2 ปีที่แล้ว +1

      hint: min_both is calculated separately at every train/test split in the cross-validation

    • @user-wr6rb5eb5g
      @user-wr6rb5eb5g 6 หลายเดือนก่อน +1

      +1, same, could not find appendix

  • @abdelkaderkaouane1944
    @abdelkaderkaouane1944 ปีที่แล้ว

    Very interesting, Thank you very much

  • @mugumyavicent2803
    @mugumyavicent2803 2 ปีที่แล้ว

    thanks my co name --- vicent, you inspire me to do machine learning

  • @kennethstephani692
    @kennethstephani692 ปีที่แล้ว

    Great video!

  • @howardsmith4128
    @howardsmith4128 3 ปีที่แล้ว

    Great crash course.

  • @StarsTogether
    @StarsTogether 10 หลายเดือนก่อน

    This is compelling writing. If the subject fascinates you, a subsequent book with similar themes would be beneficial. "From Bytes to Consciousness: A Comprehensive Guide to Artificial Intelligence" by Stuart Mills

    • @nonSensCave
      @nonSensCave 10 หลายเดือนก่อน

      please bro can you tell me where to find appending for the plot answer ?

  • @mphomathabathe8558
    @mphomathabathe8558 3 ปีที่แล้ว

    31:31 fire statement!!

  • @fishnchips6627
    @fishnchips6627 2 ปีที่แล้ว

    35:56 as a non-American, it is so satisfying hearing z read as 'zed' not 'zi'. lol

  • @wb7779
    @wb7779 5 หลายเดือนก่อน

    Very nice, thank you.

  • @reyou7
    @reyou7 3 ปีที่แล้ว

    amazing content, thanks a ton!

  • @azertytnt421
    @azertytnt421 3 ปีที่แล้ว

    Really it is amazing course

  • @juanete69
    @juanete69 2 ปีที่แล้ว

    Very good tutorial.

  • @cientifiko
    @cientifiko ปีที่แล้ว

    this has an awesome didactics

  • @kevindandrade5307
    @kevindandrade5307 3 ปีที่แล้ว +3

    The section on Metrics gets confusing for me. Any easy to understand books I can read for understanding metrics?

    • @saptarshisanyal4869
      @saptarshisanyal4869 2 ปีที่แล้ว

      The metrics section was overwhelming for me as well. There has to be a pre requisite base work before going for this.

  • @wexwexexort
    @wexwexexort 3 ปีที่แล้ว

    I like this guy.

  • @vigneshpadmanabhan
    @vigneshpadmanabhan 3 ปีที่แล้ว +1

    Thanks!

    • @vigneshpadmanabhan
      @vigneshpadmanabhan 3 ปีที่แล้ว

      this is one of the best videos I have seen covering sklean so well. Thanks a lot! would love to learn sklearn in more depth for different scenarios ..

    • @saptarshisanyal4869
      @saptarshisanyal4869 2 ปีที่แล้ว

      Hi Vignesh, could you suggest a book which covers the metrics section?

  • @thomasnissen6695
    @thomasnissen6695 ปีที่แล้ว +2

    Did anybody figure out why the mean of the min(recall, precision) was below the actual mean of both recall & precision? 1:10:57

    • @meisterpianist
      @meisterpianist 7 หลายเดือนก่อน +1

      The mean is always measured over all 10 splits, for precision, for recall AND for the minimum separately. In other words, FIRST the minimum is calculated, THEN the mean over all these minimums is calculated. If you would have only one split, there would not be a problem. But starting with two splits, we have: test_precision 1.0 and 0.46 = mean 0.73. test_recall 0.37 and 1.0 = mean 0.68. However, the minimum is 0.37 and 0.46, and if you calculate the mean of these two, it's 0.42, which is below 0.73 and below 0.68. So it's reasonable that the minimum is always a bit lower than each of the two lines. In fact, I never found the "appendix", Vincent was talking about. I just took the grid-results as a dataframe, exported it to excel and played a bit around.

    • @user-wr6rb5eb5g
      @user-wr6rb5eb5g 6 หลายเดือนก่อน

      @@meisterpianist Thanks for the explanation!

  • @thecaptain2000
    @thecaptain2000 7 หลายเดือนก่อน +2

    It is a delicate subject, but I think the question of the Algorithm being racist is an ill advised one. The real question under it is whether The % of black population parameter affects the house price or not. Is the aim of a data scientist to make the actual prediction or to make the data fit a point of view (which, btw, I totally endorse in principle)

  • @mehdismaeili3743
    @mehdismaeili3743 2 ปีที่แล้ว

    thanks for his great video.

  • @AlmogYosef520
    @AlmogYosef520 3 ปีที่แล้ว +3

    Hi, what do you guys suggest me to watch if I'm totally new to ML?
    I find this course a little bit beyond my knowledge, I thought because I've got the foundation of DS I can jump on this course but I think I'll need some intro to ML videos.

    • @Caradaoutradimensao
      @Caradaoutradimensao 3 ปีที่แล้ว +4

      StatQuest

    • @AlmogYosef520
      @AlmogYosef520 3 ปีที่แล้ว +1

      @@Caradaoutradimensao Awesome looks good!
      Thanks a lot!

    • @spiritech7162
      @spiritech7162 2 ปีที่แล้ว +1

      @@Caradaoutradimensao thanks bro

  • @espirikii
    @espirikii 2 ปีที่แล้ว

    For the Titanic example: 76% of the women survived, whereas just 16% of the men survived, that would have been a really good classifier to start with

  • @ultraviolenc3
    @ultraviolenc3 3 ปีที่แล้ว +2

    1:11:00 what’s the answer though?

  • @hassanhijazi4757
    @hassanhijazi4757 ปีที่แล้ว +1

    I did not succeed to reproduce the figure @ 1:16:56. I'm always getting the same figure as the one just before even I did the log transformation of the "Amount" column. Anyone have had the same problem?

  • @riccello
    @riccello 3 ปีที่แล้ว +1

    Can I ask you how you are able to draw on the screen? I understand you are probably using a Stylus pen over some touch screen surface, which mirrors your display, but what software are you using for that?

  • @abdulwahab182
    @abdulwahab182 2 ปีที่แล้ว

    Great 👍

  • @user-fe2oh8oj2u
    @user-fe2oh8oj2u 3 ปีที่แล้ว +1

    Could you please do "Python for Raspberry Pi 4". I cannot fight a proper guide which properly introduces and explains from the very beginning. I would like to experiment with robotics (e.g. robot arm, etc.), but have no idea how to start programming it. All available guides are using irrelevant projects to start with Raspberry.
    Note: Thank you for the tutorial!

    • @mwanikimwaniki6801
      @mwanikimwaniki6801 3 ปีที่แล้ว

      I could help with a little info if you are still interested,

  • @nonSensCave
    @nonSensCave 10 หลายเดือนก่อน +1

    please guys, where is this appending for the plot answer ????????????????

    • @vignatej663
      @vignatej663 6 หลายเดือนก่อน

      Bro, did you got any???

  • @kodiaktheband
    @kodiaktheband ปีที่แล้ว +2

    The Boston housing prices dataset has an ethical problem: as
    investigated in [1], the authors of this dataset engineered a
    non-invertible variable "B" assuming that racial self-segregation had a
    positive impact on house prices [2]. Furthermore the goal of the
    research that led to the creation of this dataset was to study the
    impact of air quality but it did not give adequate demonstration of the
    validity of this assumption.
    The scikit-learn maintainers therefore strongly discourage the use of
    this dataset unless the purpose of the code is to study and educate
    about ethical issues in data science and machine learning.

  • @pushpendratripathi46
    @pushpendratripathi46 3 ปีที่แล้ว

    Thanks ♥️♥️

  • @nguyenphutho9503
    @nguyenphutho9503 3 ปีที่แล้ว +1

    Sorry, I have a question :
    Which version of python and opencv are matched ?
    Because a lot of tutorials I had follow, but unable to find matched compatible version of python and opencv.
    Please help me to find solution to my own project. Thank you so much.

  • @messedinsaan
    @messedinsaan หลายเดือนก่อน

    "Building dependencies failed"
    error: subprocess-exited-with-error
    Cannot import boston housing price dataset.

  • @aminghafoori6496
    @aminghafoori6496 3 ปีที่แล้ว

    thank you

  • @asokt4931
    @asokt4931 ปีที่แล้ว

    What do you mean watch all these videos? Are there different videos series?

  • @Ingles.con.peliculas
    @Ingles.con.peliculas ปีที่แล้ว

    it's great...

  • @markomilenkovic2714
    @markomilenkovic2714 ปีที่แล้ว

    Is it still worth watching this video? How much has changed in 2 years? Thank you

  • @cientifiko
    @cientifiko ปีที่แล้ว

    very useful... I run the code on idle but it didnt work well, there are something that need to revise like importation of library being after used variable.

  • @vadimrudakov8907
    @vadimrudakov8907 ปีที่แล้ว

    Data leakage? In the introducing section (like in 28:41) we have a gridsearch that contains a pipeline with the numeric features transformer. I guess it is the right way to data leakage, because in our pipeline we first transform all the numeric features in the entire dataset and straightly after that we start our model learning through the cross-validation process within the entirely transformed dataset. Our training sets, created during cv, contain previously standardized data, so the model "knows" something about the examples that are not in the training set and can predict better when process them in the prediction step. Thus we should exclude any numeric features transformation in our grid search, am I right? If I'm not, please explain the mechanism.

  • @mrrishiraj88
    @mrrishiraj88 3 ปีที่แล้ว

    Thanks

  • @Rukoilla
    @Rukoilla หลายเดือนก่อน

    B for blacks is wild.

  • @VASUofficial0
    @VASUofficial0 2 หลายเดือนก่อน

    for better learning you can also provide data links used in this course ,sir if u can

  • @BartekP
    @BartekP 3 ปีที่แล้ว

    25:50 using space instead of tab .... stops watching :) (joke) great video

  • @khal7994
    @khal7994 2 ปีที่แล้ว

    00:19 i did not underestand why after changing k value from 5 to 1 prediction diagram changed ? knn is a classification algoithm but here it was like a regration