Scikit-Learn Tutorial | Machine Learning With Scikit-Learn | Sklearn | Python Tutorial | Simplilearn

  • Published on Jan 2, 2025

Comments • 697

  • @SimplilearnOfficial 6 years ago +10

    Do you have any questions on this topic? Please share your feedback in the comment section below and we'll have our experts answer it for you. Also, if you would like to have the dataset for implementing the use case shown in the video, please comment below and we will get back to you. Thanks for watching the video. Cheers!

    • @kuaranir2440 5 years ago

      File "", line 3
      from sklearn.model_selection import train_test_split
      ^
      SyntaxError: invalid syntax
      Where is the mistake?

    • @georgyachkouty3127 5 years ago

      Fantastic video, man. Any chance we can get our hands on the file that you worked on? Maybe through your Git profile?

    • @SimplilearnOfficial 5 years ago

      Hello Georgy, thanks for the kind comment. It would be helpful if you will provide your email ID to us so that we could send the requested dataset promptly. On the off chance that you need your email ID to be kept hidden from others, we can do that also. Hope that helps.

    • @gabrielsoutomaracaja 5 years ago +15

      @georgyachkouty3127 archive.ics.uci.edu/ml/machine-learning-databases/wine-quality/

    • @juancarlosneufeld2022 5 years ago

      Hi! Please send me the dataset and the Python script. juan_neufeld@hotmail.com Thank you!

  • @mahyarazad 5 years ago +10

    Beautifully explained all the sklearn methods. I have been watching YouTube videos to learn this topic for a while; nonetheless, I couldn't get the juice until now. I suppose this video is 100% effective for those who want to get straight to the point.

    • @SimplilearnOfficial 5 years ago +1

      Hey Mahyar, thank you for appreciating our work. We are glad to have helped. Do check out our other tutorial videos and subscribe to us to stay connected. Cheers :)

  • @haykogevorgyan9935 2 years ago +3

    May you have an eternal bliss for the effort you put in doing things in the world, my friend!

    • @SimplilearnOfficial 2 years ago +1

      Hey, thank you for appreciating our work. We are glad to have helped. Do check out our other tutorial videos and subscribe to us to stay connected. Cheers :)

  • @usamazahid1 5 years ago +1

    Wow... your tutorial was just bull's-eye. Right on the spot, exactly what was needed. A big thumbs up for what I needed to get going with scikit-learn. Keep it up!

    • @SimplilearnOfficial 5 years ago

      Hey Muhammad, thank you for watching our video. We are glad that you liked our video. Do subscribe and stay connected with us. Cheers :)

  • @alonasorochynska5881 5 years ago

    This is the best tutorial ever about this topic. All the information is presented so clearly. Thank you very much, and do more!!!

    • @SimplilearnOfficial 5 years ago

      WooHoo! We are so happy you love our videos. Please do keep checking back in. We put up new videos every week on all your favorite topics. Whenever you have the time, you must also check out our blog page @simplilearn.com and tell us what you think. Have a good day!

  • @haha2927 4 years ago +8

    Hey, thanks for your great tutorial! But I have one question about the StandardScaler function: when you scale the training set, don't you have to scale the test set (and the data you're using in the later application) with the same values? As an example: feature A has a mean of 2 and a std of 0.3 in the training set, so you have to normalize the test set with these values (because you have to pretend the test set is "unknown"). So how can I extract the mean and std values?
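
    A minimal sketch of one way to read those values, assuming scikit-learn's `StandardScaler` and a made-up two-feature array (the numbers are illustrative, not taken from the wine dataset). The scaler is fitted on the training set only; its learned statistics are exposed as `mean_` and `scale_`, and `transform` reuses them on the test set.

```python
import numpy as np
from sklearn.preprocessing import StandardScaler

# Illustrative data only: three training rows, one test row, two features.
X_train = np.array([[1.0, 10.0], [2.0, 20.0], [3.0, 30.0]])
X_test = np.array([[2.0, 40.0]])

scaler = StandardScaler()
X_train_scaled = scaler.fit_transform(X_train)  # learns mean/std from the training data
X_test_scaled = scaler.transform(X_test)        # reuses the training mean/std

print("per-feature mean:", scaler.mean_)
print("per-feature std: ", scaler.scale_)

# The same result computed by hand from the stored statistics.
manual = (X_test - scaler.mean_) / scaler.scale_
print(np.allclose(manual, X_test_scaled))
```

    The same fitted `scaler` object can later be applied to any new data with `transform`, which is what keeps the test set "unknown" during fitting.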

  • @aliasjad9560 4 years ago

    This tutorial is incredible and very helpful. I had many doubts about scikit-learn; now, with this tutorial, my problems have been solved.

    • @SimplilearnOfficial 3 years ago

      You're very welcome! Please like and subscribe to our channel and click on the bell icon to get new video updates.

  • @DavidS1er 3 years ago +1

    Thank you!!! super straightforward and easy to follow along.

  • @sergiysergiy8875 1 year ago +1

    Great tutorial. Thanks!

    • @SimplilearnOfficial 1 year ago

      We're so glad that you enjoyed your time learning with us! If you're interested in continuing your education and developing new skills, take a look at our course offerings in the description box. We're confident that you'll find something that piques your interest!

  • @alessandrosilveira9009 2 years ago +1

    Great Tutorial! Well Done!

    • @SimplilearnOfficial 2 years ago

      Hope you enjoyed our video! We have a ton more videos like this on our channel. We hope you will join our community!

  • @kennethstephani692 1 year ago +1

    Great video!!

    • @SimplilearnOfficial 1 year ago

      We're thrilled to have been a part of your learning experience, and we hope that you feel confident and prepared to take on new challenges in your field. If you're interested in further expanding your knowledge, check out our course offerings in the description box.

  • @amirhosseinsarfi3893 5 years ago +8

    Thanks for this incredible tutorial, Does it have a second part?

    • @SimplilearnOfficial 5 years ago +2

      Glad you enjoyed our video! We are sorry to say that this tutorial doesn't have a second part. Also, don't forget to support us by subscribing to our channel. Cheers!

  • @tehreemqasim2204 3 years ago +1

    Many thanks for the wonderful tutorial

  • @alejandromorcilloalegre219 5 years ago +5

    Hi, you transformed X_train and X_test separately. Wouldn't it be a better idea to rescale them while they are together and then separate them?

    • @SimplilearnOfficial 5 years ago

      Yes that could be done too.

    • @CJP3 4 years ago +1

      If you scale them together then you risk data leakage. This is not typically advised.

    • @updatascience 4 years ago

      As Christopher also added, you prefer to do the transformation separately, since we want to learn the scaling statistics (mean and variance) from the training data only and then use those statistics to transform the test data as well. Otherwise, information from the test set leaks into training and the evaluation looks better than it should.
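
      One way to make the "fit on train, reuse on test" rule automatic is a pipeline. The sketch below is an illustration under assumptions, not the video's code: it uses synthetic data from `make_classification` and an `SVC` as a stand-in model.

```python
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC

# Synthetic stand-in data (not the wine dataset).
X, y = make_classification(n_samples=200, n_features=5, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=42
)

# fit() fits the scaler on X_train only, then trains the classifier on the scaled data.
model = make_pipeline(StandardScaler(), SVC())
model.fit(X_train, y_train)

# score() applies the already-fitted scaler to X_test before predicting.
test_accuracy = model.score(X_test, y_test)
print(test_accuracy)
```

      Because the scaler lives inside the pipeline, the test rows never contribute to the mean and variance used for scaling.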

  • @junpingyin6797 4 years ago +3

    Thanks, very helpful tutorial. I just have one question: why is the precision value in the classification report not equal to the accuracy computed from the confusion matrix?
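
    The two numbers measure different things, which a tiny hand-made example can show. The labels below are invented for illustration; the metric functions are from `sklearn.metrics`. Accuracy counts all correct predictions over all samples, while precision for a class counts only how many of the samples predicted as that class really belong to it.

```python
from sklearn.metrics import accuracy_score, confusion_matrix, precision_score

# Invented labels: 0 = "bad" wine, 1 = "good" wine.
y_true = [0, 0, 0, 0, 0, 0, 1, 1, 1, 1]
y_pred = [0, 0, 0, 0, 0, 1, 1, 1, 0, 0]

cm = confusion_matrix(y_true, y_pred)
tn, fp, fn, tp = cm.ravel()  # rows are true classes, columns are predicted classes

accuracy = (tp + tn) / cm.sum()   # all correct predictions / all samples
precision_good = tp / (tp + fp)   # correct "good" predictions / all "good" predictions

print(cm)
print(accuracy, accuracy_score(y_true, y_pred))
print(precision_good, precision_score(y_true, y_pred))
```

    Here 7 of 10 predictions are correct (accuracy 0.7), but only 2 of the 3 samples predicted as "good" are truly good (precision of about 0.67), so the two values need not agree.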

  • @TopicalAuthority 4 years ago +1

    Nice video, thank you.

  • @andreasnordbass 5 years ago +6

    is this the best version of this video? sound quality is really low

    • @SimplilearnOfficial 5 years ago +3

      Thank you so much for bringing this to our attention. We reported this right away to the relevant department.

    • @ppal64 4 years ago

      Sound is okay. Perhaps check elsewhere.

  • @yashmehta4922 5 years ago +2

    An excellent video indeed! Had a doubt; why don't we use fit_transform for the x_test data?

  • @michaelsichenko2313 3 years ago +1

    Amazing!
    much appreciated.

  • @rodolfobrandao5364 5 years ago +1

    Very useful, thank you so much! Great lesson!

    • @SimplilearnOfficial 5 years ago +1

      Hey Rodolfo, thank you for watching our video. We are glad that you liked our video. Do subscribe and stay connected with us. Cheers :)

  • @mariav1234 5 years ago +2

    Thanks for this excellent video!

    • @SimplilearnOfficial 5 years ago

      Hey Mariav, thank you for appreciating our work. We are glad to have helped. Do check out our other tutorial videos and subscribe to us to stay connected. Cheers :)

  • @MrCaglar1993 4 years ago

    Sir. You nailed it...

    • @SimplilearnOfficial 3 years ago

      Thanks for the compliment. Do subscribe to our channel and stay tuned for more.

  • @TopicalAuthority 3 years ago +1

    Great job, thank you.

  • @bernsbuenaobra473 4 years ago +1

    Great video: concise, direct to the point, and hands-on. It makes one want to study and play with machine learning at the same time and then go deeper. I just found this a much better approach than traditional Excel and legacy SAS JMP. No cut and paste for a disciplined developer: learn as you type! I have a use case in agricultural work where I can apply the lessons here, thanks! Please send me a link to the practice dataset and the Jupyter Notebook file (to use as a reference solution set versus my version of the code).

    • @SimplilearnOfficial 4 years ago

      Glad it was helpful! It would be helpful if you will provide your email ID to us so that we could send the requested dataset promptly.

    • @bernsbuenaobra473 4 years ago

      berns.buenaobra@gmail.com

  • @পথিক-ঘ৮য 2 years ago +1

    One of the best learning videos...can you please add the wine data file in google drive

    • @SimplilearnOfficial 2 years ago

      Hello, thanks for viewing our tutorial. You can find your requested dataset in the video description. Hope that helps.

  • @sourishw.5865 4 years ago +1

    Thanks for the video! How would I go about taking my model and exporting it so that I can use it in my own different applications? Like, how do I find the actual code of the model that I can copy into another application to use regularly?
    Also, when refitting with new training data, does the StandardScaler remember the scale it applied to the old data and apply that to the new data?
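
    A fitted scikit-learn model is a Python object rather than source code to copy, so a common approach is to serialize it to a file and load it in the other application. The sketch below is one possible way, assuming `joblib` (installed alongside scikit-learn), synthetic data, and a hypothetical file name `wine_model.joblib`. On the second question: `transform` reuses the statistics stored by the last `fit`, and calling `fit` again recomputes them from the new data.

```python
import os
import tempfile

import joblib
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

# Synthetic stand-in data (not the wine dataset).
X, y = make_classification(n_samples=100, n_features=4, random_state=0)

# Keeping the scaler and the classifier in one pipeline means both are saved together.
model = make_pipeline(
    StandardScaler(), RandomForestClassifier(n_estimators=20, random_state=0)
)
model.fit(X, y)

# Hypothetical file name, written to a temporary directory for this demo.
path = os.path.join(tempfile.mkdtemp(), "wine_model.joblib")
joblib.dump(model, path)

# In the other application: load the file and predict, with no retraining.
loaded = joblib.load(path)
same = bool((loaded.predict(X) == model.predict(X)).all())
print(same)
```

    The loading side needs a compatible scikit-learn installation, since the file stores the fitted object rather than standalone code.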

  • @Grafflog 5 years ago

    I've been following this tutorial to the letter, with the exception of the dataset (I'm using a set with 4500 lines).
    When I get to 28:28 in the video, I get the following error:
    object of type 'CategoricalDtype' has no len()
    Does anyone here have an idea why?

  • @afanouekue 4 years ago

    Thanks for this straightforward tutorial!😊

  • @wisemindmastery 2 years ago +1

    I have been having trouble downloading the dataset, and some others from Simplilearn videos. It downloads as an .html file. Someone help me out. How do you download the datasets?

    • @SimplilearnOfficial 2 years ago

      Hello, thanks for viewing our tutorial. You can find your requested dataset in the video description. Hope that helps.

  • @AkaExcel 6 years ago +2

    Thanks for the video!

    • @SimplilearnOfficial 6 years ago

      Hey, thank you for watching our video. We are glad that you liked our video. Do subscribe and stay connected with us. Cheers :)

    • @AkaExcel 6 years ago

      @SimplilearnOfficial You are welcome!

    • @vimalchandran7810 4 years ago

      Great tutorial! 👍 How can we get the dataset? Below is my mail ID:
      vcdwbi@gmail.com

  • @rohithvarma6763 4 years ago +1

    great explanation bruh

  • @glowish1993 5 years ago +1

    Wow, this is a masterful crash course for people who already have some knowledge. I learnt quite a few things as well, thanks so much!!
    Also, are there any plans for more episodes on scikit-learn?

    • @SimplilearnOfficial 5 years ago +1

      Hello, thank you for the amazing feedback. We are glad to have helped. We do not currently have any plans to make more videos about scikit-learn. However, we will be coming up with advanced videos for each topic soon. So, do check out our other tutorial videos and subscribe to us to stay connected. Cheers :)

    • @glowish1993 5 years ago +1

      @SimplilearnOfficial Looking fwd to that, subscribed!

    • @SimplilearnOfficial 5 years ago

      Thanks for joining our community. We welcome you!

  • @hectoralarcon4888 4 years ago

    You are amazing, very well explained.

    • @SimplilearnOfficial 4 years ago

      Hello, thank you for watching our video. We are glad that you liked our video. Do subscribe and stay connected with us. Cheers :)

  • @Chenard612 4 years ago

    Great tutorial, thank you very much!

  • @ethamaneely3902 5 years ago +7

    Very useful, thank you so much. I had only one issue: my dataset has a lot of categorical data and I wasn't sure how to convert it.

    • @AlejandroRodriguez-jo9kg 4 years ago

      The common practice is to create a numerical column and assign a fixed value to each category.
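
      Assigning one integer per category implies an ordering between categories, which may not exist. A frequently used alternative is one-hot encoding; the sketch below uses pandas' `get_dummies` on a made-up frame with a hypothetical `color` column (not a column of the tutorial's dataset).

```python
import pandas as pd

# Made-up data with one numeric and one categorical column.
df = pd.DataFrame(
    {"alcohol": [9.4, 12.1, 10.5], "color": ["red", "white", "red"]}
)

# One new indicator column per category; the original "color" column is dropped.
encoded = pd.get_dummies(df, columns=["color"])
print(encoded.columns.tolist())
```

      For ordered categories (for example "low", "medium", "high"), a single integer column is a reasonable choice; for unordered ones, indicator columns avoid suggesting a ranking.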

  • @sarabarriusotapia130 4 years ago

    Very useful video. Thanks.

    • @SimplilearnOfficial 4 years ago

      Very welcome! Do subscribe to our channel and stay tuned for more.

  • @renmeker6588 5 years ago +1

    Thx for explaining :)
    But how could I predict the original rating and not only whether the wine is good or bad?

  • @devampatel9777 3 years ago +1

    How to access the CSV file which you are using in the tutorial?

    • @SimplilearnOfficial 3 years ago

      Hi, Simplilearn provides online training across the world. We would be happy to help you regarding this. Please visit us at www.simplilearn.com and drop us a query and we will get back to you! Thanks!

  • @andrew61987 5 years ago

    Why distill quality down to just "good" or "bad"? How can we rework the example to predict the numerical value for quality?
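
    One possible rework, sketched under assumptions: skip the binning step and train a regressor on the numeric score. The data below is synthetic (a made-up integer score between 3 and 8), and `RandomForestRegressor` is simply a regression counterpart to the classifier family; it is not something shown in the video.

```python
import numpy as np
from sklearn.ensemble import RandomForestRegressor
from sklearn.metrics import mean_absolute_error
from sklearn.model_selection import train_test_split

# Synthetic features and a made-up integer quality score in the range 3..8.
rng = np.random.default_rng(0)
X = rng.normal(size=(300, 4))
quality = np.clip(np.round(6 + X[:, 0] - 0.5 * X[:, 1]), 3, 8)

X_train, X_test, y_train, y_test = train_test_split(
    X, quality, test_size=0.2, random_state=42
)

# No pd.cut / LabelEncoder step: the numeric score is the target directly.
reg = RandomForestRegressor(n_estimators=100, random_state=0)
reg.fit(X_train, y_train)

pred = reg.predict(X_test)                 # real-valued predictions
mae = mean_absolute_error(y_test, pred)    # average error in quality points
print(mae)
```

    Regression models are judged with error metrics such as mean absolute error instead of a confusion matrix or classification report.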

  • @Ajhidri 4 years ago

    thank you so much, it's really helpful !

    • @SimplilearnOfficial 4 years ago +1

      Greetings! Thank you for your kind words. Spread the word by liking, sharing and subscribing to our channel! Cheers :).

  • @scigama71 4 years ago +1

    Can you explain why you don't use sc.fit_transform on line 5? Great video.

  • @FilippoTeodoro 5 years ago +4

    Here is the dataset 'winequality-red.csv' github.com/zygmuntz/wine-quality/blob/master/winequality/winequality-red.csv

  • @mgdp12 4 years ago +3

    Link to datasets:
    archive.ics.uci.edu/ml/machine-learning-databases/wine-quality/
    Note that these are semicolon delimited
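
    A short sketch of reading a semicolon-delimited file with pandas. To stay self-contained it parses a tiny in-memory sample whose rows are made up for illustration and which keeps only three columns; with the real download, the file path would replace the `StringIO` object.

```python
import io

import pandas as pd

# Made-up, truncated sample in the same semicolon-delimited layout.
sample = io.StringIO(
    '"fixed acidity";"alcohol";"quality"\n'
    "7.4;9.4;5\n"
    "7.8;9.8;6\n"
)

# sep=";" tells pandas that columns are separated by semicolons, not commas.
wine = pd.read_csv(sample, sep=";")
print(wine.head())
```

    Without `sep=";"`, pandas would read each row as a single column, which is a common source of `KeyError` when a column such as `quality` is accessed later.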

  • @ozysjahputera7669 2 years ago

    I would apply the scaler to the features before splitting the samples to training and testing sets.

    • @SimplilearnOfficial 2 years ago

      Keep learning with us. Stay connected with our channel and team :). Do subscribe to the channel for more updates :)

  • @ethamaneely3902 5 years ago

    Very useful, love it

  • @farshad-hasanpour 4 years ago

    Thank you for this great tutorial

  • @syrnaya_narezka 4 years ago +1

    Thanks for the video! Please share the link shown at 38:36.

    • @SimplilearnOfficial 4 years ago

      Hi Halyna, are you looking for the dataset?

    • @syrnaya_narezka 4 years ago

      @SimplilearnOfficial Yes and no. What I would really like is for you to send me the link to the algorithm descriptions shown at 38:36, which I can't find on your site now.

    • @SimplilearnOfficial 4 years ago

      Hello Halyna, thanks for viewing our tutorial and we hope it is helpful. It would be helpful if you will provide your email ID to us so that we could send the requested dataset promptly.

  • @heenagirdher6443 5 years ago +2

    Hi. Kindly upload tutorial for image dataset with svm

    • @SimplilearnOfficial 5 years ago

      Thanks for your suggestion. We will look into it. Thanks.

  • @kuwarkapur137 3 years ago

    Loved the course. On a scale of beginner, intermediate, and advanced, where would you place your course?

    • @SimplilearnOfficial 3 years ago

      This video is for beginner and intermediate levels. For more advanced concepts, you can check out our machine learning course: www.simplilearn.com/big-data-and-analytics/machine-learning-certification-training-course

  • @anirvana5587 4 years ago

    Hello. I am trying to sort through a dataset with values of either 0 or 1 and separate them into 2 bins, but the bin declaration bins=(2,.5,1) is giving me a ValueError. How would I store all of that data in 2 bins?
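
    One likely cause (an assumption, since the full traceback is not shown): in the tutorial's `bins=(2, 6.5, 8)` every number is a bin edge, so writing `(2, .5, 1)` asks for edges that go down and then up, and `pd.cut` requires edges to increase. A sketch with made-up 0/1 values, where two increasing intervals are chosen so that 0 and 1 each fall inside one of them:

```python
import pandas as pd

values = pd.Series([0, 1, 1, 0, 1])

# Edges 2 -> 0.5 -> 1 are not increasing, so pandas raises a ValueError.
try:
    pd.cut(values, bins=(2, 0.5, 1))
    error_message = None
except ValueError as exc:
    error_message = str(exc)
print(error_message)

# Increasing edges: (-0.5, 0.5] holds the 0s and (0.5, 1.5] holds the 1s.
binned = pd.cut(values, bins=(-0.5, 0.5, 1.5), labels=[0, 1])
print(binned.tolist())
```

    Note that intervals are open on the left by default, so edges of `(0, 0.5, 1)` would leave the value 0 outside every bin unless `include_lowest=True` is passed. For data that is already 0 or 1, the binning step can also simply be skipped.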

  • @sebastiankioli 5 years ago +1

    I have just started with the tutorial and it seems very informative. I would like to have the dataset.

    • @SimplilearnOfficial 5 years ago

      Hi Sebastian, we are glad you found our video informative. It would be helpful if you will provide your email ID to us so that we could send the requested dataset promptly.

  • @m.f.abouhashem4285 5 years ago +1

    great tutorial thanks

  • @selvamraj1967 4 years ago

    How do I get the parameters used for the prediction, so that we could use those parameters for future estimates offline in some other software like Excel?

  • @russakaushik8317 4 years ago

    Thanks for sharing the tutorial. Taught in a very lucid way.
    Quick question:
    If we write the code snippet below,
    bins=(2,6.5,8)
    group_names=['0','1']
    wine['quality']=pd.cut(wine['quality'],bins=bins,labels=group_names)
    it will transform the quality column into binary values without a further encoder and fit_transform.
    Then we do not need to do LabelEncoder() and label_quality.fit_transform(wine['quality']) .
    Am I correct?
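
    A small sketch of the idea in this question, assuming pandas and a made-up `quality` column. With `labels=['0', '1']` the result is a categorical column whose labels are the strings '0' and '1'; passing `labels=False` instead returns the integer bin index directly, so under that assumption no separate `LabelEncoder` step is needed.

```python
import pandas as pd

# Made-up quality scores for illustration.
wine = pd.DataFrame({"quality": [3, 5, 6, 7, 8]})
bins = (2, 6.5, 8)  # intervals (2, 6.5] and (6.5, 8]

# String labels give a categorical column with categories "0" and "1".
as_category = pd.cut(wine["quality"], bins=bins, labels=["0", "1"])

# labels=False gives the integer index of the bin each value falls into.
as_int = pd.cut(wine["quality"], bins=bins, labels=False)

print(as_category.dtype)
print(as_int.tolist())
```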

    • @SimplilearnOfficial 4 years ago

      Hey Russa, thank you for appreciating our work. We are glad to have helped. Do check out our other tutorial videos and subscribe to us to stay connected. Cheers :)

  • @jp2nyy 4 years ago +1

    How can I do sklearn for a CSV with multiple values?

    • @SimplilearnOfficial 3 years ago

      Hello, thanks for viewing our tutorial. It would be helpful if you will provide your email ID to us so that we can send the requested dataset promptly. On the off chance that you need your email ID to be kept hidden from others, we can do that too. Hope that helps.

    • @jp2nyy 3 years ago

      @SimplilearnOfficial I actually have a dataset that I am working with, but I am wondering if you could help guide me on how to do ML for it so I can make predictions.

  • @lets_get_it 3 years ago +1

    Where can I find the data set?

    • @SimplilearnOfficial 3 years ago

      Hello, thanks for viewing our tutorial. It would be helpful if you will provide your email ID to us so that we can send the requested dataset promptly. On the off chance that you need your email ID to be kept hidden from others, we can do that too. Hope that helps.

    • @lets_get_it 3 years ago

      @SimplilearnOfficial kompajam@gmail.com

  • @ganesan7968 4 years ago

    I have a doubt.
    Why not predict y_test?
    Please explain classification_report(y_test, pred_rfc).

  • @ChrisPChickennn 5 years ago

    Hi, thanks for providing a great intro to scikit-learn. I'm trying to recreate the analysis on a test file, but I'm running into an issue. I am trying to predict the font colour for a coloured background in an Excel file. I created a bunch of random RGB values and found their grayscale value to determine whether the font colour should be black or white. The black-or-white determination is recorded in a 4th column (after R, G, B) as 0 or 1. At cell 7 at runtime I get a KeyError, with the traceback pointing to the name of the 4th column, 'bgval'. I've ensured everything is spelled correctly, and I tried updating the CSV file to make sure the name does not confuse pandas somehow. I edited it on GitHub instead of editing the CSV file directly, but it reads properly in the prior cells, showing updated values. Is it because I have a binary test column? The bins are labeled (0,0.5,1), so I thought that should separate the bins to mimic the tutorial example.

  • @vedantkathe7711 3 years ago

    Thank you so much @simplilearn, incredible tutorial and explanation!! It would be even better if you could provide a Jupyter notebook link.

    • @SimplilearnOfficial 3 years ago

      Hello, thanks for viewing our tutorial. It would be helpful if you will provide your email ID to us so that we can send the requested dataset promptly. On the off chance that you need your email ID to be kept hidden from others, we can do that too. Hope that helps.

  • @tanmaykadam6789 5 years ago +2

    In bins(2,6.5,8),
    what is the use of 8?

    • @glowish1993 5 years ago +5

      Bins work in intervals. So one bin for values between 2-6.5 and another for values between 6.5-8. If you only put bins(2,6.5) you will only get one bin for range 2-6.5

    • @SimplilearnOfficial 5 years ago

      Thanks for your input!

    • @marvin4519 5 years ago +1

      thanks so helpful

    • @SimplilearnOfficial 5 years ago

      Very welcome!

    • @nikitapatil5857 4 years ago

      If I put directly bins(2,8) then??
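
      A quick way to check this is to count the intervals pandas creates for each choice of edges. The sketch uses made-up quality scores; `bins=(2, 8)` has only two edges, so it yields a single interval and every wine lands in the same group.

```python
import pandas as pd

# Made-up quality scores for illustration.
quality = pd.Series([3, 5, 6, 7, 8])

two_bins = pd.cut(quality, bins=(2, 6.5, 8))  # edges 2, 6.5, 8 -> two intervals
one_bin = pd.cut(quality, bins=(2, 8))        # edges 2, 8 -> one interval

n_two = len(two_bins.cat.categories)
n_one = len(one_bin.cat.categories)
print(n_two, list(two_bins.cat.categories))
print(n_one, list(one_bin.cat.categories))
```

      With only one group there is nothing left for a classifier to distinguish, so at least three edges are needed for a good/bad split.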

  • @mandeepbaluja5401 5 years ago +1

    Love it 😊

    • @SimplilearnOfficial 5 years ago

      Thanks for your love and support! Do subscribe to our channel and never miss any updates! Cheers!

  • @llawliet7241 4 years ago

    Could it be possible that the random forest mislabels a lot of good wines because the training data is imbalanced and has a lot more bad than good wines?

    • @SimplilearnOfficial 4 years ago

      Hi Lawliet,
      Yes, it is definitely possible that the predictions might completely go wrong if the data is inaccurate or has a lot of noise.
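
      On the imbalance point specifically, two commonly used options can be sketched as follows, assuming synthetic data with a made-up class ratio rather than the wine file: `stratify=y` keeps the class ratio the same in the train and test splits, and `class_weight="balanced"` makes the random forest weight the rarer class more heavily.

```python
from collections import Counter

from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import recall_score
from sklearn.model_selection import train_test_split

# Synthetic imbalanced data: class 1 ("good") is the minority. The ratio is made up.
X, y = make_classification(
    n_samples=1000, n_features=6, weights=[0.86, 0.14], random_state=0
)

# stratify=y preserves the class ratio in both splits.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, stratify=y, random_state=42
)
class_counts = Counter(y_train)
print(class_counts)

# class_weight="balanced" weights classes inversely to their frequency.
rfc = RandomForestClassifier(
    n_estimators=100, class_weight="balanced", random_state=0
)
rfc.fit(X_train, y_train)

# Recall on the minority class: the share of truly "good" samples that were found.
recall_minority = recall_score(y_test, rfc.predict(X_test))
print(recall_minority)
```

      Whether this helps on a given dataset is an empirical question; comparing minority-class recall with and without the weighting is a simple check.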

  • @DanielWeikert 6 years ago

    Shouldn't the data we feed into our model for prediction (X new) be a NumPy array? Thanks

    • @vsaulas 6 years ago

      sklearn algorithms can be fed using numpy arrays or pandas dataframes, arrays as input are not an obligation

    • @SimplilearnOfficial 6 years ago

      Hi Daniel, it could be either an array or data frame. Both work fine.

  • @kshitijkhare306 4 years ago

    Hi, just a review: the background voice is unsuitable in any earphones. Please check it out.

    • @SimplilearnOfficial 4 years ago

      We are sorry about that, we will share the feedback with the relevant department

  • @jorgepedrocastillocarrillo7244 4 years ago +1

    Excellent video! Pretty straight forward, you just need to have a little background on Python to understand the code

    • @SimplilearnOfficial 4 years ago

      Hello, thank you for watching our video. We are glad that you liked our video. Do subscribe and stay connected with us. Cheers :)

  • @akaziehl 5 years ago +2

    I didn't understand why you use fit_transform on X_train but only transform on X_test. Shouldn't you also fit for the testing since it's how it's being trained?

    • @bholaprasad26 5 years ago +1

      Fitting data means you want the algorithm to learn from it. For that purpose, we use the training dataset. The problem is that a model can do very well at making predictions on the training data but perform poorly when faced with unseen data. We don't care too much about how well the model performs on the training set; we want it to do well on unseen data. The test set is used as a proxy for unseen data. If we fit on the test set, the algorithm learns from it, which we don't want during model building. We want to use the test set last, after going through the whole process, to see how well our model generalizes in the real world. That is why he used fitting and transforming on the training data but only transforming on the test data. I hope this helps.

    • @SimplilearnOfficial 5 years ago

      Thanks for your valuable input!

  • @takudzwadzikiti9028 1 year ago

    Where can we access the dataset you used in this tutorial?

  • @shemayakangera1392 2 years ago +1

    i can't access the dataset only the notebook

    • @SimplilearnOfficial 2 years ago

      Hello, thanks for viewing our tutorial. You can find your requested dataset in the video description. Hope that helps.

  • @PerryONeilEMT 4 years ago

    Thanks for an excellent tutorial. I found the data set online and used Jupyter to successfully replicate it with one exception. The neural network example did not converge. How can I submit my notebook privately?

    • @SimplilearnOfficial 4 years ago

      Hi Perry, you can share the query to this email ID: kennet.rajan@simplilearn.net
      We will try to help you out. Thanks.
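
      On the non-convergence itself, a common first thing to try is scaling the inputs and raising the iteration limit. The sketch below assumes the neural network in question is scikit-learn's `MLPClassifier` (the comment does not say) and uses synthetic data; the layer sizes and `max_iter` value are illustrative choices, not the video's settings.

```python
import warnings

from sklearn.datasets import make_classification
from sklearn.exceptions import ConvergenceWarning
from sklearn.neural_network import MLPClassifier
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

# Synthetic stand-in data (not the wine dataset).
X, y = make_classification(n_samples=300, n_features=8, random_state=0)

with warnings.catch_warnings(record=True) as caught:
    warnings.simplefilter("always")
    # Scaling first and allowing more iterations both make convergence more likely.
    mlp = make_pipeline(
        StandardScaler(),
        MLPClassifier(hidden_layer_sizes=(11, 11), max_iter=2000, random_state=0),
    )
    mlp.fit(X, y)

# True if training stopped at max_iter without meeting the tolerance.
hit_limit = any(issubclass(w.category, ConvergenceWarning) for w in caught)
n_iter = mlp.named_steps["mlpclassifier"].n_iter_
print(hit_limit, n_iter)
```

      A `ConvergenceWarning` is a warning rather than an error: the model is still usable, but its weights may not have settled.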

  • @gauranshbhutani592 5 years ago

    Hi Gaurie. I'm starting to learn machine learning/scikit. Thanks for your tutorial. Please send me the dataset and code. Appreciate ...you are a gem of an instructor....

    • @gauranshbhutani592 5 years ago

      gaurtcd@gmail.com

    • @SimplilearnOfficial 5 years ago

      Hi Gauransh, thanks for watching our video. We have sent the requested dataset to your mail ID. Do subscribe to our channel and get our new video updates directly into your email. If you have any questions related to these videos, you can post in the comments section, we will clear your queries/doubts.

  • @kaushiktummalapali4000 5 years ago +1

    Can I get that data set?

    • @SimplilearnOfficial 5 years ago +1

      Hello Kaushik, thanks for viewing our tutorial and we hope it is helpful. It would be helpful if you will provide your email ID to us so that we could send the requested dataset promptly.

  • @veera7202 5 years ago

    @simplilearn you are really awesome man !! Such a great work
    May I know from where the data sets can be taken ??

    • @SimplilearnOfficial 5 years ago

      Hello, thanks for viewing our tutorial. It would be helpful if you will provide your email ID to us so that we could send the requested dataset promptly. On the off chance that you need your email ID to be kept hidden from others, we can do that also. Hope that helps.

    • @smartguy3043 4 years ago +7

      This Dataset is from UC Irvine . Link : archive.ics.uci.edu/ml/datasets/wine+quality

  • @IamOlufunmi 2 years ago +1

    I need the datasets used in this video please.

    • @SimplilearnOfficial 2 years ago +1

      Hello, thanks for viewing our tutorial. You can find your requested dataset in the video description. Hope that helps.

  • @chap400001 4 years ago

    Hi, very well done. Could you please send me the dataset used in this video? Thanks again.

    • @SimplilearnOfficial 4 years ago

      Hello Steve, thanks for viewing our tutorial and we hope it is helpful. It would be helpful if you will provide your email ID to us so that we could send the requested dataset promptly.

  • @gerardohuerta567 3 years ago +1

    Hi, where can I get that data?

    • @SimplilearnOfficial 3 years ago

      Hello, thanks for viewing our tutorial. You can find your requested dataset in the video description. Hope that helps.

  • @AMDoria-bu2yj 4 years ago

    Hi, I'm new to the Jupyter editor and to scikit-learn. I was wondering if you could help me out with some issues, like reading the CSV file.

    • @SimplilearnOfficial 4 years ago

      Hi, you can check out our Jupyter notebook video to learn interface of it: th-cam.com/video/3C9E2yPBw7s/w-d-xo.html

  • @GustavoLeig 5 years ago

    Could I use this model to predict wine prices? I mean, I would not use the bins, would like to get an estimation of the price. btw awesome video

    • @SimplilearnOfficial 5 years ago

      Hi Gustavo, it is possible to predict the prices of the wine using the wine dataset. Since price is a quantitative variable, you need to use a regression model such as linear regression.
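
      A minimal sketch of the regression idea, with a loud caveat: the `price` values here are invented for illustration and follow an exact straight line, so the fitted model recovers it. Real prices would need to come from a dataset that actually records them.

```python
import numpy as np
from sklearn.linear_model import LinearRegression

# Invented example: price depends linearly on alcohol content (price = 2 * alcohol + 1).
alcohol = np.array([[9.0], [10.0], [11.0], [12.0]])
price = 2.0 * alcohol.ravel() + 1.0

reg = LinearRegression().fit(alcohol, price)

# Predict the price for a wine with 13% alcohol.
predicted = reg.predict(np.array([[13.0]]))
print(reg.coef_[0], reg.intercept_, predicted[0])
```

      No binning or label encoding is involved, because the target is a continuous number rather than a class.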

  • @abdillahmohamed9585 5 years ago

    I am new in machine learning and now I am facing an issue. I have 7 projects I would like to predict whether a pull request would be rejected or not (Yes or No). And I would like to build a prediction model by using data from 6 projects as source project and predict the rejection of the pull request in the seventh project as a target project. Can you please tell me how can I structure my algorithm in Scikit-learn? Hope that my question is clear.
    Thanks

    • @SimplilearnOfficial 5 years ago

      First, you need to ensure your data is in the right format. One problem you might face when pulling data from different sources is the structure. Gather all the information into a single sheet, and then you can proceed as usual.
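
      Once everything is in a single table, one way to structure the cross-project setup is to keep a `project` column and split on it instead of splitting at random. The sketch below is an illustration under assumptions: the data is synthetic, and the column names (`project`, `lines_changed`, `files_changed`, `rejected`) are hypothetical.

```python
import numpy as np
import pandas as pd
from sklearn.ensemble import RandomForestClassifier

# Synthetic pull-request table covering seven projects (ids 1..7).
rng = np.random.default_rng(0)
n = 350
data = pd.DataFrame(
    {
        "project": rng.integers(1, 8, size=n),
        "lines_changed": rng.integers(1, 500, size=n),
        "files_changed": rng.integers(1, 20, size=n),
    }
)
# Made-up rule for the label, only so the example has something to learn.
data["rejected"] = (data["lines_changed"] > 250).astype(int)

features = ["lines_changed", "files_changed"]

# Projects 1-6 are the source (training) data; project 7 is the target (test) data.
is_target = data["project"] == 7
train, test = data[~is_target], data[is_target]

clf = RandomForestClassifier(n_estimators=50, random_state=0)
clf.fit(train[features], train["rejected"])
accuracy = clf.score(test[features], test["rejected"])
print(accuracy)
```

      Repeating this with each project in turn as the target gives a leave-one-project-out evaluation.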

  • @MrHedren 4 years ago

    Hi,
    I get ModuleNotFoundError for some of the sklearn imports in Jupyter. I have everything needed installed. Do you have a guess as to what the problem could be?

    • @MrHedren
      @MrHedren 4 years ago

      Got it fixed!
      Great video!

    • @SimplilearnOfficial
      @SimplilearnOfficial  4 years ago

      Thanks for watching our video. Do subscribe to our channel and stay tuned for more.

  • @marianahebborn8026
    @marianahebborn8026 5 years ago

    Great tutorial! Where could I get the video?

    • @SimplilearnOfficial
      @SimplilearnOfficial  5 years ago +1

      Thanks for the kind comment. You can get our videos through our TH-cam channel. Thanks.

  • @ezhilarasu5822
    @ezhilarasu5822 4 years ago

    I have a doubt... I don't understand the purpose of X and y variables for separating the data set

    • @SimplilearnOfficial
      @SimplilearnOfficial  4 years ago

      "Hi Ezhil,
      In scikit-learn, X holds the feature columns (the inputs) and y holds the target labels we want to predict. Both are then divided into training and testing sets."
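
      To make the roles of X and y concrete, here is a small sketch. The tiny `wine` table and its values are made up for illustration; only the `quality` column name comes from the discussion in this thread.

```python
import pandas as pd
from sklearn.model_selection import train_test_split

# Tiny, made-up table standing in for the wine data.
wine = pd.DataFrame({
    "alcohol": [9.4, 9.8, 10.5, 11.2, 12.0, 9.1, 10.0, 12.8],
    "pH": [3.51, 3.20, 3.26, 3.16, 3.30, 3.45, 3.39, 3.10],
    "quality": ["bad", "bad", "good", "good", "good", "bad", "bad", "good"],
})

# X: every column except the one we want to predict (the features).
X = wine.drop("quality", axis=1)

# y: only the column we want to predict (the target).
y = wine["quality"]

# Split rows into training and testing sets; X and y are split together
# so each row of features stays paired with its label.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=42
)

print(X_train.shape, X_test.shape)
```

      So X and y separate inputs from the answer, while `train_test_split` is what separates training rows from testing rows.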

  • @IneeAder
    @IneeAder 3 years ago +1

    Daaang, I know this is 2 years old, but "restart and run all" for the kernel didn't work at the very beginning so my entire Jupyter notebook was erroring out :(

    • @SimplilearnOfficial
      @SimplilearnOfficial  3 years ago

      "Hi Inee,
      The Restart and Run All option in Jupyter Notebook restarts the kernel and runs all the cells. It will throw errors if any of the code cells contains an error."

  • @humptyneupane9226
    @humptyneupane9226 4 years ago

    Why didn't we scale the Y_train data? Please reply.

  • @ahsannaseer7526
    @ahsannaseer7526 4 years ago

    What if we have more than one quality level, e.g. qualities ranging from 1 to 10? How will the code change in this case?

    • @SimplilearnOfficial
      @SimplilearnOfficial  4 years ago

      Sure, could you please share your mail ID to get the dataset? Thanks.

    • @ahsannaseer7526
      @ahsannaseer7526 4 years ago

      @@SimplilearnOfficial kindly, give me the edited code for more than two qualities as well.
      ahsannaseer1122@gmail.com

  • @jeet198
    @jeet198 5 years ago

    Please tell me which libraries and tools I should use to make a chatbot.

    • @SimplilearnOfficial
      @SimplilearnOfficial  5 years ago

      Use the NLTK package to build chatbots. Stick around as we have a tutorial on the same coming up soon.

  • @varinderpunjab479
    @varinderpunjab479 4 years ago

    I am getting an error, something like KeyError: 'quality'.
    Could you help me solve this?

  • @adityasoni3794
    @adityasoni3794 4 years ago

    What are the prerequisites for this video?

    • @SimplilearnOfficial
      @SimplilearnOfficial  4 years ago

      Hi Aditya, we suggest learning scikit-learn after gaining some knowledge of Python, NumPy, and Pandas. Thanks.

  • @sandypearls5276
    @sandypearls5276 5 years ago +1

    Great Tutorial!
    Please send me the datasets used in this video..

    • @SimplilearnOfficial
      @SimplilearnOfficial  5 years ago

      Hello Sandy, thanks for viewing our tutorial. If you provide your email ID, we can send the requested dataset promptly. If you would like your email ID kept hidden from others, we can do that too. Hope that helps.

    • @sandypearls5276
      @sandypearls5276 5 years ago +1

      @@SimplilearnOfficial sandy.momo21@gmail.com
      Please keep it hidden, Thanks!

    • @SimplilearnOfficial
      @SimplilearnOfficial  5 years ago +2

      Hi Sandy, thanks for watching our video. We have sent the requested dataset to your mail ID. Do check out our other tutorial videos and subscribe to us to stay connected. Cheers :)

    • @baronchibuike6091
      @baronchibuike6091 5 years ago +1

      Simplilearn can I get the dataset too
      awesomebaron007@gmail.com

    • @nguyenvannhat5114
      @nguyenvannhat5114 5 years ago +1

      @@SimplilearnOfficial can u send me dataset: nvnhat.17ck1@gmail.com

  • @AritraPal-s1k
    @AritraPal-s1k 1 year ago

    Sir, what are bins (in the preprocessing section)?

  • @shaistarahman9178
    @shaistarahman9178 4 years ago

    v.good work

    • @shaistarahman9178
      @shaistarahman9178 4 years ago

      Sir, could you send me the source code for Scikit-Learn Tutorial | Machine Learning With Scikit-Learn | Sklearn | Python Tutorial | Simplilearn

    • @SimplilearnOfficial
      @SimplilearnOfficial  4 years ago

      Hello Shaista, thanks for viewing our tutorial. If you provide your email ID, we can send the requested dataset promptly. If you would like your email ID kept hidden from others, we can do that too. Hope that helps.

  • @oziashounkpatin6628
    @oziashounkpatin6628 3 years ago

    Great tutorial. Thanks. Would like to have data and code

    • @SimplilearnOfficial
      @SimplilearnOfficial  3 years ago

      Hello, thanks for viewing our tutorial. If you provide your email ID, we can send the requested dataset promptly. If you would like your email ID kept hidden from others, we can do that too. Hope that helps.

  • @ejasackey
    @ejasackey 4 years ago

    Very nice tutorial! Please, how do I access the datasets? I really need them...

    • @SimplilearnOfficial
      @SimplilearnOfficial  4 years ago +1

      Hi, thanks for watching our video. We have sent the requested dataset to your mail ID. Do show your love by subscribing to our channel using this link: th-cam.com/users/Simplilearn and don't forget to hit the like button as well. Cheers!

  • @memesv3.093
    @memesv3.093 5 years ago +1

    Can i plz get this wine dataset....Thanking u in anticipation

    • @SimplilearnOfficial
      @SimplilearnOfficial  5 years ago

      Hello, thanks for viewing our tutorial. If you provide your email ID, we can send the requested dataset promptly. If you would like your email ID kept hidden from others, we can do that too. Hope that helps.

    • @GrayCo
      @GrayCo 5 years ago

      The dataset is at the UCI Machine Learning Repo. You will need to do some work to get this dataset properly ready for preprocessing.
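
      A rough sketch of what that preparation might look like. It assumes the file is semicolon-delimited, which is how the UCI wine-quality CSVs are distributed as far as I know; the inline sample rows, the bin edges `[2, 6.5, 8]`, and the `quality_label` column name are illustrative assumptions, not taken from the video.

```python
import io
import pandas as pd

# Inline sample mimicking a semicolon-delimited wine-quality file.
# With the real file you would pass its path to read_csv instead.
sample = io.StringIO(
    '"fixed acidity";"volatile acidity";"alcohol";"quality"\n'
    "7.4;0.7;9.4;5\n"
    "7.8;0.88;9.8;5\n"
    "11.2;0.28;9.8;6\n"
    "7.3;0.65;10.0;7\n"
)

# Without sep=";" pandas would read each row as one column, and
# wine["quality"] would then raise a KeyError.
wine = pd.read_csv(sample, sep=";")

# Bin the numeric quality score into two classes (assumed bin edges).
wine["quality_label"] = pd.cut(
    wine["quality"], bins=[2, 6.5, 8], labels=["bad", "good"]
)

print(wine[["quality", "quality_label"]])
```

      If a `KeyError` for `quality` appears, printing `wine.columns` is a quick way to check whether the delimiter was parsed correctly.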

  • @marvin4519
    @marvin4519 5 years ago

    Why do I keep getting ranked numbers after label encoding my data?

  • @chrisyan5507
    @chrisyan5507 4 years ago

    I think SVM refers to support vector machine, not support vector model...

    • @SimplilearnOfficial
      @SimplilearnOfficial  4 years ago

      In machine learning, support-vector machines (SVMs, also support-vector networks) are supervised learning models with associated learning algorithms that analyze data used for classification and regression analysis.

  • @triyugchandra44
    @triyugchandra44 5 years ago

    Can you provide the wine dataset that you used in your program?

    • @SimplilearnOfficial
      @SimplilearnOfficial  5 years ago

      Hello Triyug, thanks for viewing our tutorial; we hope it is helpful. If you provide your email ID, we can send the requested dataset promptly.

  • @AroundBDVillage
    @AroundBDVillage 5 years ago

    was best bro ___

    • @SimplilearnOfficial
      @SimplilearnOfficial  5 years ago

      Thank you! Do subscribe to our channel and stay tuned.

  • @oliverdean4078
    @oliverdean4078 5 years ago

    Hi, I was wondering if I could get access to the dataset please?

    • @SimplilearnOfficial
      @SimplilearnOfficial  5 years ago

      Hello Oliver, thanks for viewing our tutorial; we hope it is helpful. If you provide your email ID, we can send the requested dataset promptly.

  • @harjitsingh7308
    @harjitsingh7308 5 years ago +1

    Can you do a tutorial showing the Linear Algebra/Calculus required for both machine learning and deep learning? Great video btw

    • @SimplilearnOfficial
      @SimplilearnOfficial  5 years ago

      Hey Harjit, thank you for watching our video. We will definitely look into your suggestions. Do subscribe and stay tuned for updates on our channel. Cheers :)