Machine Learning Algorithm- Which one to choose for your Problem?

แชร์
ฝัง
  • เผยแพร่เมื่อ 30 ก.ย. 2024
  • Here is a video which helps you understand which machine learning algorithm you should use for your use case.
    You can buy my book of finance with ML
    #Whichalgorithmtochooseforyourproblem
    amazon url : www.amazon.in/...

ความคิดเห็น • 250

  • @nandinidasgupta7781
    @nandinidasgupta7781 4 ปีที่แล้ว +114

    This is the only channel i guess who understands what students are expecting to know other than traditional algorithm knowledge.. Thank You so much..

    • @chandlerdarius6380
      @chandlerdarius6380 3 ปีที่แล้ว

      sorry to be offtopic but does any of you know a way to log back into an instagram account??
      I was dumb lost my login password. I would love any tricks you can offer me

    • @hunainaghai3342
      @hunainaghai3342 ปีที่แล้ว

      Which algorithm did u choose, Nandini?

    • @shubhpundir2191
      @shubhpundir2191 ปีที่แล้ว +3

      @@chandlerdarius6380 this dude 💀💀💀

  • @somalkant6452
    @somalkant6452 4 ปีที่แล้ว +14

    hi krish, it was an awesome video. just a doubt popping up in my mind, when there is overlap of datapoints, why cant we use SVM, bacause that will also take the points (of different classes) to other dimensions and divide the points using hyperplane. please correct me if my understanding is not correct. Than you so much for your awesome videos.

  • @translatethis7765
    @translatethis7765 3 ปีที่แล้ว +14

    The thing is, you are looking at the pair-plot for only two dimensions. Although everything is overlapped when projected into pairwise space, it doesn't necessarily mean the data is not linear separatable right?

  • @srikanththecoolhunk
    @srikanththecoolhunk 4 ปีที่แล้ว +18

    You have no idea how useful all your videos were to help me find a job. I cannot thank you enough Sir, please post more.

  • @kamran_desu
    @kamran_desu 4 ปีที่แล้ว +16

    Hey Krish, great video - just shows how important it is to understand the geometry/maths behind ML.
    In regards to using KNN here for overlapping data points, would SVM (radial/polynomial) be a good choice as well, since it will use the kernel trick to apply a non-linear classifier in a higher dimension?
    And how would SVM compare to the tree methods in terms of computation complexity?

    • @adityakishan589
      @adityakishan589 3 ปีที่แล้ว

      Same doubt. Krish please help us get clarity on this.

    • @karandua6564
      @karandua6564 3 ปีที่แล้ว

      Even logistic regression may work, he is just comparing two dimensions at a time, they may be linearly separable in higher dimensions
      .

    • @danielschulz5447
      @danielschulz5447 2 ปีที่แล้ว

      @@karandua6564 no, I think that is not true. LR will not work good, if single plots are pairwise highly overlapped and/or not dividible into straight lines.

  • @swetapatra
    @swetapatra 4 ปีที่แล้ว +7

    ok, so ideally, we decide on the algorithm based on the charts we have plotted?

  • @minime8745
    @minime8745 ปีที่แล้ว +1

    Shouldn’t we scale the features before plotting?

  • @fit_shaha
    @fit_shaha 3 ปีที่แล้ว +1

    Same ques....mere bhi mind m that Kon si algo...suru kru bt phir Maine ek tutorial ko follow kiya and dekho decision tree ko hi Laker 5 days ho chuke hai bt uska code m error show ho rhi hai

  • @chravi511
    @chravi511 3 ปีที่แล้ว +2

    Please Explain me,
    Scenario 1 : All Categorical, ordinal, nominal features - Categorical target
    Scenario 2 : All continuous features - Categorical target
    Scenario 3 : Combination of categorical and Continuous features - Categorical target
    Which model to use in these scenarios (particularly SCENARIO 1) ?

  • @varunsagartheegala
    @varunsagartheegala 3 ปีที่แล้ว +9

    Your videos is a gift to all data science aspirants like me and working professionals too. Thank you for making our learning easy and fun. Please don't stop

  • @syedtasleem2827
    @syedtasleem2827 5 ปีที่แล้ว +5

    Sir, What about SVM U didn't mention on what kind of Data it can be used after understanding pairplot,..

  • @chinmayjape4981
    @chinmayjape4981 4 ปีที่แล้ว +7

    Please make a video on how a beginner can participate in Kaggle competitions.

    • @ritwickjoshi
      @ritwickjoshi 3 ปีที่แล้ว

      brother you need to dive in to find out
      Dont worry about the result see where you lack and work from it
      but remember not to go down from your baseline

  • @harshj84
    @harshj84 4 ปีที่แล้ว +5

    Hi Krish,
    I really enjoyed your videos. The specialty of your videos are they fill the gap between theory and practical.
    I have watched tones of other videos, most of them are telling "how to plot the graph" but no one is explaining "how to leverage information from these plots?"
    Just like this video, we need to use knn as most of the points are overlapping. I would really love to such a great video for other algorithms as well. Not only me, but I believe your subscribers would also love them.
    Tons of thanks to you.

  • @puneettiwari2251
    @puneettiwari2251 3 ปีที่แล้ว +19

    Great session,
    Please make an elaborative video on this topic covering all the pros and cons of different algorithms and if possible with codes.
    Thanks for this session again sir🙏👍👍

  • @learbasicsofpythonwithus-pe1kk
    @learbasicsofpythonwithus-pe1kk หลายเดือนก่อน +1

    I never saw a video like this description...

  • @shreyasb.s3819
    @shreyasb.s3819 4 ปีที่แล้ว +4

    Can I use all kinds of all algorithms and choose which one give more accuracy? Is there any problem in this solution.?

    • @manishsharma2211
      @manishsharma2211 3 ปีที่แล้ว

      There is no problem, but it takes lot of time

  • @rajivturkar1671
    @rajivturkar1671 5 ปีที่แล้ว +6

    Great explaination Krishna.. I would like to know how pairplot will show categorical variable and how we will do the feature engineering for categorical variable?

    • @sejalchandra2114
      @sejalchandra2114 4 ปีที่แล้ว

      Yes, this is my doubt also. Please help!!

    • @manikaransingh3234
      @manikaransingh3234 4 ปีที่แล้ว

      @@sejalchandra2114 You first have to do the label encoding to your categorical features. and then use the pairplots.

  • @pravinkumar54
    @pravinkumar54 4 ปีที่แล้ว +3

    @Krish Naik 11:30 if you double pick the pairplot in jupyter notebook. It zooms in

  • @anandacharya9919
    @anandacharya9919 4 ปีที่แล้ว +7

    This is your best and most important video of all. Thank you 🙏

  • @manxyoutuber3272
    @manxyoutuber3272 3 ปีที่แล้ว

    I have eye blink eeg data set in txt formet
    Can you please say how to my model to predict eye blink in code. And which is best ml algorithm? It's something related to BCI project
    Can you please put some tutorial separately for Brain computer interface (BCI)

  • @samlo4166
    @samlo4166 3 ปีที่แล้ว +3

    Awesome Video! Thank you for sharing this. I have one question. When there is a lot of overlap, can I use SVM to solve this? thank you

  • @mithunkumar7063
    @mithunkumar7063 5 ปีที่แล้ว +4

    what if there are more number of categories..the graph takes time and will it be useful then???

  • @modernmonk17
    @modernmonk17 3 ปีที่แล้ว +2

    I tried to plot the same on my datset & it shows a mix of overlapping in some features & non-overlapping in others,so based on that which model should i go for?
    Any help would be appreciated

  • @yogitapeshane6934
    @yogitapeshane6934 4 ปีที่แล้ว +1

    It was really good video.. Please make more videos in DataScience...

  • @siddhantkaushal7669
    @siddhantkaushal7669 3 ปีที่แล้ว +4

    It is a good video. Also explained in detail to understand pair plots. :) Thanks..
    My Question is:
    When should we decide to use SVM and Naive by visualizing and analyzing the pair plots? :)

    • @datahat642
      @datahat642 ปีที่แล้ว

      @siddhant Naive Bayes works well with higher dimensionality (such as text corpuses) while support vectors whpith its kernel is applied when the data is non linear but you want to apply a linear classifier by transformation

  • @mandarchincholkar5955
    @mandarchincholkar5955 3 ปีที่แล้ว +1

    In this case, non linear svm is much better

  • @nageshpindi1348
    @nageshpindi1348 4 ปีที่แล้ว +2

    Does it works for regression techniques, I'm trying but not able to generate plots as you showed.

  • @asankab.l.5166
    @asankab.l.5166 3 ปีที่แล้ว +1

    Your book is out of stock, please provide a way to buy. Great tutorial!

  • @sithalrao
    @sithalrao 4 ปีที่แล้ว +2

    Hi Krish, if we have more numbers of features say 30, than in that case we wont be able to draw pair plot. What should be our approach in that scenario? Many thanks for your wonderful videos :)

  • @satriogani3105
    @satriogani3105 4 ปีที่แล้ว +1

    In the pairwise plot, the points are only plotted on two dimensions. However, our data have dimensions more than two. This means that we do not plot the other features. It looks that we do some projection from higher dimensional space to two dimensional space. Can we conclude if the groups overlap with each other only by looking on two dimensional plots? Simple example, if we project the earth on two dimensional space, we can see that the norther and southern hemisphere will overlap on each other.

    • @translatethis7765
      @translatethis7765 3 ปีที่แล้ว +1

      I have been looking for comments like this... I agree with you. I don't think one can conclude whether the data is linear separable, from just looking at pairwise plots.

  • @hbk1517
    @hbk1517 3 ปีที่แล้ว +1

    'KNN is good for overlapping data points', can you please share your source or cite a paper for this info?

  • @allandavis6116
    @allandavis6116 2 ปีที่แล้ว

    Completely incomprehensible, and I'm a PhD engineer, aerospace, not 'data science', .... first the accent is off the chart. Speak understandable English, or make the vid in your natvie language for your native country.

  • @famin7794
    @famin7794 หลายเดือนก่อน

    You explained the wrong concept of Logistic regression model
    . You mainly explained the linear regression model

  • @TrendyAnamika
    @TrendyAnamika 2 ปีที่แล้ว

    Hi bro, h r u, your videos are really very informative. I am from medical transcription field, but since 3 months learning data science, but need proper regarding good job with good salary package, it's very important bro, can u help?

  • @khalfanalkharusi6577
    @khalfanalkharusi6577 4 ปีที่แล้ว +2

    Thank you for your contribution. Please note also that classification learner app in Matlab provides you such pair plots

  • @siddharthasinha520
    @siddharthasinha520 3 ปีที่แล้ว

    This approach is not 100% correct. It shows which attributes are good. Those which can clearly separate the target into two classes are good variables. Thus this is at best a feature selection method. But often multiple variables together can also create a clear separation. So by doing pairplots, it may lead to high false negatives against the suitability of logistic regression. Better way is to take 10-20 smaller chunks from different locations and apply all algorithms. The ones which show consistently better performance (with least variaton)is a good algorithm for the given data.

  • @theindianrover2007
    @theindianrover2007 5 ปีที่แล้ว +2

    Liked this video, but in case of large test set KNN might prove you costly at the time of prediction

    • @MyTube6686
      @MyTube6686 4 ปีที่แล้ว

      how ? can you explain further Please

  • @amirhosseinrahimi3964
    @amirhosseinrahimi3964 ปีที่แล้ว +1

    Thanks a lot for such a great explanation 😊

  • @pearcestolts1696
    @pearcestolts1696 3 ปีที่แล้ว

    The silent badge suprisingly belong because stock phytochemically describe an a wry respect. defiant, flawless belgian

  • @navya-s3v
    @navya-s3v 21 วันที่ผ่านมา

    Selecting the appropriate machine learning algorithm is determined by aspects such as problem type, data qualities, and desired results.

  • @abdullahjhatial2614
    @abdullahjhatial2614 ปีที่แล้ว

    why my python script giving erroe as key TARGET CLASS ? please help

  • @MrDeepinaction
    @MrDeepinaction 5 ปีที่แล้ว +4

    Great video, friend!! Keep up the good work. Need to learn a lot through you.

  • @ranjaniselvaraj6475
    @ranjaniselvaraj6475 4 ปีที่แล้ว +2

    Good Job! Clearly explained. Thank you so much for this video.

  • @adarshkaul656
    @adarshkaul656 4 ปีที่แล้ว +1

    How to do model selection of Regression models ?? I m getting the cross validation scor of -2.8456

  • @rohinikalamkar5736
    @rohinikalamkar5736 3 ปีที่แล้ว +2

    This is what is required .. thank you so much for sharing this 👍👍 great work

  • @cedriclebocq1060
    @cedriclebocq1060 4 ปีที่แล้ว +1

    Hello Krish. I would like to order your book, but it seems to be unvailable on amazon for European zone. could you explain me how to order ? Thx

  • @thongtech1984
    @thongtech1984 4 ปีที่แล้ว +2

    thanks a lot for this wonderful explanation, Krishna. You are my new hero now :)

  • @karandua6564
    @karandua6564 3 ปีที่แล้ว

    At 13:30, you are making an assumption the data is not linearly separated based on just two dimensions. That's not correct, since the data can be linearly separable in multiple dimensions.

  • @nokiyem521
    @nokiyem521 3 ปีที่แล้ว +2

    Great, this is the video i was looking for to explain the difference from a basic mathematical perspective.

  • @BharathSalla
    @BharathSalla 2 หลายเดือนก่อน

    What if few of overlaps and few are having classification, what algorithm works ?

  • @malkiatsandhu6169
    @malkiatsandhu6169 2 ปีที่แล้ว

    hello Sir, can you make a project on detection of phishing website using machine learning

  • @cseswadhinpanda
    @cseswadhinpanda 2 ปีที่แล้ว

    Nice explanation , you explained us like teaching a parrot love you from india .

  • @arjyabasu1311
    @arjyabasu1311 5 ปีที่แล้ว +1

    Sir what about Naive Bayes classifier?? At what condition we should use Naive Bayes classifier?

  • @devmaharaj4640
    @devmaharaj4640 4 ปีที่แล้ว +2

    One of best video's I have seen , won't forget to return u the favour....

  • @kunikakhobragade6953
    @kunikakhobragade6953 ปีที่แล้ว

    great sir ...
    can u plz make a video on the comparison of every ML algorithm

  • @sahinur_zmn
    @sahinur_zmn 4 ปีที่แล้ว +1

    just awesome 💚💚💚i am just recharged after completing this video..thanks a lot.

  • @tilakrajchoubey5534
    @tilakrajchoubey5534 2 ปีที่แล้ว

    How can you say that logistic only forms straight line division. I am confused.

  • @sectiongmit1630
    @sectiongmit1630 5 หลายเดือนก่อน

    Hello Krish Sir. Does df.corr() give same result as pairplot?

  • @naveedrahman4556
    @naveedrahman4556 3 ปีที่แล้ว

    I used to be a fan...
    .
    .
    .
    But now.....
    .
    .
    .
    .
    I'M AN AIR CONDITIONER

  • @swethakulkarni3563
    @swethakulkarni3563 4 ปีที่แล้ว +1

    @krish.Naik Can you explain Techniques for regression too.

    • @nageshpindi1348
      @nageshpindi1348 4 ปีที่แล้ว

      I'm tried for regression but I'm not able to generate plots can you help me if you tried?

  • @ferielhachi1304
    @ferielhachi1304 2 ปีที่แล้ว

    What is the best technique to predict the QOS of services in microservices architecture please

  • @swagatamsaha835
    @swagatamsaha835 5 ปีที่แล้ว +1

    can you provide link for code?.You have solved my query.... thanks again

  • @vijaynale7893
    @vijaynale7893 5 ปีที่แล้ว +8

    Nice ... Please make more videos for data science, beacuse it better to understand into that way...... Thank you
    Also make video for data cleaning, Solve any business problem

  • @chravi511
    @chravi511 3 ปีที่แล้ว +4

    Hi Krish,
    I've been following your channel for most of my learning.
    The way you explain the things gives a very familiar approach and giving out what is needed in the way everyone can understand.
    This let me watch any video of yours till the end and learn completely.
    Happy to learn from you. Thanks

    • @deepuraju2456
      @deepuraju2456 2 ปีที่แล้ว

      Hi sir , can u give any idea problem statement for loan approval prediction

  • @itybahadur3625
    @itybahadur3625 5 ปีที่แล้ว +2

    Excellent video for an initial understanding! Thanks!

  • @saunitmarolia3901
    @saunitmarolia3901 3 ปีที่แล้ว +1

    Thank you Krish! All doubts cleared now.

  • @roshanajadhav5854
    @roshanajadhav5854 4 ปีที่แล้ว +2

    You are doing amazing job Sir,
    I am too late to land on your channel
    Keep it up
    You are providing knowledge free of cost where people taking money for this in thousands
    No need to join any private classes with huge pay if you will provide this material in sequential order
    Great job all the best👍💯

    • @fuck30093
      @fuck30093 2 ปีที่แล้ว

      Have you found an answer to your question?

  • @izzycrux48
    @izzycrux48 3 ปีที่แล้ว +2

    Thank you Krish for the amount of effort and insights you put into your videos. Really helps a lot🙏❤️👍 May God bless you and keep you well🙏

  • @jonty670
    @jonty670 4 ปีที่แล้ว

    Logistic regression is not what you are explaining here. Linear regression is where straight line comes

  • @salmashahid2380
    @salmashahid2380 3 ปีที่แล้ว

    For unsupervised learning which algorithm i have to select

  • @srujanas4520
    @srujanas4520 ปีที่แล้ว

    Anyone would like to join with me to do a new ml project? I will explain the task

  • @sameerdubey1990
    @sameerdubey1990 3 ปีที่แล้ว

    can u suggest any book which explains this method?

  • @ainju1
    @ainju1 2 ปีที่แล้ว

    Informative. Thank you so much

  • @shreyjain6447
    @shreyjain6447 2 ปีที่แล้ว +1

    How do we decide which algorithm to use in case of Regression task?

  • @ajaynegi6278
    @ajaynegi6278 2 ปีที่แล้ว

    why you didn't include Support Vector Machine

  • @arafamahmood9356
    @arafamahmood9356 3 ปีที่แล้ว +1

    Hello, thank you for your contents. Your explanation really helps. I have a question, in the beginning you are mentioning Logistic Regression drawn as a straight line, however, is not Logistic Regression a curve line? While the linear regression is a straight line?
    Thank you.

  • @anupambarasia7865
    @anupambarasia7865 ปีที่แล้ว

    very well explained thankyou so much sir

  • @saichoz
    @saichoz 4 ปีที่แล้ว +1

    Great Vid. You made things clear. Thanks

  • @ankitaB11
    @ankitaB11 3 ปีที่แล้ว +1

    Most important video. Explanation is just amazing.

  • @vinusanker8767
    @vinusanker8767 4 ปีที่แล้ว +3

    Bro, y is there no subtitles, I have a problem in hearing😞

    • @randomdude79404
      @randomdude79404 3 ปีที่แล้ว

      In my opinion there is no need for subtitles , I could hear him perfectly fine.

    • @masthanjinostra2981
      @masthanjinostra2981 3 ปีที่แล้ว +1

      @@randomdude79404 hey bro, he meant deaf 🧏‍♂️ or any disability of hearing..

    • @randomdude79404
      @randomdude79404 3 ปีที่แล้ว

      @@masthanjinostra2981 I humbly apologise sorry about that.

  • @thecitizen9747
    @thecitizen9747 2 ปีที่แล้ว

    Can we get simular video with regression analysis thanks

  • @farhankhan-zz6hz
    @farhankhan-zz6hz 5 ปีที่แล้ว +1

    Sir if possible please upload this python script on github

  • @akashchoudhuri3932
    @akashchoudhuri3932 3 ปีที่แล้ว +1

    Best video till now. Thank you Krish.

  • @Sunnysinghbhatia
    @Sunnysinghbhatia 2 ปีที่แล้ว

    This was very much helpful Thank you Krish

  • @deepuraju2456
    @deepuraju2456 2 ปีที่แล้ว

    Hlo sir I need a problem statement for loan approval prediction

  • @ravikumarasabohar
    @ravikumarasabohar 3 ปีที่แล้ว

    I have one python code. Will you help me make is error free?

  • @gebremedhnmehari8451
    @gebremedhnmehari8451 2 ปีที่แล้ว

    What about naive bayes. Please answer fast if you can.

  • @manishgautam5171
    @manishgautam5171 5 ปีที่แล้ว +2

    Thanks for this amazing video Krishna. However, some questions are striking on my mind. For binary class classification it is understood that if we have linearly separable data then we can use SVM or logistic regression or any other classifier which can make decision boundary. Suppose I have a dataset that can be separate by a decision boundary then I can make user of SVM, logistics regression or perceptron algorithm. How to choose best classifier out of these 3 classifier?

    • @MrKrishna351
      @MrKrishna351 4 ปีที่แล้ว +1

      Check with Cross validation technique to understand a better classifier

  • @KetanChaudharyTHE-GREAT-KETAN
    @KetanChaudharyTHE-GREAT-KETAN 4 ปีที่แล้ว

    Thanks a lot Krish......it's really very helpful ......

  • @shivanandhadapad5987
    @shivanandhadapad5987 3 ปีที่แล้ว

    Which one is suitable for integrity of Iot device

  • @diptakshisen5826
    @diptakshisen5826 4 ปีที่แล้ว

    sir, can you please provide the github link for this video? its not present in the descprition.

  • @sdz9081
    @sdz9081 3 ปีที่แล้ว

    Which book explains this visualization of data?

  • @vineethp8925
    @vineethp8925 3 ปีที่แล้ว

    Superb Explanation can you please do on regression

  • @MarioLaudazi
    @MarioLaudazi 5 หลายเดือนก่อน

    Thanks for the useful videos you post.

  • @PawanSingh-pf2bl
    @PawanSingh-pf2bl 4 ปีที่แล้ว

    Nice video. But I like the way you are presenting at whiteboard. It feels like we are in actual class.

  • @suhailabessa9901
    @suhailabessa9901 2 ปีที่แล้ว

    Woow, Thank you. its helpful

  • @beingrishi6450
    @beingrishi6450 ปีที่แล้ว

    Is overlapping denotes Overfitting?

  • @deeprajmazumder6261
    @deeprajmazumder6261 2 ปีที่แล้ว

    How to select the best model for a regression problem?

  • @Crazyanya_
    @Crazyanya_ ปีที่แล้ว

    Sir thank you so much you solve my problem

  • @sankarapandian.selvaraj
    @sankarapandian.selvaraj 4 ปีที่แล้ว +1

    All of your videos have a lot of useful information. Thank you...