Project 13. Customer Segmentation using K-Means Clustering with Python | Machine Learning Projects

แชร์
ฝัง
  • เผยแพร่เมื่อ 17 ธ.ค. 2024

ความคิดเห็น •

  • @gramps4017
    @gramps4017 2 ปีที่แล้ว +4

    Massive respect mate!!! Thanks for sharing your knowledge

  • @kaiiiser7
    @kaiiiser7 3 ปีที่แล้ว +1

    You are an absolute savior

  • @MrKapilsingh
    @MrKapilsingh 3 ปีที่แล้ว +5

    I watched many videos but none of them has better explanation than yours. Great job!!

    • @Siddhardhan
      @Siddhardhan  3 ปีที่แล้ว +2

      Thanks a ton, brother 😇

  • @onaopemipoAmobi
    @onaopemipoAmobi 2 ปีที่แล้ว

    You just solved a problem. Thank you for sharing this.

  • @mat9095
    @mat9095 7 หลายเดือนก่อน +1

    Awesome video 💯💯...understood each and every line you said....just wonderful.....keep going...all the best 👍

  • @DigsHere
    @DigsHere ปีที่แล้ว

    Sooo helpful.
    You gave explanation precisely with points.
    Thank you so much 👍🏻

  • @GoodLuckForever-wi9kb
    @GoodLuckForever-wi9kb ปีที่แล้ว

    Thanks
    Like Your method to teach basics alongwith advance topics.
    Appreciate

  • @heitorandrade6387
    @heitorandrade6387 3 ปีที่แล้ว +4

    Really good ! If you could provide in future an example of segmentation for a B2B business , it would be great. Thanks !

  • @victorialeigh2726
    @victorialeigh2726 2 ปีที่แล้ว +2

    Hi Siddhardhan
    It's a really really nice lecture. Very considerate and detailed, explaining everything the learners need / want to know!
    Happy new year to you!

    • @Siddhardhan
      @Siddhardhan  2 ปีที่แล้ว

      Thanks a ton! Happy New year!

  • @santoshr1708
    @santoshr1708 2 ปีที่แล้ว +1

    Thank you sir. You are doing great work.

  • @samirpatel7892
    @samirpatel7892 2 ปีที่แล้ว

    easy understanding about the k mean clustering

  • @farazrizvi416
    @farazrizvi416 3 ปีที่แล้ว +6

    Love your videos man, you have a great way of explaining your code and the concepts.
    I wanted to ask 2 questions; firstly, does it make sense to add the array of clusters back to the original dataframe so you can identify which customers the clusters represent? It would make graphing the clusters easier and also allow you to optimize hyperparameters without needing to retype the code.
    Secondly, can you do videos on deploying models? I couldn't find any good videos of that. - Thanks a lot!

  • @ArtisticGautam318
    @ArtisticGautam318 5 วันที่ผ่านมา

    Really Amazing!

  • @priyankathakur1691
    @priyankathakur1691 ปีที่แล้ว

    Thank u for making such good videos. PLease keep making more videos on different projects of other unsupervised learning models.

  • @ginnaamulya461
    @ginnaamulya461 3 ปีที่แล้ว +1

    Clear explanation 👍

  • @manepallisaisanthosh3728
    @manepallisaisanthosh3728 3 ปีที่แล้ว +1

    bro your videos are helping very much pls continue bro..and attach handwritten material it will help a lot

    • @Siddhardhan
      @Siddhardhan  3 ปีที่แล้ว +2

      Hi! Thanks for your positive words 😇 I'll see what I can do about the notes

  • @PenAndSpecs007
    @PenAndSpecs007 2 ปีที่แล้ว

    Awesome tutorial! Amazing and clear explanation.

  • @fatmamansour8606
    @fatmamansour8606 3 ปีที่แล้ว

    excellent video and explanation too. Thank you so much and definitely will watch the rest of the videos.

  • @adityamahamuni7365
    @adityamahamuni7365 3 ปีที่แล้ว +2

    This one was superbly explained, especially the difficult to understand syntax of .iloc[ ] and one in the scatter plot too. Also I'm a huge fan of Data Visualisation, absolutely love it.
    I had a question: how can we plot a similar scatterplot using Seaborn? Could you please show it in one of the future Unsupervised Learning project videos?
    As always, thanks a lot!🙌🏻

    • @Siddhardhan
      @Siddhardhan  3 ปีที่แล้ว

      sure! will do that. use sns.scatterplot( )

    • @adityamahamuni7365
      @adityamahamuni7365 3 ปีที่แล้ว

      @@Siddhardhan are all the arguments the same?

  • @raveenachakrapani1090
    @raveenachakrapani1090 3 ปีที่แล้ว

    Highly helpful. Awesome explanation !!

    • @Siddhardhan
      @Siddhardhan  3 ปีที่แล้ว

      Glad to hear that!😇

  • @khushiprasad
    @khushiprasad 4 หลายเดือนก่อน +1

    Thanks for your amazing content❤🎉.

  • @rohinihrudhay5600
    @rohinihrudhay5600 6 หลายเดือนก่อน

    Thank you so much man you really helped me a lot.

  • @KiddyKidz167
    @KiddyKidz167 2 ปีที่แล้ว

    So nice explanation

  • @dineshjothiram3677
    @dineshjothiram3677 3 ปีที่แล้ว

    Your work was awesome and really appreciated sir.

    • @Siddhardhan
      @Siddhardhan  3 ปีที่แล้ว +1

      Thanks a ton😇

    • @dineshjothiram3677
      @dineshjothiram3677 3 ปีที่แล้ว

      @@Siddhardhan sir in this project I can't understand KMeans syntax sir. (Video is blur sir)

    • @dineshjothiram3677
      @dineshjothiram3677 3 ปีที่แล้ว

      @@Siddhardhan sir in this project I can't understand KMeans syntax sir. (Video is blur sir)

  • @csebreach8565
    @csebreach8565 3 ปีที่แล้ว +8

    Can i use this in my resume and linked in profile .. If i use what are the hashtags and conditions to be followed..By the way you are supporting pillar to all upcomers for candidates like me.. thank you

    • @Siddhardhan
      @Siddhardhan  3 ปีที่แล้ว +25

      hi! i don't demand any citation and credit. Just understand the project completely before putting it in your resume, so that you can answer the questions related to this project asked by the interviewer.

    • @csebreach8565
      @csebreach8565 3 ปีที่แล้ว +3

      @@Siddhardhan thank you so much

  • @adarshn8043
    @adarshn8043 ปีที่แล้ว

    Super video and good explanation
    What is the accuracy

  • @Vartwol
    @Vartwol ปีที่แล้ว

    Thanks . This one really helped
    Only one doubt ,how canwe display the clustered dataitems in tabular form instead of scatter plot?

  • @gkakash4789
    @gkakash4789 3 ปีที่แล้ว

    It's really helpful to me...

  • @manojchoukiker3232
    @manojchoukiker3232 3 ปีที่แล้ว

    your lecture series very good for understating and nicely explain, i request to you please increase the font size or increase the screen size. t very small font so its difficult to read. thanks sir . please continue....

    • @Siddhardhan
      @Siddhardhan  3 ปีที่แล้ว +1

      sure! will look into it.

  • @johnnyannan8341
    @johnnyannan8341 2 ปีที่แล้ว

    Thank you very much

  • @sandipansarkar9211
    @sandipansarkar9211 2 ปีที่แล้ว

    finished watching

  • @prashu25925
    @prashu25925 3 ปีที่แล้ว

    Brilliant....thanks

  • @rahulgaud4340
    @rahulgaud4340 ปีที่แล้ว

    Thanks a lot.

  • @copaceanubobi6101
    @copaceanubobi6101 3 ปีที่แล้ว

    i hope you make a project for image segmentation using k-means clustering algorithm .Thank you for all the projects done

  • @sachinvithubone4278
    @sachinvithubone4278 3 ปีที่แล้ว

    thanks for video.
    For clustering problem we no need to split data into train and test set?
    we are not used other features variables.. if in dataset having more columns then also we can use few columns?

    • @Siddhardhan
      @Siddhardhan  3 ปีที่แล้ว +2

      we generally don't split the data for K-Means. if we want to predict new Values, we may need to split it. it depends...
      and yes, if we have more suitable columns, we would use them. let's say that there are 3 features. in that case, we would make a 3d scatter plot.

  • @playatanu
    @playatanu 2 หลายเดือนก่อน +1

    Thanks

  • @debasmitadash9974
    @debasmitadash9974 2 ปีที่แล้ว

    Is there any video explaining the theory of k-means clustering? I would like to watch that but couldn't find in your playlist.

  • @newmail56
    @newmail56 3 ปีที่แล้ว

    Sir i can't understand where in the entire code the centroid are being calculated before finding out the WCSS from the data set ?? please explain

  • @NithyaHV
    @NithyaHV ปีที่แล้ว

    Hello Siddhardhan, How many dataset r u taking for Training and Testing?

  • @toniiicarbonelll287
    @toniiicarbonelll287 3 ปีที่แล้ว

    amazing!

  • @growingfire
    @growingfire 6 หลายเดือนก่อน

    Thanks a lot !

  • @AmitSoni-us7iq
    @AmitSoni-us7iq หลายเดือนก่อน

    Why we did not check for Outliers ?

  • @MuhammadKamran-ii4rh
    @MuhammadKamran-ii4rh 3 ปีที่แล้ว

    Hi bro glad to see but waiting for neural network video

    • @Siddhardhan
      @Siddhardhan  3 ปีที่แล้ว +1

      hi! those will be covered in deep learning course.

  • @agnivamitra8775
    @agnivamitra8775 2 ปีที่แล้ว

    Sir if we had more than two featured columns in k means clustering then how will we plot the clusters in multi dimension like 4d 5d?

  • @harinirk3365
    @harinirk3365 3 ปีที่แล้ว +1

    Sir here you are comparing annual income and spending score here. If i want to compare all the attributes ( age, gender, annual income)vs spending score how to proceed sir? Can I get the code for that?

    • @Siddhardhan
      @Siddhardhan  3 ปีที่แล้ว

      hi! I don't have the code for it at the moment. you can consider multiple dimensions. but you may not get proper clusters.

  • @kollukulurusaitharun7682
    @kollukulurusaitharun7682 3 ปีที่แล้ว

    Mr Siddhardhan can u do a project on Handwritten Digits recognition with convolution neural networks.

  • @ChandraKanthmamilla
    @ChandraKanthmamilla 3 ปีที่แล้ว

    Thanks for uploading this video. Can you please upload a video on how this customer segmentation works in any domain with end to end implementation?

    • @Siddhardhan
      @Siddhardhan  3 ปีที่แล้ว

      hi! I am planning to make videos on deployment after ML and DL course.

  • @AwanthaK
    @AwanthaK 3 ปีที่แล้ว +1

    Hi! Thanks for the content. Really appreciate your work.
    Can you do a video on ML project personality Prediction system by analyzing CVs?
    It would be really helpful.

  • @gujaratfortunegiants8263
    @gujaratfortunegiants8263 10 หลายเดือนก่อน

    I have a doubt... here for plotting the clusters we used 2 data in x array... but what if i have more than 2 features? plz suggest

  • @prithviraj2366
    @prithviraj2366 3 ปีที่แล้ว

    That was awesome video bro. Can you teach us like where to start and basics of Python. Like from the scratch to learn. thanks in advance

    • @Siddhardhan
      @Siddhardhan  3 ปีที่แล้ว +1

      hi! you can learn from this playlist: th-cam.com/play/PLfFghEzKVmjsNtIRwErklMAN8nJmebB0I.html
      it is from basics and also contains python basics.

  • @54-nileshparab96
    @54-nileshparab96 ปีที่แล้ว

    Do i need to scale independent variables before clustering?

  • @gaddipatisrikanth1177
    @gaddipatisrikanth1177 3 ปีที่แล้ว

    sir i got an error while plotting elbow graph
    it was showing that X and Y should be in same dimension!!
    pl suggest what to do!!!

  • @LoneWolf-rj1px
    @LoneWolf-rj1px 2 ปีที่แล้ว

    Please show the silhouette score for unsupervised learning!

  • @khansamar6889
    @khansamar6889 2 ปีที่แล้ว

    can we use only one column i.e based on spending score?

  • @bhavyav8355
    @bhavyav8355 3 ปีที่แล้ว

    can u explain how the centroids are placed in the graph and how the labels like 0,1,2,3,4 is assigned to data points

    • @Siddhardhan
      @Siddhardhan  3 ปีที่แล้ว

      hi! I explained it in a short way while implementing the model. you can check that out. I'll make a detailed video on this later while discussing about ML models. it's hard to explain it in the comments.

  • @anandtalware2283
    @anandtalware2283 ปีที่แล้ว

    I am confused, what if we have more than 2 features/columns,, then while plotting clusters,, then x[y_pred=0, 0] is for 1st cluster and 1st colum on x-axis,, then what about more than 2 columns while plots? I hope you would reply this comment and solve my confusion😢😢😢

  • @sashigole7588
    @sashigole7588 3 ปีที่แล้ว

    Nice one really helpful!! Can you do anything on student performance predictions using machine learning ??

    • @Siddhardhan
      @Siddhardhan  3 ปีที่แล้ว +3

      hi! I'll include it in my curriculum

  • @tendulkartejesh3318
    @tendulkartejesh3318 หลายเดือนก่อน

    Hello Brother. I'm going through all your Machine Learning playlist. I could find videos for Supervised Learning algorithms from Intuition, math behind it and building models from scratch. But I couldn't find the same for Unsupervised Learning algorithms brother. If there are videos regarding Unsupervised Learning algorithms from scratch. Could you please share the link here? Or else could you please do videos on Unsupervised Learning from Scratch, because that would be a great use to us bro. Could you please do videos on Unsupervised Learning too?

  • @mohamedaskar418
    @mohamedaskar418 6 หลายเดือนก่อน

    Please post the vedio about DBSCAN projects bro

  • @ayushrai654
    @ayushrai654 3 ปีที่แล้ว

    What are the shortcomings in this project? Are there any areas of improvement in this project and what are they? Can you please suggest how can we make this project even better?

    • @Siddhardhan
      @Siddhardhan  3 ปีที่แล้ว

      hi! you can do some model optimization. after that you can work on deployment.

  • @manepallisaisanthosh3728
    @manepallisaisanthosh3728 3 ปีที่แล้ว +1

    bro also upload a video on EDA and its DTALE tool

    • @Siddhardhan
      @Siddhardhan  3 ปีที่แล้ว +1

      I'll try to include that in future courses

  • @pushkarkulkarni6888
    @pushkarkulkarni6888 3 ปีที่แล้ว

    Doing a great job man! Can you suggest me a nice laptop for building complex ML and DL projects? Which laptop are you using?

    • @Siddhardhan
      @Siddhardhan  3 ปีที่แล้ว +1

      hi! thanks for your positive words! I am using dell g5. i won't recommend it for Deep Learning. you can watch this video of Siraj Raval. He has explained which laptop is best for ML: th-cam.com/video/dtFZrFKMiPI/w-d-xo.html

  • @adiityabairwa3444
    @adiityabairwa3444 3 ปีที่แล้ว

    Sir please start one series(theory + projects) of Deep learning like ML... We are waiting...

    • @Siddhardhan
      @Siddhardhan  3 ปีที่แล้ว +1

      hi! I'll make DL course separately after completing ML

  • @ReginaLawrence-wu3ov
    @ReginaLawrence-wu3ov ปีที่แล้ว

    Thanks for your video sir.Would you pls share the dataset and source code 🙏

  • @melodylyricskannada
    @melodylyricskannada 3 ปีที่แล้ว

    Can we download the clustered file with Names,Ages?? if yes can you explain

    • @Siddhardhan
      @Siddhardhan  3 ปีที่แล้ว

      I don't get you. can you please elaborate

  • @mohsinimam2048
    @mohsinimam2048 2 ปีที่แล้ว

    How to use KMeans for more than 2 features?

  • @achyuthkayala279
    @achyuthkayala279 4 หลายเดือนก่อน

    Bro what are the pros and cons that made you to do this project

  • @ieltswithtaru3411
    @ieltswithtaru3411 3 ปีที่แล้ว

    Hi, how can we visualize the clusters it via tsne?

    • @Siddhardhan
      @Siddhardhan  3 ปีที่แล้ว

      hi! you can go through this documentation: scikit-learn.org/stable/modules/generated/sklearn.manifold.TSNE.html

  • @SHASHANK-u3i
    @SHASHANK-u3i 2 ปีที่แล้ว

    how to get accuracy in unsupervised learning?

  • @Yuvakishore29
    @Yuvakishore29 6 หลายเดือนก่อน

    Sir can we keep these projects in our resume

  • @manojgattam8449
    @manojgattam8449 3 ปีที่แล้ว

    Bro do you have any report based on this project or any research Paper on this project?

    • @Siddhardhan
      @Siddhardhan  3 ปีที่แล้ว

      hi! I don't make reports for my project videos

    • @manojgattam8449
      @manojgattam8449 3 ปีที่แล้ว

      @@Siddhardhan Bro Can you send your email so that I can contact with you?

  • @abhiskype5291
    @abhiskype5291 3 ปีที่แล้ว

    Sir please tell clear explanation of spending scores ?
    Spending score means what?

    • @Siddhardhan
      @Siddhardhan  3 ปีที่แล้ว +1

      higher score means that a person is spending more money to buy more products.

  • @mahfuzkabirpulak1154
    @mahfuzkabirpulak1154 3 ปีที่แล้ว

    What are you using Pandas or Jupiter?

  • @arshdeepwalia1251
    @arshdeepwalia1251 3 ปีที่แล้ว

    Hey bro - do you teach? Ready to get classes

    • @Siddhardhan
      @Siddhardhan  3 ปีที่แล้ว

      hi! I am mainly focusing on teaching in TH-cam. at the moment, I am not giving individual tutoring.

  • @vedant3827
    @vedant3827 3 ปีที่แล้ว

    can i run the same code in ides like vs code??

    • @Siddhardhan
      @Siddhardhan  3 ปีที่แล้ว

      yes, u can

    • @vedant3827
      @vedant3827 3 ปีที่แล้ว

      @@Siddhardhan can i link a csv file from drive directly to this code with its url??

  • @cherubyGreens
    @cherubyGreens 3 ปีที่แล้ว

    Can you do a live session explaining ML projects?

    • @Siddhardhan
      @Siddhardhan  3 ปีที่แล้ว +1

      hi! I haven't thought about it. I'll definitely consider that.

  • @santanubhattacharya9865
    @santanubhattacharya9865 6 หลายเดือนก่อน

    Feature scaling karna hoga..

  • @bhargavsolanki6386
    @bhargavsolanki6386 3 ปีที่แล้ว

    Why didn't you run 10th shell?

  • @Ash-hz7nc
    @Ash-hz7nc 3 ปีที่แล้ว

    Why you are considering only the last two columns? you have to consider all the columns as gender and age play a role as well.

    • @Siddhardhan
      @Siddhardhan  3 ปีที่แล้ว

      it's because I wanted to create a 2 dimensional cluster. and annual income is a good parameter to choose from. of course, you can consider more parameters as well.

  • @ragavanrs97
    @ragavanrs97 2 ปีที่แล้ว

    what is the main object of this project

  • @pallapothubhargavramfromib2244
    @pallapothubhargavramfromib2244 3 ปีที่แล้ว

    What video can we expect on Monday

    • @Siddhardhan
      @Siddhardhan  3 ปีที่แล้ว

      Monday & Wednesday ML Course videos will be posted. this Monday, video will be on vectors & vector operations..

  • @harshchandra4384
    @harshchandra4384 ปีที่แล้ว

    But this data is labeled right?

  • @vismayabv482
    @vismayabv482 6 หลายเดือนก่อน

    Code to find accuracy for the model

  • @hhmmon5550
    @hhmmon5550 3 ปีที่แล้ว +1

    May I know your github account?

  • @hamzaarshad8952
    @hamzaarshad8952 ปีที่แล้ว

    plt.scatter(clustering_data[y_pred==0,0],clustering_data[y_pred==0,1],s=50, c='green',label="cluster 1")
    plt.scatter(clustering_data[y_pred==1,0],clustering_data[y_pred==1,1],s=50, c='red',label="cluster 2")
    plt.scatter(clustering_data[y_pred==2,0],clustering_data[y_pred==2,1],s=50, c='blue',label="cluster 3")
    this code is showing an error sir. can you fix it?
    this is the error:
    TypeError Traceback (most recent call last)
    ~\anaconda3\lib\site-packages\pandas\core\indexes\base.py in get_loc(self, key, method, tolerance)
    3628 try:
    -> 3629 return self._engine.get_loc(casted_key)
    3630 except KeyError as err:
    ~\anaconda3\lib\site-packages\pandas\_libs\index.pyx in pandas._libs.index.IndexEngine.get_loc()
    ~\anaconda3\lib\site-packages\pandas\_libs\index.pyx in pandas._libs.index.IndexEngine.get_loc()
    TypeError: '(array([False, False, False, ..., False, True, False]), 0)' is an invalid key
    During handling of the above exception, another exception occurred:
    InvalidIndexError Traceback (most recent call last)
    ~\AppData\Local\Temp\ipykernel_17080\3756939222.py in
    1 ## plotting all the clusters and their centriods:
    2 plt.figure(figsize=(12,6))
    ----> 3 plt.scatter(X[y==0,0],X[y==0,1],s=50, c='green',label="cluster 1")
    4 plt.scatter(X[y==1,0],X[y==1,1],s=50, c='red',label="cluster 2")
    5 plt.scatter(X[y==2,0],X[y==2,1],s=50, c='blue',label="cluster 3")
    ~\anaconda3\lib\site-packages\pandas\core\frame.py in __getitem__(self, key)
    3503 if self.columns.nlevels > 1:
    3504 return self._getitem_multilevel(key)
    -> 3505 indexer = self.columns.get_loc(key)
    3506 if is_integer(indexer):
    3507 indexer = [indexer]
    ~\anaconda3\lib\site-packages\pandas\core\indexes\base.py in get_loc(self, key, method, tolerance)
    3634 # InvalidIndexError. Otherwise we fall through and re-raise
    3635 # the TypeError.
    -> 3636 self._check_indexing_error(key)
    3637 raise
    3638
    ~\anaconda3\lib\site-packages\pandas\core\indexes\base.py in _check_indexing_error(self, key)
    5649 # if key is not a scalar, directly raise an error (the code below
    5650 # would convert to numpy arrays and raise later any way) - GH29926
    -> 5651 raise InvalidIndexError(key)
    5652
    5653 @cache_readonly
    InvalidIndexError: (array([False, False, False, ..., False, True, False]), 0)

  • @hammadyasir2174
    @hammadyasir2174 3 ปีที่แล้ว

    kindly upload subtitle also

    • @Siddhardhan
      @Siddhardhan  3 ปีที่แล้ว +1

      I'll include auto subtitles

  • @sandipansarkar9211
    @sandipansarkar9211 2 ปีที่แล้ว

    finished coding

  • @rajneeshkumar-o5p
    @rajneeshkumar-o5p 2 หลายเดือนก่อน

    Hindi subtitles 😅😅😅

  • @alfattahsuyadi3618
    @alfattahsuyadi3618 ปีที่แล้ว

    Thank you very much