Project 2: Diabetes Prediction using Machine Learning with Python | End To End Python ML Project
- Published on 26 Sep 2024
- Hi! I will be conducting one-on-one discussions with all channel members. Check out the perks and join the membership if interested: / @siddhardhan Check membership perks: / @siddhardhan
In this video, we build a system that predicts whether a person has diabetes, with the help of Machine Learning. The project is done in Python and uses a Support Vector Machine model for the prediction.
All presentation files for the Machine Learning course as PDF for as low as ₹200 (INR): Drop a mail to siddhardhans2317@gmail.com
Enroll at One Neuron to learn from 100 courses in one subscription with 5% discount: courses.ineuro...
Hi guys! I am Siddhardhan. I work in the field of Data Science and Machine Learning. It all started with my curiosity to learn about Artificial Intelligence and the ability of AI to solve several Real Life Problems. I worked on several Machine Learning & Deep Learning projects involving Computer Vision.
I am on this journey to empower as many students & working professionals as possible with the knowledge of Machine Learning and Artificial Intelligence.
Hello everyone! I am setting up a donation campaign for my YouTube channel. If you like my videos and wish to support me financially, you can donate through the following means:
From India 👉 UPI ID : siddhardhselvam2317@oksbi
Outside of India? 👉 Paypal id: siddhardhselvam2317@gmail.com
(No donation is small. Every penny counts)
Thanks in advance!
Let's build a Community of Machine Learning experts! Kindly Subscribe here👉 tinyurl.com/md...
I am making a "Hands-on Machine Learning Course with Python" on YouTube. I'll be posting 3 videos per week: 2 videos on Machine Learning basics (Monday & Wednesday evening) and 1 video on a Machine Learning project (Friday evening).
Dataset File link: www.dropbox.co...
Colab File Link: colab.research...
Download the Course Curriculum File from here: drive.google.c...
LinkedIn: / siddhardhan-s-741652207
Telegram Group: t.me/siddhardhan
Facebook group: www.facebook.c...
Machine Learning Project
Python Project
Diabetes Prediction using Machine Learning with Python
Supervised Learning
Supervised Learning Algorithm
Deep Learning
artificial intelligence
#machinelearning #deeplearning #python #pythonproject Instagram: / siddhardhan23
Hi! You can join this Telegram group for regular updates about my videos: t.me/siddhardhan
Thank you!
Hi Siddhardhan, I have few doubts in machine learning. I would like to connect with you to clarify those.
Not able to join the group
Sir, can you please upload videos on the random forest algorithm and the naive Bayes algorithm?
Not working
bro u teaching style is amazing,
I am stuck on a problem: whenever I try to deploy my project by entering the location of the file in the terminal, I get this error:
'c:\users\dell\desktop\multi_disease' is not recognized as an internal or external command,
operable program or batch file.
plz help me as soon as possible
I usually don't comment like this in public. But I honestly feel this type of works must be appreciated! Thank you so much!
thanks a lot for your positive words 😇
Finally, someone cared to explain to me like a 5 year old. Thanks brother
I am a postgraduate student in China and I was struggling to find a project to do on ML. Now, thanks to you my brother, I did some good works. Thank you 🙏
Your course is most underrated course in ML
I am lucky i found your channel
This was my first project with Machine Learning and i it was way more than i was expecting, thank you so much
I am new to machine learning. Which IDE should we write the code in?
I am from Pakistan, I have learned a lot from Indian tutorials, you guys are doing great. I am in love with India and really want to visit or work there in tech companies
It's not enough to tell thanks for you sir, your videos are very helpful for me, you will explain every bit of coding, just because of your videos, i am learning ML projects. Thank you so much sir🙏🙏. keep uploading more ML projects sir.
You're most welcome 😇
Well done brother, even experienced professionals can't details like you .. keep it up.
Thank you so much brother 😇
@Siddhardhan @munna I agree with you; I was also struggling with project ideas, and I had more confusion in ML. The way he explained it, hats off, and thank you for teaching and making us watch more videos. This is my first public comment.
me
Much much much better neat and detailed explanation of every detail. Far better from my university teachers.
Hi Sir,
Thank you for your tutorial.
Can you please share how you decided with finalizing SVM as the most suited algorithm for building the ML model?
Thank you
this channel is a gold mine sir, these videos are really helping us to apply the ML algorithms
Glad to hear it!😇
@Siddhardhan can you suggest a good project which I can use for my semester project?
@deltechdiaries5907 can I use this project itself?
@kamathprajna I did the same 🙂
@deltechdiaries5907 Did you do this same project? Can we do this one?
Pls, You should cover data visualization in the project because it helps that which algo is suitable for particular dataset.
Am 9yrs in IT and feeling of starting to love coding
That's great! You will do great things definitely.
Thank you very much Sir. This is a complete workflow to understand ML for beginner.
damn man! this was recommended by my bro! & this is full of worth it.
I can't thank you enough
You have done a great help🙏
Hi @Siddhardhan
I was searching for an end-to-end project in machine learning and just landed on your YouTube channel.
This is my second video, and I feel you should also add feature selection, scaling, feature engineering and outlier handling so that we can learn the complete process of model creation, because right now I personally feel we are just creating the train and test sets and building our models.
hi! I'll make end to end projects once i complete ML & DL concepts.
@Siddhardhan ohhh great, thank you
Its amazing thanks alot ❤
Your expression is excellent
Thank you and congratulations because you teach very well
Please continue strong
Hats off sir! Great explanation❤
bro you worked hard ...you deserve this
Thank you very much Bro you have helped me to build my first project for my Internship
very nice explanation thankyou so much
The file diabetes.csv is not in the description ... pls add it
How can we improve the accuracy of our model here?
I have literally binged-watch your series on ML! I'm going through the project videos one by one. If you already have a video that deals with improving the accuracy, please point me to it. Thank you so much!
I looove your content!
hi! thanks for your positive words 😇 we can optimise our model to get better performance. those topics will be covered in upcoming modules. kindly stay tuned.
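As a preview of the kind of optimisation mentioned in this reply, here is a minimal sketch of one common approach: a cross-validated grid search over SVM settings. It is not taken from the video. The data is synthetic (`make_classification` stands in for diabetes.csv), and the parameter grid, `test_size` and `random_state` values are illustrative assumptions.

```python
from sklearn.datasets import make_classification
from sklearn.model_selection import GridSearchCV, train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC

# Synthetic stand-in for the diabetes data: 768 rows, 8 numeric features.
X, y = make_classification(
    n_samples=768, n_features=8, n_informative=5, random_state=2
)

X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, stratify=y, random_state=2
)

# Scaling lives inside the pipeline, so it is refit on each CV training fold.
pipeline = make_pipeline(StandardScaler(), SVC())

# Hypothetical grid; widen or narrow it for your own data.
param_grid = {
    "svc__kernel": ["linear", "rbf"],
    "svc__C": [0.1, 1, 10],
}

search = GridSearchCV(pipeline, param_grid, cv=5, scoring="accuracy")
search.fit(X_train, y_train)

print("Best parameters:", search.best_params_)
print("Cross-validated accuracy:", round(search.best_score_, 3))
print("Test accuracy:", round(search.score(X_test, y_test), 3))
```

The test set is only scored once, after the search has picked its settings, so the reported test accuracy is not used to choose the model.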
Appropriate , structure wise step by step learning approach is best way
Thanks a lot! really an amazing video👍👍👍👍....need more videos from you to enrich the ML skills
I really loved and understood the model very quickly thanky you for making such videos😍😍🤗
Hello, what if I have categorical features in my model? How can I add those kinds of variables in Streamlit?
finished practicing coding .Feeling confident
Hi sir, good evening. Your video is good, but I would be very grateful if you explained it with some data visualization using Matplotlib and seaborn. I hope you understand and do the same for the next videos.
very underrated thanks a lot very very useful
Really great work.From Kenya thank you just began today and was amazing
Did you get correct prediction values?
I get wrong predictions but no errors. When I try different input data, it always shows that the person is diabetic, even when I enter non-diabetic input data.
How can I correct this?
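One possible cause of this symptom (not confirmed for this commenter's code) is passing raw input values to a model that was trained on standardized features. The sketch below uses a single made-up "glucose-like" feature and a made-up threshold of 130; both are assumptions for illustration only.

```python
import numpy as np
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC

rng = np.random.default_rng(0)

# Toy data: one "glucose-like" feature; label 1 when the value is above 130.
X = rng.normal(loc=120, scale=30, size=(200, 1))
y = (X[:, 0] > 130).astype(int)

scaler = StandardScaler()
X_scaled = scaler.fit_transform(X)

model = SVC(kernel="linear")
model.fit(X_scaled, y)

new_input = np.asarray([[100.0]])  # clearly below the threshold

# Wrong: raw values look enormous to a model trained on standardized data.
raw_prediction = model.predict(new_input)[0]

# Right: reuse the fitted scaler (transform, not fit_transform) before predicting.
scaled_prediction = model.predict(scaler.transform(new_input))[0]

print("Without scaling:", raw_prediction)
print("With scaling:", scaled_prediction)
```

If the predictive system in your notebook skips the `scaler.transform` step, or calls `fit_transform` on the single new row, checking that line first is a reasonable starting point.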
So thanks, dear my teacher, it was so useful for me ❤
Hi Siddhardhan, after analysing the dataset, I found that a few features need imputation.
My analysis:
- 11 entries have a BMI of 0, which is not realistic
- similarly, 35 women have a Blood Pressure entry of 0
- 5 women have a Glucose entry of 0
- 374 women have an Insulin entry of 0 and, surprisingly, 236 of them are non-diabetic and only 138 are diabetic.
How should we impute the insulin value, which is crucial in predicting whether a person is diabetic or not?
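The zero counts in this analysis, and one simple way to impute them, can be sketched as below. This is a minimal example under assumptions: the six rows are made up, the column names (`Glucose`, `BloodPressure`, `Insulin`, `BMI`, `Outcome`) are assumed to match the CSV, and median imputation is just one simple option, not the method used in the video.

```python
import numpy as np
import pandas as pd

# Tiny hypothetical sample; the real file has 768 rows.
df = pd.DataFrame({
    "Glucose": [148, 85, 0, 89, 137, 116],
    "BloodPressure": [72, 66, 64, 0, 40, 74],
    "Insulin": [0, 0, 0, 94, 168, 0],
    "BMI": [33.6, 26.6, 23.3, 28.1, 0.0, 25.6],
    "Outcome": [1, 0, 1, 0, 1, 0],
})

columns_with_invalid_zeros = ["Glucose", "BloodPressure", "Insulin", "BMI"]

# Count the zeros per column, as in the analysis above.
zero_counts = (df[columns_with_invalid_zeros] == 0).sum()
print(zero_counts)

# Treat zeros as missing, then fill each column with its median.
imputed = df.copy()
imputed[columns_with_invalid_zeros] = (
    imputed[columns_with_invalid_zeros].replace(0, np.nan)
)
medians = imputed[columns_with_invalid_zeros].median()
imputed[columns_with_invalid_zeros] = (
    imputed[columns_with_invalid_zeros].fillna(medians)
)

print(imputed)
```

With roughly half of the Insulin values missing, a single median is a crude fill. In a real workflow the medians would also be computed on the training rows only and then reused for the test rows.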
I am very thanks towards the you and I will now be going to Internships of your projects because. MIT is also on the search for me👀👀👀👀👀👀
hare krishna prabhu ji ....
i like the way u make videos ..
thanku for this video...
Thank you so much sir I've seen so many videos but your video is perfect
Can you please tell me how it is going to take input in the real world, i.e. how it will take the details of an actual patient and make a prediction? We can't just hand over our code and say "put in the inputs according to the column names"; that doesn't make sense. Please explain.
Exactly how many modules are used in this project ? Can i get names?
Explanation is very simple and understandable brother
It was a very helpful video. But I think Standardization should be done after splitting the data into training and test sets.
Yes, that's right. I made the change in the other videos
@Siddhardhan okay, thanks
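The order discussed in this thread (split first, then standardize) can be sketched as follows. The data is randomly generated as a stand-in for the diabetes dataset, and the `test_size` and `random_state` values are illustrative assumptions.

```python
import numpy as np
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import StandardScaler

rng = np.random.default_rng(1)

# Synthetic stand-in: 768 rows, 8 features on very different scales.
X = rng.normal(
    loc=[3, 120, 70, 20, 80, 32, 0.5, 33],
    scale=[3, 30, 12, 10, 100, 7, 0.3, 11],
    size=(768, 8),
)
y = rng.integers(0, 2, size=768)

# 1. Split first.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, stratify=y, random_state=2
)

# 2. Learn the mean and standard deviation from the training rows only.
scaler = StandardScaler()
X_train_scaled = scaler.fit_transform(X_train)

# 3. Apply the same statistics to the test rows (transform only, no fit).
X_test_scaled = scaler.transform(X_test)

print(X_train_scaled.mean(axis=0).round(3))
print(X_test_scaled.mean(axis=0).round(3))
```

The training means come out as zero by construction. The test means are close to zero but not exactly zero, which is expected because the test rows did not contribute to the fitted statistics.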
loved it man!!! keep up the work
thank you so much for the help, couldn't do this by myself. thanks!!!
Well done bro, even experienced professionals can't explain like you .. keep going like this only
Thank you so much 😀
Tysm Proff. for crystal clear explanation
you're most welcome😇
Great Bro. Wonderful Channel on Machine Learning.
Why can't we use a linear regression model for predicting diabetes?
Great video, thank you for that. My question: when the dataset is messy and you do some data manipulation, like changing "yes/no" to "1/0", then when you predict a value with your code (the input_data_as_numpy_array=np.asarray(input_data) part),
are you going to use the data with "yes" in it, or the converted version?
thank you
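In general, a model only ever sees the converted values, so new inputs need the same conversion that was applied to the training data. A minimal sketch, assuming a hypothetical `smoker` column and a hypothetical `YES_NO` mapping (neither appears in the video):

```python
import pandas as pd

# Hypothetical raw data with a yes/no column.
df = pd.DataFrame({
    "smoker": ["yes", "no", "no", "yes"],
    "age": [45, 31, 52, 38],
})

# The same mapping is used for the training data and for new inputs.
YES_NO = {"yes": 1, "no": 0}
df["smoker"] = df["smoker"].map(YES_NO)

# A new record arrives in its original, human-readable form...
raw_input = {"smoker": "yes", "age": 40}

# ...and is converted with the same mapping before it reaches the model.
model_input = [YES_NO[raw_input["smoker"]], raw_input["age"]]
print(model_input)
```

Keeping the mapping in one named object makes it harder for the training code and the prediction code to drift apart.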
This isn't actually a prediction problem rather it is a classification problem. Good work though :)
Thanks a lot for your great effort you explain it very clearly .
Thanks sir for this project. It was much useful☺️
It is usually recommended to do the train test split before feature scaling. Why did you do feature scaling first?
Thanks a lot for this tutorial..Just one suggestion, there are way too MANY ads in all of your playlist, it is really annoying, can we please reduce them? Thanks.
Hi, I tried your code and it gives same accuracy as like your result but input results are always coming Zero. Why?
Hi Siddhardhan, Thank you very much for the tutorial. In the Rock vs Mine project, you mentioned that the data should be closer together for better predictions. However, for this video, there are 500 records for non diabetes and 268 for diabetes. The data is not close right?
Thank you wish you all the best
The Way You Explain Is Amazing...
I am confused about data standardization. When I run the same code, it gives a NameError. How do I fix it?
Sir can you explain why pregnancies=6 , 7 and so on in data set , I think data should be like during pregnancy can you please explain me this sir please
very clear explanation thank you,please add a video about hybrid machine learning algorithms (like random forest with xgboost or adaboost etc) for predicting diabetes
Hi @Siddhardhan thanks for assisting budding ML engineers with your videos. I had a query regarding standard scaler. Why didn't we use it during Sonar Rock vs Mine project and used it here. How can we identify in which cases we should use standard scaler.
Thanks again
For the Standard Scaler, we used it here because the values being fed into the model differed widely in range: we had values from 0.08 all the way to 846, which is not helpful to the model. So we used the standard scaler to bring every feature to a comparable scale (mean 0 and standard deviation 1), which makes things easier for the model. In the Sonar Rock vs Mine project, all the values were already in a similar range, so we didn't have to do that.
@siddhardhan Thanks for such good videos..
Very well explained, thank you so much for making this video 🙏
My pleasure 😊
Can i know what way u learnt the machine learning? How you started with machine learning .Any suggestions for beginners.
Hi danda umasai! It all started with curiosity. Lot of things that could be achieved by Artificial Intelligence surprised me. And I started learning little by little... Then I started loving it... I couldn't think of a particular resource for learning... I used several books, online lectures, etc. Later I wanted to share what I have learnt with lots of people who are interested in the field of AI. As a beginner, you just need that curiosity to learn and you can find your path... There are several YouTube channels like edureka which you can follow...
thank you so much for your greatest work
Hey bro but feature scaling should always be done after doing the train test split.
Hi Siddhardhan, thank you for a comprehensive project tutorial.
At 24:30, the StandardScaler() has not limited the values to [0,1]. Clearly some values exceed 1, and there are some negative values as well. Could you please explain?
I googled and got to know that the resized distribution should have the mean=0 and standard deviation=1.
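The difference this comment describes can be checked directly. The sketch below compares `StandardScaler` (mean 0, standard deviation 1, no fixed bounds) with `MinMaxScaler` (values mapped into [0, 1]) on five made-up values; the numbers are illustrative and not taken from the dataset.

```python
import numpy as np
from sklearn.preprocessing import MinMaxScaler, StandardScaler

# One illustrative column with a wide range, including an extreme value.
values = np.array([[0.0], [15.0], [94.0], [168.0], [846.0]])

standardized = StandardScaler().fit_transform(values)
min_max = MinMaxScaler().fit_transform(values)

print("Standardized:", standardized.ravel().round(2))
print("  mean:", standardized.mean().round(6), "std:", standardized.std().round(6))
print("Min-max scaled:", min_max.ravel().round(2))
```

Negative values and values above 1 are therefore normal after standardization; only min-max scaling guarantees the [0, 1] range.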
I like what you are doing would love to know colab usage maybe a little simple tutorial of sort
Thanks
Already posted a video on Colab basics. Do check out
Hi sir, thanks for the great effort. Can you please explain which model (linear regression, logistic regression, SVM or random forest) should be chosen, and by what criteria, for a given dataset? I am confused about which model to choose.
Excuse me sir, in this case the data is imbalanced?
or what conditions do you need to have to know that the data is imbalanced?
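A quick way to answer this for any dataset is to count the rows per class. The sketch below uses the class counts quoted elsewhere in these comments (500 non-diabetic, 268 diabetic); the features are placeholders, and the `test_size` and `random_state` values are assumptions.

```python
import numpy as np
from sklearn.model_selection import train_test_split

# Class counts quoted in the comments: 500 non-diabetic, 268 diabetic.
y = np.array([0] * 500 + [1] * 268)
X = np.arange(len(y)).reshape(-1, 1)  # placeholder features

counts = np.bincount(y)
positive_share = counts[1] / counts.sum()
print("Counts per class:", counts)
print("Share of diabetic rows:", round(positive_share, 3))

# stratify=y keeps roughly the same class ratio in both splits.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, stratify=y, random_state=2
)
print("Train share:", round(y_train.mean(), 3))
print("Test share:", round(y_test.mean(), 3))
```

About 35% of the rows are diabetic, which is usually described as a moderate imbalance. If it turns out to hurt the minority class, `SVC(class_weight="balanced")` is one option to try alongside a stratified split.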
Great content!
One main thing u didn't explain about confusion matrix rather than that all is good
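For readers looking for that missing piece, a confusion matrix splits predictions into correct and incorrect ones per class. A minimal sketch with made-up labels (the `y_true` and `y_pred` lists are hypothetical, not model output from the video):

```python
from sklearn.metrics import accuracy_score, confusion_matrix

# Hypothetical labels: 1 = diabetic, 0 = non-diabetic.
y_true = [0, 0, 0, 0, 1, 1, 1, 0, 1, 0]
y_pred = [0, 0, 1, 0, 1, 0, 1, 0, 1, 0]

# Rows are true classes, columns are predicted classes.
matrix = confusion_matrix(y_true, y_pred)
tn, fp, fn, tp = matrix.ravel()
accuracy = accuracy_score(y_true, y_pred)

print(matrix)
print("True negatives:", tn, "False positives:", fp)
print("False negatives:", fn, "True positives:", tp)
print("Accuracy:", accuracy)
```

In a medical setting the false negatives (diabetic patients predicted as non-diabetic) are often the number to watch, which a single accuracy figure hides.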
Thank you very much!
Just now i subscribed u r channel
Clear explanation 👌
Thank you sir... Very well explained..
You are welcome
How do we implement various algorithms?
Great videos, I am new in ML and I have followed the steps but when i input the data I always get 0 and never get 1. I dont know where I can check to make it correct pls help
Bro, why did you use only SVM? What about other algorithms? How can we use a Decision Tree? Please explain how you choose the algorithm.
Very well explained
I have a doubt: how could I develop this project for real-time use?
Hello sir, after the predictive system, how do we sort the results of the prediction system into right and wrong, since the model does not predict with 100% accuracy?
I have one question, sir, you are doing feature scaling before the train test split. This will lead to data leakage or overfitting type of problems. So I think it will be more accurate to do feature scaling after the train test split. can you tell, am I right or wrong?
Hi Gaurav! You are absolutely right! Thanks for pointing it out. It is always good practice to do standardization after the train test split. In this case, the dataset we are using is a clean dataset, i.e. there are not many outliers, so it won't make much difference. If we had several outliers, it would be a problem, which leads to the data leakage you pointed out.
I have already recorded the next project video, and in that one I followed the same procedure. This is the way I generally do it unless the model is overtrained.
But from next week's video onwards, I'll make sure that I change the procedure. Thanks for pointing it out.
@Siddhardhan thank you for your response and you are doing great work 👍
Thanks for your appreciation 😇
I couldn't understand this. It seems clearly visible that this should be a regression problem, so why did you use SVC?
could you please let me know why are we using SVM for diabetes prediction and Logistic regression for sonar rock vs mine prediction as both are binary classification problem
Why did you decide to use Support Vector Machine and not something else? How do you decide what to use?
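One common, data-driven way to approach this question is to compare a few candidate models with cross-validation and keep the one that scores best. The sketch below is an assumption-laden illustration, not the author's selection process: the data is synthetic, and the three candidates and their settings are arbitrary choices.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC

# Synthetic stand-in for a binary classification dataset with 8 features.
X, y = make_classification(
    n_samples=768, n_features=8, n_informative=5, random_state=2
)

# Scale-sensitive models get a scaler in front; the forest does not need one.
candidates = {
    "Logistic Regression": make_pipeline(
        StandardScaler(), LogisticRegression(max_iter=1000)
    ),
    "SVM (linear kernel)": make_pipeline(StandardScaler(), SVC(kernel="linear")),
    "Random Forest": RandomForestClassifier(n_estimators=100, random_state=2),
}

mean_scores = {}
for name, model in candidates.items():
    scores = cross_val_score(model, X, y, cv=5, scoring="accuracy")
    mean_scores[name] = scores.mean()
    print(f"{name}: {scores.mean():.3f} (+/- {scores.std():.3f})")
```

On a different dataset the ranking can easily change, which is why the comparison is rerun per project rather than settled once.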
Why has SVM been imported and not any other model?
I copied your code, but the 2nd line of code shows an import error.
What can I do?
Hi! You need to install the required libraries that are imported, using pip install.
Why does some of the values from the dataset when inserted gives incorrect answer? Can you please tell. for example when i inserted 8,125,96,0,0,0,0.232,54 into the model it says the person is not diabetic but according to dataset it is 1, which means diabetic.
Well, the accuracy didn't say 100% right, meaning it is prone to error. Not all the predictions will be correct.
At the very end, the program is predicting everything as zero which is not correct. Idk how to fix it
Amazing !
THANK YOU VERY MUCH SIR
In the train test split part, it gives me an error that train_test_split is not defined, even though I have imported it properly. Can anyone please help?
Comprehensive explanation. However, why did you use a linear kernel? Is the data linearly separable, or did you just pick it as a default?
Just as a default option. It would have been too much to process for a viewer as this was one of the initial videos.
Ok. Thank you for the reply
Hello, I am a beginner in data science. I watched this video till the end. Maybe my question seems silly, but why build a predictive model for something that is already obvious in the dataset? For what purpose do we build this model? Thanks!
Hello! Thank you for this awesome and helpful example. Is there a video with the improvement of accuracy for our model?
Thank you for sharing
Thank you sir
you're most welcome 😇
How can we implement a user interface for it?