"🔥Caltech Post Graduate Program In AI And Machine Learning - www.simplilearn.com/artificial-intelligence-masters-program-training-course?M4uJ6XGnSM&Comments&TH-cam 🔥IITK - Professional Certificate Course in Generative AI and Machine Learning (India Only) - www.simplilearn.com/iitk-professional-certificate-course-ai-machine-learning?M4uJ6XGnSM&Comments&TH-cam 🔥Purdue - Post Graduate Program in AI and Machine Learning - www.simplilearn.com/pgp-ai-machine-learning-certification-training-course?M4uJ6XGnSM&Comments&TH-cam 🔥IITG - Professional Certificate Program in Generative AI and Machine Learning (India Only) - www.simplilearn.com/iitg-generative-ai-machine-learning-program?M4uJ6XGnSM&Comments&TH-cam 🔥Caltech - AI & Machine Learning Bootcamp (US Only) - www.simplilearn.com/ai-machine-learning-bootcamp?M4uJ6XGnSM&Comments&TH-cam"
Hi, Thanks for great explanation. I have a small doubt. when you split test train in Ln [8] and in ln [9] we get how much data we have in training and testing- i get it. but when I do it in my same example- each time number of training and testing data gets different. why is it so? sometimes training data comes 120 and testing 30, sometimes 118, 32 or sometimes something else. why is it so?
WooHoo! We are so happy you love our videos. Please do keep checking back in. We put up new videos every week on all your favorite topics. Whenever you have the time, you must also check out our blog page @simplilearn.com and tell us what you think. Have a good day!
WooHoo! We are so happy you love our videos. Please do keep checking back in. We put up new videos every week on all your favorite topics. Whenever you have the time, you must also check out our blog page @www.simplilearn.com and tell us what you think. Have a good day!
Thank you for the appreciation. You can check our videos related to various technologies and subscribe to our channel to stay updated with all the trending technologies.
At 16:38 , on what basis is the prediction from Tree 2 cherries. If I see the inputs, the first split Color is not Red, so the condition yields false and thus the prediction is still orange.
I think it is a bit strange as well. First tree: Color(Orange) True, means red = false Second Tree: Color(Red) True, means orange = false That doesn't seem right to me, that it just guesses the color both times instead of sticking with one and using it through all the decision trees.
@@Medhusalem if we assume that it "chooses" randomly a color for each tree, then it makes sense. He said that they are good working with missing data, so is it possible that adding this randomness in the missing value a way to get the right prediction?
We are glad you found our video helpful, Santhosh. Like and share our video with your peers and also do not forget to subscribe to our channel for not missing video updates. You can also explore our playlist for more Machine learning videos - th-cam.com/video/7JhjINPwfYQ/w-d-xo.html.
We're so glad that you enjoyed your time learning with us! If you're interested in continuing your education and developing new skills, take a look at our course offerings in the description box. We're confident that you'll find something that piques your interest!
This video is really well done in that the teaching quality is good and the instructor understands the level of beginners by explaining everything clearly and simply
Hi Kyuhwan, thank you for appreciating our work. We are glad to have helped. Do check out our other tutorial videos and subscribe to us to stay connected. Cheers :)
Hey Filip, thank you for appreciating our work. We are glad to have helped. Do check out our other tutorial videos and subscribe to us to stay connected. Cheers :)
16:28 Why does it mark the (black fruit) as orange? I mean the data is missing? Does it pick this one Decision randomly? => If it would pick red, the whole example would not work, right?
Great explanation. I have a question (1) At 15:40, how do we get split decision "Grows in summer"? This category variable is not available in dataset na?
Dear simplilearn team here you put the best video to explain what Algorithms really are... But in LMS SELF PACED VIDEOS not so detailed explanation... Look into that and improve yourself
Thank you for letting us know know about this. Your feedback helps us get better. We are looking into this issue and hope to resolve it promptly and accurately.
Hi, initially random forest concept will using fruits concept. But in IRIS flower example it should show how random forest is working with example and diagram first. It would help to understand easily.
25:50 I have a doubt on splitting data into Test and train. Here we are not splitting exactly into 75% and 25% of data. Here we split on random percentage of data. Why don't we use "train_test_split" from "sklearn.model_selection", where we can split the data into desired amount of test and train ? Thanks alot for the video.
You have explained it very well but I have a question, why does the decision in 16:38 became cherries and yet the given parameters for its training set is given that the color of the unknown fruit is orange? thank you! I also need the answer because I will present this topic in our analytics class. thank you and more power! :D
So, initially when the example begins narrator tells us that we do not know the color of the object, which is the missing data itself, so the decsion tree cannot figure out what color it is having and istead goes to the second branch of both but the branch on right has no further branches but the branch on the left goes to the next decesion and gives us the result cherries. I, hope this helps.
Although the colour for the unknown fruit is specified in the block containing data, for this example we assume that the colour is unknown. This is also mentioned in the audio. Therefore, our second decision tree makes the first split based on colour and arbitrarily says the fruit is red.
I have a doubt with the Random Forest being able to cope with missing values. In many other places I have heard that you must replace any null values for models to work. I tested an example on another dataset with null values and got this error, "ValueError: Input contains NaN, infinity or a value too large for dtype('float32'). " . Please could you expand on this. Excellent Video - thanks :-)
Hey Rishi, thank you for appreciating our work. We are glad to have helped. Do check out our other tutorial videos and subscribe to us to stay connected. Cheers :)
Hey, thank you for appreciating our work. We are glad to have helped. Do check out our other tutorial videos and subscribe to us to stay connected. Cheers :)
Hello, thanks for viewing our tutorial. It would be helpful if you will provide your email ID to us so that we can send the requested dataset promptly. On the off chance that you need your email ID to be kept hidden from others, we can do that too. Hope that helps.
We're so glad that you enjoyed your time learning with us! If you're interested in continuing your education and developing new skills, take a look at our course offerings in the description box. We're confident that you'll find something that piques your interest!
How does tree 1 decide the colour of the fruit is orange if the colour of the fruit is unknown? Do random forests consider all possible outcomes and take the majority of those? Thanks x
Hey Rafa, thank you for appreciating our work. We are glad to have helped. Do check out our other tutorial videos and subscribe to us to stay connected. Cheers :)
First of all, the tree will ignore the missing data, since color unknown, it COULD BE true for the fruit to be apple or cherry. And then, with Circle, it COULD Be cherry. Trees tell what COULD Be true in according with the existing information.
We appreciate your effort on sharing your knowledge. Do show your love by subscribing our channel using this link: th-cam.com/users/Simplilearn and don't forget to hit the like button as well. Cheers!
Is it possible to predict a set of numbers that will output from a random number generator, finding the algorithm, in order to duplicate the same pattern of results?
I have a question about converting the species name into digits (0,1,2): what if we don't do the conversion? Can the classifier still do the prediction based on the species names(string)?
Hello, we are so happy to receive this wonderful compliment. Like and share our video with your peers and also do not forget to subscribe to our channel for not missing video updates. We will be coming up with more such videos. Cheers!
Hey! can you explain, me why didn't we split tree on the basis of color at the root node instead of using diameter and then color in the example of where in the basket there were three fruits Apple, lemon and grapes. three of them had a different color so we could have split them on the basis of color and we have got accurate results. And there wouldn't have been any need to use diameter. Can you please clear this doubt of mine. Also, Can Iris flower data set be modeled using Support Vector Machine? If yes which model is better the random forest or Support Vector Machine
For the random forest, shouldn't the same fruit bowls/datasets have the same classification trees? That is, shouldn't the same fruit bowl split the same way to maximize information gain/GINI index? In random forests, doesn't the machine aggregate decision trees built from different datasets?
Random forest creates multiple decision trees from a particular data set. Of course, each tree is formed considering a different section of the data set. Since different sections of the dataset are used to construct each classification tree, the fruit bowl will be split in different ways. random forest algorithm takes all the trees into consideration to generate the most accurate result.
Hi Thanks for this wonderful lecture but I have a query, won't a decision tree will always try to make a root node and following nodes in a manner where entropy is least? And I believe yes, then does it select root nodes at random and then follows an IG algorithm like ID3? How much 'Randomness' is there when Decision Tree decides which node will be root node, considering we have hundreds of nodes.
Thank you for this video. I have a practical work to do regarding my studies. The goal is to code a program with python concerning the image classification using Random Forest technique. Can you explain to me how to modify your code to use it on the pixels of images ? (we will test it on the famous image of Lena), and this is for the two phases: learning and evaluation according to the evaluation criteria of Levine and Nazif (Inter-region) Thank you in advance.
Hi Vashist, thanks for checking out our tutorial. You are indeed right. There are multiple ways to split the data and using sklearn's inbuilt function is surely one of them. Hope that helps!
The tutorial is a pandas tutorial after 17:55 and before 17:55 it teaches the basics only. Would be better if you could provide some more details like how a tree is built or updated.
Hi Nesat, thank you for watching our video and for the honest feedback. We will definitely look into this. Do subscribe, like and share to stay connected with us. Cheers :)
thank you for the tutorial, i have been subscribed to your channel for around a year now and i love the content, can you please send me the dataset for all the videos in this playlist that use Python.Thank you
Hello Harsimranjeet, thanks for viewing our tutorial and we hope it is helpful. It would be helpful if you will provide your email ID to us so that we could send the requested dataset promptly.
Great tutorial .....Great Tutor and well explained...I have subscribed this tutorial and I assure you that I have been learning so many things about algorithms in ML in the previous videos.......I really love this tutorial. I really appreciate also your kind help whenever I request for the datasets .......I wanna one clarification on the "load_iris" is this the in-built function (or library)...?
Hi Amilcar, thanks for subscribing to our channel and joining our community. We have shared the required dataset to your mail ID. Stay tuned for the updates!
The iris dataset is present within the sklearn library as it's one of the most commonly used one. So yes, load_iris is an inbuilt method that loads the iris dataset.
Hi, I run the same code for practicing but the prediction results are different, does anybody have any idea of why is this? Maybe due to changes in the packages versions? I get "setosa, setosa" instead of "versicolor, versicolor" in block "Out[36]"
Thankyou for the video . Can you explain why is that it has high accuracy .. is it because of bagging approach only or are there any other reasons behind it.
It is predominantly the bagging approach. The fact that the random forest algorithm works on different parts of the dataset also plays a role in providing better accuracy.
Hello, thanks for viewing our tutorial. It would be helpful if you will provide your email ID to us so that we can send the requested dataset promptly. On the off chance that you need your email ID to be kept hidden from others, we can do that too. Hope that helps.
Hi thank you. a wonderful tutorial. I have 9 features (unknown) and target. I want to predict if the customers will sign up or not. Do you think random forest can be applied here?
Hi Kritchayan, we don't have random forest video in the part of regression. However, we have Random forest video made separately in both Python and R language. If you are interested, check the below links: Random Forest in Python: th-cam.com/video/eM4uJ6XGnSM/w-d-xo.html Random Forest in R: th-cam.com/video/HeTT73WxKIc/w-d-xo.html
Hello Aisha, thanks for viewing our tutorial and we hope it is helpful. It would be helpful if you will provide your email ID to us so that we could send the requested dataset promptly.
Hey i am doing traffic prediction and feature of matrix has days and weather condition in it can i apply random forest algorithm over it and also want to know that do i have to convert all days into 0-7 kindly reply soon
"Hi Syed, We would suggest not to opt from random forest to solve this particular problem since that features are very less. So, to split the data at a particular node would be different."
Hi, can you please tell me why you've taken the training data set to be 75% of the total set? How can we find out the optimum value of the training set?
"Hi Mrugendra, There is no predefined rule or an optimum value as to how much you should assign for training and testing. Ideally, the dataset is divided into (70, 75 or 80%) for training the model and (30, 25 or 20%) to test and validate the model. "
Hello Rahul, thanks for viewing our tutorial. It would be helpful if you will provide your email ID to us so that we could send the requested dataset promptly. On the off chance that you need your email ID to be kept hidden from others, we can do that also. Hope that helps.
Great Video, thank you! Off topic question: As a non-native Englisch speaker I am wondering if the way you pronounce mEAsuring is a certain dialect or the actual correct pronounciation.
Thanks Peter, we are glad you found this content useful. That is his accent :) We have come up with new videos on Machine Learning, do check it out here: th-cam.com/play/PLEiEAq2VkUULYYgj13YHUWmRePqiu8Ddy.html Happy learning from Simplilearn team!
Hello Arif, thanks for viewing our tutorial and we hope it is helpful. It would be helpful if you will provide your email ID to us so that we could send the requested dataset promptly.
I have seen everyone use clf as the variable name for instantiating the random forest classifier. What is the abbreviation of CLF?? Just out of curiosity.
Hi Anjith, thanks for watching our video. CLF just stands for "classifier". Hope that clarifies your curiosity. Do support us by subscribing to our channel using this link: th-cam.com/users/Simplilearn.
Great explanation. Is the python code available for download anywhere? Are random forests a good choice for binary classifiers? Or are there other algorithms that do a better job?
Hello Stephen, thanks for viewing our tutorial and we hope it is helpful. It would be helpful if you will provide your email ID to us so that we could send the requested dataset promptly.
Hello Wong, thanks for watching our tutorial. It would be helpful if you will provide your email ID to us so that we could send the requested dataset promptly. Cheers!
"4:52 "training time is less" - Less than what? Why at 16:18 the black object (Outline of an orange) is classified as "color orange?" equals true? Shouldn't it be classified as False because clearly it is NOT orange (black) and would be classified as False. You don't give good explanation for this. Is it so that if the data is missing you pick randomly one of the options and this time it happened to be "True"?
The problem is while explaining Random Forest he used all the predictors whereas the decision trees choose the set of variables randomly. So I guess he could have demonstrated with more variables and used different sets in different decision trees
Hello, thanks for viewing our tutorial. It would be helpful if you will provide your email ID to us so that we can send the requested dataset promptly. On the off chance that you need your email ID to be kept hidden from others, we can do that too. Hope that helps.
"🔥Caltech Post Graduate Program In AI And Machine Learning - www.simplilearn.com/artificial-intelligence-masters-program-training-course?M4uJ6XGnSM&Comments&TH-cam
🔥IITK - Professional Certificate Course in Generative AI and Machine Learning (India Only) - www.simplilearn.com/iitk-professional-certificate-course-ai-machine-learning?M4uJ6XGnSM&Comments&TH-cam
🔥Purdue - Post Graduate Program in AI and Machine Learning - www.simplilearn.com/pgp-ai-machine-learning-certification-training-course?M4uJ6XGnSM&Comments&TH-cam
🔥IITG - Professional Certificate Program in Generative AI and Machine Learning (India Only) - www.simplilearn.com/iitg-generative-ai-machine-learning-program?M4uJ6XGnSM&Comments&TH-cam
🔥Caltech - AI & Machine Learning Bootcamp (US Only) - www.simplilearn.com/ai-machine-learning-bootcamp?M4uJ6XGnSM&Comments&TH-cam"
We hope this video was useful. The link for the dataset used in the video is provided in the description. Thanks!
Hi,
Thanks for great explanation. I have a small doubt. when you split test train in Ln [8] and in ln [9] we get how much data we have in training and testing- i get it. but when I do it in my same example- each time number of training and testing data gets different. why is it so? sometimes training data comes 120 and testing 30, sometimes 118, 32 or sometimes something else. why is it so?
Can you send me the Jupyter notebook file of code??
Wow, the amount of effort to create these slides for teaching the material is obviously very high. Simply amazing :).
WooHoo! We are so happy you love our videos. Please do keep checking back in. We put up new videos every week on all your favorite topics. Whenever you have the time, you must also check out our blog page @simplilearn.com and tell us what you think. Have a good day!
This channel has one of the best machine learning videos available on the internet
WooHoo! We are so happy you love our videos. Please do keep checking back in. We put up new videos every week on all your favorite topics. Whenever you have the time, you must also check out our blog page @www.simplilearn.com and tell us what you think. Have a good day!
Sure, I can attest to this.
Thanks for your love and support!
never had any tutorial/lecture explaining so well, so simply yet so detailed; thank you so so so much !
Thank you for the appreciation. You can check our videos related to various technologies and subscribe to our channel to stay updated with all the trending technologies.
Amazing tutorial and best explanation ever with the fruits. Also I love how clearly you explain the code
Glad it was helpful!
At 16:38 , on what basis is the prediction from Tree 2 cherries. If I see the inputs, the first split Color is not Red, so the condition yields false and thus the prediction is still orange.
I think it is a bit strange as well.
First tree: Color(Orange) True, means red = false
Second Tree: Color(Red) True, means orange = false
That doesn't seem right to me, that it just guesses the color both times instead of sticking with one and using it through all the decision trees.
@@Medhusalem if we assume that it "chooses" randomly a color for each tree, then it makes sense. He said that they are good working with missing data, so is it possible that adding this randomness in the missing value a way to get the right prediction?
31:07 instead of pd.factorize(train['species'])[0]; we could also use "hot encoding" right?
you are excellent in explaining the full process and code step to step. GREAT JOB.
Glad you enjoyed our video! We have a ton more videos like this on our channel. We hope you will join our community!
Very clear description of Random Forest technique and the codes
You guys explain the concepts really well!!!
We are glad you found our video helpful, Santhosh. Like and share our video with your peers and also do not forget to subscribe to our channel for not missing video updates. You can also explore our playlist for more Machine learning videos - th-cam.com/video/7JhjINPwfYQ/w-d-xo.html.
amazing explanation , so simply and detailed , thank you so much sir
We're so glad that you enjoyed your time learning with us! If you're interested in continuing your education and developing new skills, take a look at our course offerings in the description box. We're confident that you'll find something that piques your interest!
This video is really well done in that the teaching quality is good and the instructor understands the level of beginners by explaining everything clearly and simply
Hi Kyuhwan, thank you for appreciating our work. We are glad to have helped. Do check out our other tutorial videos and subscribe to us to stay connected. Cheers :)
You are a great lecturer, thank you for explanation!
Hey Filip, thank you for appreciating our work. We are glad to have helped. Do check out our other tutorial videos and subscribe to us to stay connected. Cheers :)
Awesome tutorial by simplilearn. Thank you so much!
16:28 Why does it mark the (black fruit) as orange? I mean the data is missing? Does it pick this one Decision randomly? => If it would pick red, the whole example would not work, right?
Amazing explanation 👌
Hope you enjoyed our video! We have a ton more videos like this on our channel. We hope you will join our community!
So great explanation. Thank you!
Hope you enjoyed our video! We have a ton more videos like this on our channel. We hope you will join our community!
Great skill with explaining everything in simple words!
Hey, just awesome video ! Concept were explained clearly
Glad you liked it!
Awesome work done by u🔥
Thank you so much 😀
Amazing way of explanation...
Glad you liked it
Great explanation. I have a question (1) At 15:40, how do we get split decision "Grows in summer"? This category variable is not available in dataset na?
Hi Balajee, we assume this factor is present only for the sake of understanding. Thanks.
Thank you so much m. I’ve learnt alot from you
You are so welcome
Beautfiully explained. Thanks!
Glad it was helpful!
Nice explanation!
Glad it was helpful!
I am not python person but no doubt your explanation of concept is simply awesome
Glad you enjoyed our video! We have a ton more videos like this on our channel. We hope you will join our community!
Thanks, it helps me a lot!
Glad it helped!
Appreciated , really i enjoy learning with you , keep going :) :)
Glad you enjoyed our video! We have a ton more videos like this on our channel. We hope you will join our community!
Dear simplilearn team here you put the best video to explain what Algorithms really are... But in LMS SELF PACED VIDEOS not so detailed explanation... Look into that and improve yourself
Thank you for letting us know know about this. Your feedback helps us get better. We are looking into this issue and hope to resolve it promptly and accurately.
Very impressive, thank you
Glad you liked it!
Hi, initially random forest concept will using fruits concept. But in IRIS flower example it should show how random forest is working with example and diagram first. It would help to understand easily.
Convention....True on Left 😊
Nice explanation thanks!!
Glad it was helpful!
25:50 I have a doubt on splitting data into Test and train. Here we are not splitting exactly into 75% and 25% of data.
Here we split on random percentage of data.
Why don't we use "train_test_split" from "sklearn.model_selection", where we can split the data into desired amount of test and train ?
Thanks alot for the video.
You got still no answer?
You have explained it very well but I have a question, why does the decision in 16:38 became cherries and yet the given parameters for its training set is given that the color of the unknown fruit is orange? thank you! I also need the answer because I will present this topic in our analytics class. thank you and more power! :D
I guess whenever the decision split is about color, it will automatically goes to true branch, since there is no color information in the inital input
So, initially when the example begins narrator tells us that we do not know the color of the object, which is the missing data itself, so the decsion tree cannot figure out what color it is having and istead goes to the second branch of both but the branch on right has no further branches but the branch on the left goes to the next decesion and gives us the result cherries. I, hope this helps.
Although the colour for the unknown fruit is specified in the block containing data, for this example we assume that the colour is unknown. This is also mentioned in the audio. Therefore, our second decision tree makes the first split based on colour and arbitrarily says the fruit is red.
thank you , very well explained . found this very helpful .
Glad it was helpful!
Nice explanation 👌
Thank you 🙂
I have a doubt with the Random Forest being able to cope with missing values. In many other places I have heard that you must replace any null values for models to work. I tested an example on another dataset with null values and got this error, "ValueError: Input contains NaN, infinity or a value too large for dtype('float32'). " . Please could you expand on this.
Excellent Video - thanks :-)
if your data set is large then simply drop NAN rows
Thanks for your input!
Nan values cannot be compared with float32 type values. This is why it's important to remove all Nan values.
Can't we use train_test_split to train the model instead of all the steps in the prep?
Many Thanks. Nicely explained.
Hey Jackson, thank you for watching our video. We are glad that you liked our video. Do subscribe and stay connected with us. Cheers :)
A great tutorial to get an understanding of what random forest is. Great work and Thanks :)
Hey Rishi, thank you for appreciating our work. We are glad to have helped. Do check out our other tutorial videos and subscribe to us to stay connected. Cheers :)
In 17:07 First decision tree showing its color is orange that's true. then in the second tree why it is showing color=red is also true?
The best explanation. Thanks for sharing.
Hey, thank you for appreciating our work. We are glad to have helped. Do check out our other tutorial videos and subscribe to us to stay connected. Cheers :)
Great teacher
Thank you! 😃
I have done Decision Tree before. Can I just change the classifier to Random Forest? Or I need to follow this one?
"Hi ,
You can leverage your decision tree, update the parameters and change it into a Random Forest Classifier."
A very great tutorial indeed. I understood the explanation so well. Could I pease have the dataset and code for this tutorial?
Hello, thanks for viewing our tutorial. It would be helpful if you will provide your email ID to us so that we can send the requested dataset promptly. On the off chance that you need your email ID to be kept hidden from others, we can do that too. Hope that helps.
well explained, sir
We're so glad that you enjoyed your time learning with us! If you're interested in continuing your education and developing new skills, take a look at our course offerings in the description box. We're confident that you'll find something that piques your interest!
How does tree 1 decide the colour of the fruit is orange if the colour of the fruit is unknown? Do random forests consider all possible outcomes and take the majority of those? Thanks x
Welll.......Explained 👌👌👌
Hey Lalit, thank you for watching our video. We are glad that you liked our video. Do subscribe and stay connected with us. Cheers :)
You guys are the bomb! Thanks!
Hey Rafa, thank you for appreciating our work. We are glad to have helped. Do check out our other tutorial videos and subscribe to us to stay connected. Cheers :)
Hi, can i ask at 31:27 when you execute clf.fit(train[features],y) what happens if Number of labels=______ does not match number of samples=_____?
what if my data is already numerical what is the step to implement instead of factorizing?
well explained!!
Thanks a lot. Do subscribe to our channel and stay tuned.
Hello Sir!!! Can you please tell me,how did we figure out the unknown fruit as cherry at 16:37
First of all, the tree will ignore the missing data, since color unknown, it COULD BE true for the fruit to be apple or cherry. And then, with Circle, it COULD Be cherry. Trees tell what COULD Be true in according with the existing information.
We appreciate your effort on sharing your knowledge. Do show your love by subscribing our channel using this link: th-cam.com/users/Simplilearn and don't forget to hit the like button as well. Cheers!
Is it possible to predict a set of numbers that will output from a random number generator, finding the algorithm, in order to duplicate the same pattern of results?
I have a question about converting the species name into digits (0,1,2): what if we don't do the conversion? Can the classifier still do the prediction based on the species names(string)?
No, all of these models, operate on numbers. you must convert them into their numerical representation
Thanks for your input!
@@SimplilearnOfficial No, Thank 'YOU' for being such a great Channel. I Enjoyed extremely well.
Keep up the great work
Hello, we are so happy to receive this wonderful compliment. Like and share our video with your peers and also do not forget to subscribe to our channel for not missing video updates. We will be coming up with more such videos. Cheers!
Hey! can you explain, me why didn't we split tree on the basis of color at the root node instead of using diameter and then color in the example of where in the basket there were three fruits Apple, lemon and grapes. three of them had a different color so we could have split them on the basis of color and we have got accurate results. And there wouldn't have been any need to use diameter. Can you please clear this doubt of mine. Also, Can Iris flower data set be modeled using Support Vector Machine? If yes which model is better the random forest or Support Vector Machine
For the random forest, shouldn't the same fruit bowls/datasets have the same classification trees? That is, shouldn't the same fruit bowl split the same way to maximize information gain/GINI index? In random forests, doesn't the machine aggregate decision trees built from different datasets?
Random forest creates multiple decision trees from a particular data set. Of course, each tree is formed considering a different section of the data set. Since different sections of the dataset are used to construct each classification tree, the fruit bowl will be split in different ways. random forest algorithm takes all the trees into consideration to generate the most accurate result.
👍 Awesome, thanks for this! 😊 💗 🙌
Hey Holly, thank you for watching our video. We are glad that you liked our video. Do subscribe and stay connected with us. Cheers :)
Hi
Thanks for this wonderful lecture but I have a query, won't a decision tree will always try to make a root node and following nodes in a manner where entropy is least? And I believe yes, then does it select root nodes at random and then follows an IG algorithm like ID3? How much 'Randomness' is there when Decision Tree decides which node will be root node, considering we have hundreds of nodes.
Great video and explanations are top, but I can't run the code at 27:43, what is the problem if i may ask?
Hi Lethabo, thanks for appreciating our work. We have forwarded your query to our team. Be assured, your queries will be addressed.
Try to Separate the code from ## train , test to ....... ##
train = df[df['is_train']==True]
test = df[df['is_train']==False]
hope it helps
We appreciate your help! Keep engaging with our channel and stay tuned for more. Cheers!
Thank you for this video.
I have a practical work to do regarding my studies.
The goal is to code a program with python concerning the image classification using Random Forest technique.
Can you explain to me how to modify your code to use it on the pixels of images ?
(we will test it on the famous image of Lena), and this is for the two phases: learning and evaluation according to the evaluation criteria of Levine and Nazif (Inter-region)
Thank you in advance.
Glad you enjoyed
could we use split function for train and testing set
so if the data is missing . Is the result TRUE always?
well i think its depend on accuracy of the model
Why can't you use the in inbuilt method of sklearn to split the data 8n training and test datasets
Hi Vashist, thanks for checking out our tutorial. You are indeed right. There are multiple ways to split the data and using sklearn's inbuilt function is surely one of them. Hope that helps!
thanks a lot
You are most welcome
The tutorial is a pandas tutorial after 17:55 and before 17:55 it teaches the basics only. Would be better if you could provide some more details like how a tree is built or updated.
Hi Nesat, thank you for watching our video and for the honest feedback. We will definitely look into this. Do subscribe, like and share to stay connected with us. Cheers :)
Excellent
Hey James, thank you for watching our video. We are glad that you liked our video. Do subscribe and stay connected with us. Cheers :)
I really liked your slides :p :p
Hi Ibrahim, we appreciate the kind comment! enjoy!
Great video thank you
Hey Cory, thank you for watching our video. We are glad that you liked our video. Do subscribe and stay connected with us. Cheers :)
thank you for the tutorial, i have been subscribed to your channel for around a year now and i love the content, can you please send me the dataset for all the videos in this playlist that use Python.Thank you
Hello Harsimranjeet, thanks for viewing our tutorial and we hope it is helpful. It would be helpful if you will provide your email ID to us so that we could send the requested dataset promptly.
@@SimplilearnOfficial its harsimranjeet1996@gmail.com
Thks sir
Very welcome!
why train_test_split is not used in this method? is there any specific reason
Great tutorial .....Great Tutor and well explained...I have subscribed
this tutorial and I assure you that I have been learning so many things
about algorithms in ML in the previous videos.......I really love this
tutorial. I really appreciate also your kind help whenever I request for
the datasets .......I wanna one clarification on the "load_iris" is this the in-built function (or library)...?
Hi Amilcar, thanks for subscribing to our channel and joining our community. We have shared the required dataset to your mail ID. Stay tuned for the updates!
@@SimplilearnOfficial many thanks. Got it.
Very welcome!
The iris dataset is present within the sklearn library as it's one of the most commonly used one. So yes, load_iris is an inbuilt method that loads the iris dataset.
@@SimplilearnOfficial hello..great video..please send the python code and the file...
Hi, I run the same code for practicing but the prediction results are different, does anybody have any idea of why is this?
Maybe due to changes in the packages versions?
I get "setosa, setosa" instead of "versicolor, versicolor" in block "Out[36]"
Thankyou for the video .
Can you explain why is that it has high accuracy .. is it because of bagging approach only or are there any other reasons behind it.
It is predominantly the bagging approach. The fact that the random forest algorithm works on different parts of the dataset also plays a role in providing better accuracy.
Terimakasih. Thank you!
You are very welcome!
is the jupyter notebook available online ?
Hello, thanks for viewing our tutorial. It would be helpful if you will provide your email ID to us so that we can send the requested dataset promptly. On the off chance that you need your email ID to be kept hidden from others, we can do that too. Hope that helps.
Nice video!
Greetings! Thank you for your kind words. Spread the word by liking, sharing and subscribing to our channel! Cheers :)
Hi thank you. a wonderful tutorial. I have 9 features (unknown) and target. I want to predict if the customers will sign up or not. Do you think random forest can be applied here?
Try different model thn check which one give your desired output
Do you have the random forest video in the part of the regression? Thanks.
Hi Kritchayan, we don't have random forest video in the part of regression. However, we have Random forest video made separately in both Python and R language. If you are interested, check the below links:
Random Forest in Python: th-cam.com/video/eM4uJ6XGnSM/w-d-xo.html
Random Forest in R: th-cam.com/video/HeTT73WxKIc/w-d-xo.html
Can we have access to the notebook please?
Hello, thanks for viewing our tutorial. You can find your requested dataset in the video description. Hope that helps.
Thank you Simplilearn team for the clear explanation. Can you please provide the dataset and the python notebook used in the video?
Hello Aisha, thanks for viewing our tutorial and we hope it is helpful. It would be helpful if you will provide your email ID to us so that we could send the requested dataset promptly.
perfect sir
Thank you!
Can you show the overfitting and underfitting with python code
Hey i am doing traffic prediction and feature of matrix has days and weather condition in it can i apply random forest algorithm over it and also want to know that do i have to convert all days into 0-7 kindly reply soon
"Hi Syed,
We would suggest not to opt from random forest to solve this particular problem since that features are very less. So, to split the data at a particular node would be different."
Hi, can you please tell me why you've taken the training data set to be 75% of the total set? How can we find out the optimum value of the training set?
"Hi Mrugendra,
There is no predefined rule or an optimum value as to how much you should assign for training and testing. Ideally, the dataset is divided into (70, 75 or 80%) for training the model and (30, 25 or 20%) to test and validate the model. "
@@SimplilearnOfficial Thank you! Great Work !!! Keep it up
From where can i get the data sets used in all the videos from simplilearn?
Fast help would be highly appriciated?
Hello Rahul, thanks for viewing our tutorial. It would be helpful if you will provide your email ID to us so that we could send the requested dataset promptly. On the off chance that you need your email ID to be kept hidden from others, we can do that also. Hope that helps.
how do you measure entropy? is there a formula?
"Hi Suzie,
Please check the following link to learn more on how to calculate the entropy
www.math.unipd.it/~aiolli/corsi/0708/IR/Lez12.pdf"
@@SimplilearnOfficial thanks
Great Video, thank you! Off topic question: As a non-native Englisch speaker I am wondering if the way you pronounce mEAsuring is a certain dialect or the actual correct pronounciation.
Peter Presonic it’s just his accent. Normal pronunciation is “meh”, not “may”.
Thanks Peter, we are glad you found this content useful. That is his accent :)
We have come up with new videos on Machine Learning, do check it out here: th-cam.com/play/PLEiEAq2VkUULYYgj13YHUWmRePqiu8Ddy.html
Happy learning from Simplilearn team!
very good explanation sir. will u share the code and dataset please
Hello Arif, thanks for viewing our tutorial and we hope it is helpful. It would be helpful if you will provide your email ID to us so that we could send the requested dataset promptly.
I have seen everyone use clf as the variable name for instantiating the random forest classifier. What is the abbreviation of CLF?? Just out of curiosity.
Hi Anjith, thanks for watching our video. CLF just stands for "classifier". Hope that clarifies your curiosity. Do support us by subscribing to our channel using this link: th-cam.com/users/Simplilearn.
Very good
Thank you for watching!
awesome video
Hey Mandela, thank you for watching our video. We are glad that you liked our video. Do subscribe and stay connected with us. Cheers :)
Great explanation. Is the python code available for download anywhere? Are random forests a good choice for binary classifiers? Or are there other algorithms that do a better job?
Hello Stephen, thanks for viewing our tutorial and we hope it is helpful. It would be helpful if you will provide your email ID to us so that we could send the requested dataset promptly.
Great Video,thank you and please share the dataset
Hi, we have shared the dataset to your mail ID. Happy Learning!
Can you please send me the dataset as well? Thank you.
Hello Wong, thanks for watching our tutorial. It would be helpful if you will provide your email ID to us so that we could send the requested dataset promptly. Cheers!
"4:52 "training time is less" - Less than what?
Why at 16:18 the black object (Outline of an orange) is classified as "color orange?" equals true? Shouldn't it be classified as False because clearly it is NOT orange (black) and would be classified as False. You don't give good explanation for this. Is it so that if the data is missing you pick randomly one of the options and this time it happened to be "True"?
The problem is while explaining Random Forest he used all the predictors whereas the decision trees choose the set of variables randomly. So I guess he could have demonstrated with more variables and used different sets in different decision trees
Thank you! It was amazing with lots of information. Can I get access to the python code, please?
Hello, thanks for viewing our tutorial. It would be helpful if you will provide your email ID to us so that we can send the requested dataset promptly. On the off chance that you need your email ID to be kept hidden from others, we can do that too. Hope that helps.
@@SimplilearnOfficial sachinrdoddamani@gmail.com
how did the 3rd tree figure out the color was orange? If it didn't know that, how was it able to classify the object as an orange??