After krish naik, yours is second best video on imputing missing value. Kindly create couple of more video on Feature engineering and Feature Selection.
I have a disputed question. As the knn imputer works on the principles same as knn algo, it does share the pros and cons of knn algo, right. So wont it be better to simply scale the data first ? Also, in case I am separating out the train and test data in order to avoid data leakage, should I split the data and then scale, impute ? Or should I impute and then split,scale it ? In case I split first...which is the most common preference which stats should I use for the user input. And lastly how should I handle the label encoded columns if any ? Nobody is discussing on this when it is one of the most imp problems a person would likely face. Can you please make a video on this ?
Thanks a ton for sharing this! A minor doubt: If two variables are linearly correlated then would it not be better to omit variable/feature with higher missing values? Other feature is not adding anything to the model and would add to number of features in the model (overfitting.)
After krish naik, yours is second best video on imputing missing value. Kindly create couple of more video on Feature engineering and Feature Selection.
use full insights ...good
Thanks @Paramita. Nicely Explained. Keep up the good work
Beautiful work here, so so helpful for me. Thank you so much!! Appreciate it
This was really insightful. You got a new subscriber
Thank you..
nice guide, easy to understand. thank you
I have a disputed question. As the knn imputer works on the principles same as knn algo, it does share the pros and cons of knn algo, right. So wont it be better to simply scale the data first ? Also, in case I am separating out the train and test data in order to avoid data leakage, should I split the data and then scale, impute ? Or should I impute and then split,scale it ? In case I split first...which is the most common preference which stats should I use for the user input. And lastly how should I handle the label encoded columns if any ? Nobody is discussing on this when it is one of the most imp problems a person would likely face. Can you please make a video on this ?
👍
Thanks.
Thanks a ton for sharing this!
A minor doubt: If two variables are linearly correlated then would it not be better to omit variable/feature with higher missing values? Other feature is not adding anything to the model and would add to number of features in the model (overfitting.)
Thanks
nice
Kindly creating missing value imputation on categorical features.
why you stop making new videos?
can you plz share the code file?
miss ap hindi me nhi bna sakhti ho video