If you found this video helpful, then hit the *_like_* button👍, and don't forget to *_subscribe_* ▶ to my channel as I upload a new Machine Learning Tutorial every week.
This is by far the best video on introduction to optimizers.
Very precise and well articulated, and it clears all the doubts.
Thanks a lot brother !
Glad it helped you 😇
Oh wow. When I started learning ML last year, your linear regression video was one of the first vids I watched. Now I'm a data scientist, and you're still uploading high-quality vids. Thank you! Hopefully we get to see LSTMs and Transformers in the future. :P Good day.
Wow... Really good to know this. Thank you for sharing your story! And yes, I will be uploading videos on LSTMs.
@@MachineLearningWithJay Wow, nice! Can I make a suggestion? Maybe in the future you can cover weight initialization, like Xavier and He initialization. Those topics tend to be ignored because the framework basically handles them for us (I'm guilty of that :P), without us knowing the reasoning behind them, e.g. the disadvantages of initializing the weights with 0.
@@arvinflores5316 Thank you for giving this suggestion. I will definitely consider making videos on these topics.
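(For anyone reading along, here is a minimal NumPy sketch of the initialization schemes mentioned in this thread. The function names and the fixed random seed are illustrative assumptions, not from the video; the zero-init function is included only to show why it is a bad idea.)

```python
import numpy as np

rng = np.random.default_rng(0)

def zeros_init(fan_in, fan_out):
    # All-zero init: every hidden neuron computes the same output and
    # receives the same gradient, so the layer never breaks symmetry --
    # this is the main disadvantage of initializing the weights with 0.
    return np.zeros((fan_in, fan_out))

def xavier_init(fan_in, fan_out):
    # Xavier / Glorot uniform init, commonly paired with tanh or sigmoid:
    # variance scaled by both fan-in and fan-out.
    limit = np.sqrt(6.0 / (fan_in + fan_out))
    return rng.uniform(-limit, limit, size=(fan_in, fan_out))

def he_init(fan_in, fan_out):
    # He normal init, commonly paired with ReLU: variance scaled by fan-in.
    return rng.normal(0.0, np.sqrt(2.0 / fan_in), size=(fan_in, fan_out))
```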
Good video! But I have just one question: where does the noise that you mentioned at 5:11 come from?
That's how the loss changes when we have a larger number of features.
Brilliant explanation... Keep it up!
Thank you!
Very nice explanation. Super!
Very good explanation. You need more views!
Amazing, deeply explained. Thanks!
Amazing helpful video!
So from what I understand,
in Mini-Batch Gradient Descent, the model will train on the 1st mini-batch and update the weights, then those updated weights will be used to train on the 2nd mini-batch and update again, then the 3rd mini-batch, ..., till the last mini-batch (1 epoch); then the weights updated on the last mini-batch will again be used on the 1st mini-batch during the 2nd epoch, and so on?
Do correct me if I'm wrong.
You are correct in your understanding, Abhinav. I would just like to correct the wording. You can say that the weights updated after training on any mini-batch are used to propagate forward, and then they are updated again in backward propagation. E.g., randomly initialise the weights at the beginning. Perform forward propagation using the 1st mini-batch, then perform backward propagation, then update the weights. Use those updated weights to perform forward propagation on the 2nd mini-batch, then backward propagation, update the weights again, and so on.
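(For anyone who wants to see that loop written out, here is a minimal NumPy sketch using linear regression with a mean-squared-error loss as an illustrative example. The function name and hyperparameters are my own assumptions, not from the video; the point is that each mini-batch's forward pass starts from the weights updated on the previous mini-batch, including across epochs.)

```python
import numpy as np

def minibatch_gradient_descent(X, y, lr=0.01, batch_size=32, epochs=10, seed=0):
    rng = np.random.default_rng(seed)
    n, d = X.shape
    w = 0.01 * rng.standard_normal(d)    # randomly initialise weights at the beginning
    b = 0.0

    for epoch in range(epochs):
        order = rng.permutation(n)        # shuffle so the mini-batches differ every epoch
        for start in range(0, n, batch_size):
            idx = order[start:start + batch_size]
            Xb, yb = X[idx], y[idx]

            y_hat = Xb @ w + b            # forward propagation on this mini-batch

            # backward propagation: gradients of the MSE loss w.r.t. w and b
            error = y_hat - yb
            grad_w = 2.0 * Xb.T @ error / len(idx)
            grad_b = 2.0 * error.mean()

            # update the weights; the next mini-batch (and the 1st mini-batch
            # of the next epoch) starts from these updated values
            w -= lr * grad_w
            b -= lr * grad_b
    return w, b
```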
@@MachineLearningWithJay Thanks :)
Very informative and precise.
Thank you!
Very good explanation! Well done!
Thank you!
Great video!
Glad it was helpful!
Amazing dude. Keep it up.
Thank you so much!!
Great job
Thank you!
You got a new sub, bro. Good video!
Great job!
Your videos are helpful, Can you suggest a good book on same...
Hi... I don't refer to any books, so I can't suggest one. But you can search for good books on ML online. I once found an article that listed the top 10 books for learning ML.
@@MachineLearningWithJay Ok! Thanks!... But do you have topics like batch normalisation and standard networks like LeNet, AlexNet, VGG, GoogLeNet in detail?
@@EEDNAGELIVINAYAKSHRINIWAS Have you read the original research papers for these? I think their research papers are the best place to learn about them.
@@MachineLearningWithJay Ok.. can you mail me at my ID? Let's discuss some projects separately.
Hi Vinayak, you can mail me at codeboosterjp@gmail.com with your query. I will see what I can do to help.
Thank you so much for your videos.
You're welcome. Glad you like them! 😇
ty
Welcome!
👌
good boy