What is Back Propagation
ฝัง
- เผยแพร่เมื่อ 10 ก.พ. 2025
- Learn about watsonx→ ibm.biz/BdyEjK
Neural networks are great for predictive modeling - everything from stock trends to language translations. But what if the answer is wrong, how do they “learn” to do better? Martin Keen explains that during a process called backward propagation, the generated output is compared to the expected output, and then the error contributed by each neuron (or “node”) is examined. By adjusting the node’s weights and biases, error is reduced and thus the overall accuracy improved.
Get started for free on IBM Cloud → ibm.biz/sign-u...
Subscribe to see more videos like this in the future → ibm.biz/subscri...
Back propagation to neural networks is what negative feedback is to closed loop systems. The understanding come pretty much naturally to the people which studied automation and control engineering.
However - many articles tend to mix thing up. In this case back propagation and gradient descent. Back propagation is the process of passing the error back through the layers and using it to recalculate the weights. Gradient descent is the algorithm used for recalculation. There are other algorithms for recalculation of the weights.
thanks for getting me through my exams cause the script from my professor helps literally nothing in understanding deep learning. Cheers mate
Remember the chain rule that you learned in high school.Well that’s what is used in Backpropogation
Brilliantly simplified explanation for a fairly complex topic. Thanks, Martin!
marvin u mean, smh
Thanks for this Great explanation MARK 😃
Beautifully explained backpropagation🌸
Incredible, Martin example made me understand the concept exactly. Your real life examples are great alongside entertaining.
Thank you for explaining this. I'm reading “Architects of Intelligence” and that's the first time I’ve heard the term backpropagation. Your examples and drawings help me better understand the topic.
Thanks Marlon.
I can't get enough of you brilliant videos. Thank you for making what it seemed to me before as complicated easy to understand . Could you please post a video about loss functions and gradient decent?
What did he even taught actually?
Thank you Mariin😃
Hi Marklin!
Thank you for such a great explanation.
(btw, I know your name is Martin. 😂 )
The gradient is passed backward using the chain rule from calculus. The gradient is just a multivariable form of the derivative. It is an actual numerical quantity for each "atomic" part of the network; usually a neuron's weights and bias.
Very Good Video …. Thank you very much ❤
Lovely man..... excellent videos..all complexities eliminated. thanks a lot 😊
Thank you so much Morlin! Great video
Great work, so easy to understand
Awesome thank you Marvin.
5:36 Actually, convergence is does not necessarily mean the network is able to do its task reliably. It just means that its reliability has reached a plateau. We hope that the plateau is high, i.e. that the network does a good job of predicting the right outputs. For many applications, NNs are currently able to reach a good level of performance. But in general, what is optimal is not always very good. For example, a network with just 1 layer of 2 nodes is not going to be successful at handwriting recognition, even if its model converges.
Hi, you seem to have good knowledge on this, can I ask you a question please. Do you know if neural networks will be good for recognizing handwritten math expressions? (digits, operators, variables, all elements seperated to be recognized individually). I need a program that would do that and I tried a neural network, it is good for images from dataset but terrible for stuff from outside the dataset. Would you have any tips? I would be really greatful
Excellent session ..thank you !!
Thanks Mardnin!
This would really benefit from a (simple) worked example
Awesome.....awesome superb explanation sir
thanks Marvin!
In IBM we trust ✊😔
A great video!
Very well explained 🎉
Off topic: what drawing board setup do these IBM videos use ?
it's really great.
I'd like to know, too.
I find it ,he is drawing on the Glass
cheers Marvin
thanks marvin
Great video! 😊
Wait ... the video is online for four hours ... and still no question how he manages to write mirrored?
There’s a separate video for that
Ha, that's so true. Here you go: ibm.biz/write-backwards
Just flip the video horizontally
At 2:20 when he was talking about biases, does it have any relation with Hyperparameter or regularization unit?
probably would have been good to describe that this is supervised learning as this would nit translate well for a beginner trying to apply this to other form of NNs
Thank you!
When doing the Loss Function hove is the "Correct" output given? Is it training data and the compared an other data file with desired outcomes? In the example of "Martin" how does the neural network get to know that your name was not Mark?
Thank you for the information.
Could you please tell if the the BP is only available and applicable for Supervised models, as we have to have a pre computed result to compare against!!
Certainly, unsupervised models could also use this theoretically but does / could it effect in a positive way?
Additionally how the comparison actually performed?
Especially for the information that can't be quantised !
thank youu
Isn't Back Propagation used to lower the computation needed to adjust the weights? I understand that doing so in a "forward" fashion is much more expensive than in a "backward" fashion?
Save yourself 8 minutes.. It's a FEEDBACK loop - FFS !
why you didn't explain how the network updates the weight
ANY CHANCE TO GIVE 1000 LIKES ???😩
Thanks still I need to understand how technically does it happen
It’s done by calculating the derivatives of the y hats with respect to the weights, and the function done backwards in the network applying the chain rule of calculus
Try to use more unexplained terminology to explain the terminology you try to explain is the source of confusion. 😂 Thumb down.
A neural network cannot be connected by weights, this is nonsense. It can be connected by synapses, that is, by resistances. The way the network learns is incredibly tricky: not only does it have to remember the correct result, which is not easy in itself, but it has to continue to remember the correct result while remembering a new correct result. This is what distinguishes a neural network from a fishing net.