Great work. I like how you made youtube chapters to explain independent techniques like NMS. Really useful. Many people don't have the time to go through papers in details and just run the codes to get things done. Your videos could be helpful to solve that problem. I'm personally hoping to see videos on YOLO series especially the YOLOX model :) You could also talk about the object detection models landscape and how each model has pros/cons w.r.t. inference time (FPS) and performance.
Nice, this topic deserves its own playlist. RCNN has so many component, you can make separated short video for each component, so it wont be overwhelming for the viewers.
Thanks, Muhammad. I actually want to create videos for other object detection algorithms as well and put them in a playlist. From my past experience and based on the videos I've seen, usually, long videos get more viewers. I already separated this video into different chapters and viewers can watch each one on their own time. It's a kinda subjective opinion I believe.
Very Nice Explanation Just one question they use SVM in the final steps to make prediction. Is that for the class prediction or for the Bounding Box prediction. Also how to we know that we need to predict (c + 1) classes ? Do we know beforehand what classes of objects are present in our image ?
@@soroushmehraban Oh I spoke too fast, (bc I watched some parts of the video several times, I thought you used the expression several times)... Yeah I take it back apologies, oc everyone can use this expression!
Thanks, Mohandes. I'll try enhancing the quality by changing my recording method but still it's not gonna be perfect. At least not in the first few videos.
That's a great question. I think I should have mentioned that. Our model might predict different bounding boxes pointing to the same object. In such a scenario, we do the following: 1) Sort all the predicted bounding boxes based on the class score (In descending order). 2) Pick the first bounding box that has the highest probability score. 3) Compute the IoU of the selected bounding box with other bounding boxes pointing to the same class. 4) If the IoU of any bounding box with this bounding box is larger than a threshold (such as 0.5), then we remove the bounding box having the lower class score. I hope it's clear.
@@soroushmehraban i think following conditions might not be sufficient, because even if we sort and pick highest one... again we left with question of all these are pointing to same object location or reference really in a image ? same object references might be at multiple places please clarify this doubt
That's true we might have same objects at multiple places. let's say we have object A at location (x1, y1) and (x2, y2). for location (x1, y1) our model might predict multiple bounding boxes all refer to the object A. Out of all these bounding boxes we only keep the one that has the highest score and others if they have IOU higher than a threshold with this bounding box, we remove them. For object A at place (x2, y2), since it's in different area of the image, the IoU with the one having highest score is less than a threshold, so we keep the second one having the highest threshold and again others having IoU higher than a threshold, we remove them. @@NagarajuSeru-rc7lb
Very Nice.. Thank you so much.... I have a question related to NMS... that As you explained about NMS, IOU of classified object regions will calculated over the ground truth value at the time of training and validation but what about at the time of inference ? since you have grouth truth values at time of train and validate only but not at inference. awaiting for your response.... thank you so much adavance
One of the best videos I have watched. Very detailed Explanations. Keep up the good work
Thanks 🙂
Great Work! You explained 1000 times better than my uni lecturer :D
Such a great video!! Keep them coming!
the way you organised the following content are just awesome ..
Such an underrated video. Well done mate!
Glad you enjoyed it!
bro you did actually the best video for eexpaling Rcnn
Great work.
I like how you made youtube chapters to explain independent techniques like NMS. Really useful.
Many people don't have the time to go through papers in details and just run the codes to get things done.
Your videos could be helpful to solve that problem.
I'm personally hoping to see videos on YOLO series especially the YOLOX model :)
You could also talk about the object detection models landscape and how each model has pros/cons w.r.t. inference time (FPS) and performance.
Wonderful feedback, Gota. I'll make sure to create them in the future
Very nice! I can't wait to see more videos like this!
Thanks, Jeffrey! Wait for the better ones then 😄
Very well explained . Thank you
oh my it explains everything at once! Thank you for making this video!
Simple and easy to understand! Thank you for making this video :)
Glad it was helpful!
This so easy how i can uderstand about RCNN and that is because your explanation!
thank you very much, i love your video
Glad you liked it!
Thanks for your work! It's helps me a lot! Appreciate that~
Very nicely explained with animation 💜
Nice, this topic deserves its own playlist. RCNN has so many component, you can make separated short video for each component, so it wont be overwhelming for the viewers.
Thanks, Muhammad. I actually want to create videos for other object detection algorithms as well and put them in a playlist. From my past experience and based on the videos I've seen, usually, long videos get more viewers. I already separated this video into different chapters and viewers can watch each one on their own time. It's a kinda subjective opinion I believe.
@@soroushmehraban how about Yolo?
best explanation ever!
Thanks a lot for this! It was really clean and precisely explained. mAP explanation was on point.
Glad you liked it!
This is great. Nice work!! Waiting for more such videos.
Thanks, Raghuveer! Appreciate it.
Awesome video Now I can read the paper and use the video as a guide.
Glad you liked it!
Good job Soroush, Very nice video! It helped me a lot specially to understand the mAP metric. Just Keep going :)
Glad you liked it :)
Nice video! Keep up the great work
Thank you, Bellz!
clean explanation give this man more sub !
Very Nice Explanation
Just one question they use SVM in the final steps to make prediction. Is that for the class prediction or for the Bounding Box prediction.
Also how to we know that we need to predict (c + 1) classes ? Do we know beforehand what classes of objects are present in our image ?
Well done. That was great
Thanks Aref
I really appreciate it, very good explanation. Thanks!
Cool video! Keep them coming
Thanks, Mohamed!
Very interesting! need more videos.
dude!!! that was such a nice explanation
Thanks!
Cool! Nice work💥
Cool! Nice work.
Thanks, Seokeon. I hope you find it useful.
thank you for your great explanation! keep going!
Thanks!
Great explanation
Great video. keep up the good work
Great explanation, keep doing it!
Thanks, Alexander!
Thanks very much for this, it's much clearer to me know (after starting from just the paper). (Edit : this Paper is clearly explained in every way)
Thanks for the honest feedback 😃 looking at the previous videos posted, I’m not using that phrase anymore.
@@soroushmehraban Oh I spoke too fast, (bc I watched some parts of the video several times, I thought you used the expression several times)... Yeah I take it back apologies, oc everyone can use this expression!
Great video. Good job. Request for follow up videos: Faster R-CNN, Mask R-CNN, DETR
Thanks, Yaser. I'll post them. But first I'll post Fast R-CNN
great work!
Great explanation❤
Nice job! Keep up the good work!
Thanks for the positive energy, Chayan!
Informative video!
Thanks, Kaan!
Congrats. Good work.
Thanks, Can! Appreciate it.
Nice one! Please make more
Thanks, Ishaan. Sure!
thank you so much , such an amazing video . Can i ask which tool/app you using for this slide? i love how they working
Thanks for the feedback Huy 🙂It's just a powerpoint.
literally , Clearly EXPLAINED
Very nice! Thanks a lot! May you please upload your slides, too?
what is the background music you are using in the video ?
I don't remember that was a long time ago. I'm not using any background music anymore.
bright explanation Thanks
Thanks, Alireza. I hope you found it useful.
Great Job, Can't wait to see more videos of you. Can you fix your microphone for next videos?
Thanks, Mohandes. I'll try enhancing the quality by changing my recording method but still it's not gonna be perfect. At least not in the first few videos.
Thank you so much
Keep up the good work
Thanks!
good work
Thanks, Lakshay.
Nice work
Thanks, Pouya.
Great.
Thank u
How does NMS works in inference? As we won't be having ground truth
That's a great question. I think I should have mentioned that. Our model might predict different bounding boxes pointing to the same object. In such a scenario, we do the following:
1) Sort all the predicted bounding boxes based on the class score (In descending order).
2) Pick the first bounding box that has the highest probability score.
3) Compute the IoU of the selected bounding box with other bounding boxes pointing to the same class.
4) If the IoU of any bounding box with this bounding box is larger than a threshold (such as 0.5), then we remove the bounding box having the lower class score.
I hope it's clear.
@@soroushmehraban i think following conditions might not be sufficient, because even if we sort and pick highest one... again we left with question of all these are pointing to same object location or reference really in a image ? same object references might be at multiple places
please clarify this doubt
That's true we might have same objects at multiple places. let's say we have object A at location (x1, y1) and (x2, y2). for location (x1, y1) our model might predict multiple bounding boxes all refer to the object A. Out of all these bounding boxes we only keep the one that has the highest score and others if they have IOU higher than a threshold with this bounding box, we remove them. For object A at place (x2, y2), since it's in different area of the image, the IoU with the one having highest score is less than a threshold, so we keep the second one having the highest threshold and again others having IoU higher than a threshold, we remove them. @@NagarajuSeru-rc7lb
nice one thx
Very Nice.. Thank you so much....
I have a question related to NMS... that
As you explained about NMS, IOU of classified object regions will calculated over the ground truth value at the time of training and validation but what about at the time of inference ? since you have grouth truth values at time of train and validate only but not at inference.
awaiting for your response.... thank you so much adavance
great
Thanks, Louis.
Hoppe Lights
Thompson Spur
Collier Ramp
McCullough Manors
Halvorson Crest
659 Esmeralda Passage
843 Cyrus Pass
3195 Marty Squares
Kara Road
Bauch Estates
Stokes Rue
Wallace Land
Fudge, you copy other's work
4463 Leannon Lodge
Nais work man, keep this up, I wanna see moo 🤌❤️
Thanks, man! I'll try my best.