Introduction into YOLO v3

  • Published on 3 Jan 2025

Comments • 76

  • @abdshomad
    @abdshomad 3 years ago +22

    ~ Timeline for watching again later ~
    00:01 Intro
    01:17 What is YOLO?
    03:13 Architecture of YOLO v3
    05:28 Input
    07:27 Detections at 3 Scales
    09:28 Detection Kernels
    12:02 Grid Cells
    14:23 Anchor Boxes
    18:25 Predicted Bounding Boxes
    21:41 Objectness Score
    Conclusion

  • @bakervhaigaming9746
    @bakervhaigaming9746 3 years ago +13

    I regret not finding this gem earlier! I had to go through 5-6 papers and hours of reading to understand these topics, but your video made it very clear and specific. Please make more quality content like this. Thanks a lot.

    • @valentynsichkar
      @valentynsichkar 3 years ago +1

      Thank you for the feedback! Will do!

  • @azmyin
    @azmyin 3 years ago +2

    This is one of the simplest and most articulate explanations of YOLOv3. Thank you very much for this video and please keep up the good work.

    • @valentynsichkar
      @valentynsichkar 3 years ago +1

      Thank you for the feedback! Will do!

  • @m5a1stuart83
    @m5a1stuart83 3 years ago +2

    I was coding YOLOv3 by following an Indian YouTuber, and now here I am learning the true nature of YOLO. It helps a lot with this OCR project, where I can ignore images that were not intended to be uploaded to the server.

  • @sumitbali9194
    @sumitbali9194 3 years ago +1

    I have seen a lot of videos on CNNs, mostly crap. But your video is a gem. I appreciate the effort you have put into making this video. The diagrams are a great help in understanding the architecture. Thanks again.

  • @adithyanarayan9976
    @adithyanarayan9976 3 years ago +1

    Spent multiple hours trying to read through various papers in order to understand some of the topics. Should've stumbled upon your channel and the video much earlier. Love the fact that everything is explained to the point. You've earned yourself a subscriber in me. Can't stress this enough, but please put out more videos like these, along the lines of Computer Vision. Well done mate and once again, THANK YOU SO MUCH!

    • @valentynsichkar
      @valentynsichkar 3 years ago +1

      Thank you for the feedback! Will do!

  • @mitultandon5227
    @mitultandon5227 4 years ago +3

    one of the best explanations of YOLO!

  • @dp.9130
    @dp.9130 4 years ago +3

    Great video!
    We need more detailed explanation videos like this; every other video I've watched repeats the same few lines of explanation of YOLO that can be found all over the internet.

  • @hima-220
    @hima-220 4 years ago +2

    This video really contains the details of yolov3! It helps me a lot! Thx!

  • @krishhhhh1717
    @krishhhhh1717 4 years ago +2

    This is one of the best I have seen. Thank you.

  • @iProFIFA
    @iProFIFA 3 years ago +9

    Legitimately the clearest video I could find on this topic, amazing! Thanks a lot and keep up the great work, Valentyn! :-)

    • @valentynsichkar
      @valentynsichkar 3 years ago +1

      Thank you for the feedback! Will do!

  • @mtmotoki2
    @mtmotoki2 4 years ago +1

    It is explained with a lot of diagrams, so even though I am not very good at English, I was able to understand it. Thank you

  • @hoangvancuong4868
    @hoangvancuong4868 4 years ago +3

    Thanks for the detailed and easy-to-understand video. I love it.

  • @naufalramadhani9166
    @naufalramadhani9166 4 years ago +3

    Thank you for the thorough explanation, sir, much appreciated. Keep it this way, it is great. Cheers, sir.

  • @Can-ue7de
    @Can-ue7de 2 years ago +1

    Amazing Explanation of Yolo v3. Thank you very much.

  • @saptarshidattaaot
    @saptarshidattaaot 4 years ago +2

    Thanks for the great explanation!!

  • @simonbernard4216
    @simonbernard4216 4 years ago +2

    You should do another video for YOLOv4

  • @pascalschluchter209
    @pascalschluchter209 3 years ago +2

    Hey, can someone explain to me why the detection happens at layers 82, 94 and 106? Is there any mathematical background, or is it a fixed parameter of YOLOv3?

  • @hanglethithu2873
    @hanglethithu2873 4 years ago +3

    Great. Thank you, it helps me a lot!

  • @syafiqbasri8789
    @syafiqbasri8789 3 years ago +3

    Thank you so much, sir. It's a very useful and great explanation!

  • @shannondoyle5143
    @shannondoyle5143 3 years ago +2

    Really great, detailed explanation. I don't get exactly how the ground truth values are determined for grid cells close to the centre grid cell of an object. Would you be able to explain this?

  • @ozne_2358
    @ozne_2358 4 years ago +2

    Great tutorial, thanks !

  • @Alpha-hj2ss
    @Alpha-hj2ss 3 years ago +2

    Great video! Can you please come out with more videos?

  • @neerajruhela92
    @neerajruhela92 4 years ago +2

    Nice explanation!! Thank you

  • @sachinbharadwajm2120
    @sachinbharadwajm2120 3 years ago +1

    great explanation & presentation!!!

  • @mmshafique8491
    @mmshafique8491 4 years ago +1

    hats off sir. thank you very much for such a nice briefing.

  • @yasminalothmani4445
    @yasminalothmani4445 4 years ago +2

    perfect explanation thanks

  • @shubhanubanerjee2098
    @shubhanubanerjee2098 4 years ago +3

    Thank you, very helpful. Can you make a series on deep learning, please?

    • @valentynsichkar
      @valentynsichkar 4 years ago +1

      Thanks for the feedback! For sure, will do!

  • @fujiawang4326
    @fujiawang4326 4 years ago +3

    very well explained

  • @rishabhgupta734
    @rishabhgupta734 2 years ago +3

    One question: are ground truth bounding boxes and anchor boxes used interchangeably here?

  • @sameershaik7250
    @sameershaik7250 3 years ago +6

    Explained very well.... great

  • @aasishkc1799
    @aasishkc1799 4 years ago +3

    Well explained 👍

  • @kristopherhuber3356
    @kristopherhuber3356 4 years ago +1

    I enjoyed your video. Thank you for putting in the effort. Could you comment on the receptive field of YOLOv3? For example, if I put in an image of shape=(416,416,3), then, as you said, YOLOv3 decimates by 32 to produce an output feature map at layer 82 of shape=(13,13,255). This is shown quite clearly in your video (15:50 mark). My question is: what is the receptive field for the first cell in the output feature map (i.e. the top-left cell, of shape=(1,1,255))? To ask another way, what portion of the original 416,416,3 image is mapped to the 1,1,255 feature cell?
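A minimal sketch of the arithmetic behind the shapes quoted above, assuming the standard YOLOv3 downsampling strides of 32, 16 and 8 for its three detection scales. The function name `grid_sizes` is hypothetical. Note that this only gives the stride mapping (one output cell per 32x32 block of input pixels at the coarsest scale); it does not compute the receptive field, which is larger than the stride.

```python
def grid_sizes(input_size, strides=(32, 16, 8)):
    """Return the side length of the detection grid at each scale.

    Assumes a square input whose side is divisible by every stride,
    as is the case for the 416x416 input discussed in the comment.
    """
    for stride in strides:
        if input_size % stride != 0:
            raise ValueError(f"{input_size} is not divisible by stride {stride}")
    return [input_size // stride for stride in strides]


if __name__ == "__main__":
    # 416 / 32 = 13, 416 / 16 = 26, 416 / 8 = 52
    print(grid_sizes(416))
```

With a 416x416 input this yields the 13x13 grid mentioned for layer 82, plus 26x26 and 52x52 grids for the two finer scales.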

  • @glowwell4292
    @glowwell4292 3 years ago +1

    Thanks a lot. Explained neatly.
    Please make videos on V4 and V5 too.

  • @zubairsk1624
    @zubairsk1624 3 years ago +4

    Hello dear,
    I hope you are okay.
    I want to ask you a few questions:
    1- Can I modify the YOLO equations to get better detection?
    2- Can you recommend some videos that explain everything about YOLOv4?
    3- How can I write these equations in Python?
    I hope you can answer me. Thank you.

    • @jessmendoza1483
      @jessmendoza1483 2 years ago

      I've read some articles where they improve YOLOv3 by adding an equation. You should search for some; maybe it could help you.
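On the question of writing the equations in Python, here is a minimal sketch of the bounding-box decoding step as given in the YOLOv3 paper: b_x = sigmoid(t_x) + c_x, b_y = sigmoid(t_y) + c_y, b_w = p_w * exp(t_w), b_h = p_h * exp(t_h). The function and argument names are hypothetical, and the sketch assumes that the cell offsets (c_x, c_y) and anchor sizes (p_w, p_h) are all expressed in grid-cell units.

```python
import math


def sigmoid(x):
    """Logistic function, squashes any real number into (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))


def decode_box(t_x, t_y, t_w, t_h, c_x, c_y, p_w, p_h):
    """Turn raw network outputs into a box centre and size.

    t_x, t_y, t_w, t_h: raw predictions for one anchor in one grid cell.
    c_x, c_y: column and row index of the grid cell (its top-left corner).
    p_w, p_h: width and height of the anchor box (the prior).
    """
    b_x = sigmoid(t_x) + c_x   # centre stays inside the predicting cell
    b_y = sigmoid(t_y) + c_y
    b_w = p_w * math.exp(t_w)  # anchor is scaled, never made negative
    b_h = p_h * math.exp(t_h)
    return b_x, b_y, b_w, b_h


if __name__ == "__main__":
    # All-zero raw outputs give a box centred in cell (6, 6) with the anchor's size.
    print(decode_box(0.0, 0.0, 0.0, 0.0, 6, 6, 3.0, 2.0))
```

Multiplying the result by the stride of the scale (for example 32 on the 13x13 grid) would convert it to pixels of the network input.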

  • @sekharbabu8498
    @sekharbabu8498 2 years ago +3

    Good explanation. Thank you sir

  • @apurbaroy8411
    @apurbaroy8411 3 years ago +2

    Is it possible to integrate the YOLO algorithm with arduino or raspberry pi using a webcam?

  • @seolakim5667
    @seolakim5667 2 years ago +2

    Thank you so much for this amazing video. Just one question : at 23:58 , why would you define the "t_0" inside the sigmoid? In the loss function of Yolo v3 they directly use p_0 so I would like to know why! Is this just to make sure that the p_0 is between 0 and 1? Does this t_0 appear somewhere in the model when we implement it? Thanks in advance to anyone who would reply :)

  • @bharathnvadla
    @bharathnvadla 2 years ago +1

    Hi, thank you for the explanation. I have one question: how is the objectness score calculated during inference? There is no ground truth to refer to, so on what basis is the objectness score measured?

  • @kyawnaingwin8300
    @kyawnaingwin8300 3 years ago +1

    Should the input image for detection be the same size as the training images used in model fitting? Or how large can an input image be?

    • @valentynsichkar
      @valentynsichkar 3 years ago +3

      Hello there,
      There is no need to resize images before training, or before testing after training. The framework (e.g. the YOLO framework on GitHub) will take care of resizing. Moreover, individual images, both for training and testing, can also be of different dimensions.

    • @kyawnaingwin8300
      @kyawnaingwin8300 3 years ago +1

      @valentynsichkar Thanks for the reply. In my case, my test image is 20,000 x 20,000 in size (a drone photo mosaic) and the model cannot detect anything. It only works when I split the input image into tiles of the same size as the training images. Based on your answer, I think I can use bigger tiles for detection, but I just want to know the limit on input size.
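The tiling approach described in this reply can be sketched as follows. This is an illustrative helper, not part of any YOLO framework: the names `tile_origins` and `tile_boxes`, the tile size and the overlap are all assumptions. It only computes crop rectangles; running the detector on each crop and shifting the detections back by the tile origin is left out.

```python
def tile_origins(length, tile, overlap=0):
    """Start offsets of tiles along one axis, covering [0, length)."""
    if tile <= overlap:
        raise ValueError("tile must be larger than overlap")
    step = tile - overlap
    origins = list(range(0, max(length - tile, 0) + 1, step))
    # Add a final tile flush with the far edge if the regular steps fall short.
    if origins[-1] + tile < length:
        origins.append(length - tile)
    return origins


def tile_boxes(width, height, tile, overlap=0):
    """Crop rectangles (left, top, right, bottom) covering the whole image."""
    return [
        (x, y, min(x + tile, width), min(y + tile, height))
        for y in tile_origins(height, tile, overlap)
        for x in tile_origins(width, tile, overlap)
    ]


if __name__ == "__main__":
    # Hypothetical example: a 1000x1000 image cut into 416x416 tiles.
    for box in tile_boxes(1000, 1000, 416):
        print(box)
```

A non-zero overlap is a common way to avoid cutting objects in half at tile borders, at the cost of having to merge duplicate detections afterwards.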

  • @blueknight6906
    @blueknight6906 4 years ago +1

    how many classes can yolo detect?

    • @valentynsichkar
      @valentynsichkar 4 years ago +1

      It depends on how many classes it is set up for in training. For instance, YOLO trained on the COCO dataset detects and classifies 80 classes.

    • @blueknight6906
      @blueknight6906 4 years ago +1

      @valentynsichkar YOLO v3?

    • @valentynsichkar
      @valentynsichkar 4 years ago +1

      It doesn't matter which algorithm. As mentioned in the message above, it depends on what number of classes is specified for training. It can be YOLO v2, v3, v4 or any other algorithm.
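To make the link between the number of classes and the network output concrete, here is a small sketch assuming the YOLOv3 layout of 3 anchor boxes per scale, each predicting 4 box coordinates, 1 objectness score and one score per class. The function name `detection_channels` is hypothetical.

```python
def detection_channels(num_classes, anchors_per_scale=3):
    """Depth of a YOLOv3 detection feature map for a given class count.

    Each anchor predicts 4 box values + 1 objectness score + num_classes
    class scores, so the depth is anchors_per_scale * (5 + num_classes).
    """
    if num_classes < 1:
        raise ValueError("num_classes must be at least 1")
    return anchors_per_scale * (4 + 1 + num_classes)


if __name__ == "__main__":
    # COCO has 80 classes: 3 * (5 + 80) = 255, the depth seen in (13, 13, 255).
    print(detection_channels(80))
```

Training on a custom dataset changes only this depth, for example 18 channels for a single class.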

  • @mainulalam7767
    @mainulalam7767 3 years ago +5

    Thank you for this super explanation. I have a question regarding the objectness score. As you explained mathematically, P0 = sigmoid(to) = P(object) * IoU. My question is: how do we obtain this "P(object)", the predicted probability? Thanks in advance for your support.

    • @bharath5666
      @bharath5666 3 years ago +1

      Yes, it is the probability predicted by the network.

    • @jessmendoza1483
      @jessmendoza1483 2 years ago

      @bharath5666 Where can I find how the network predicts P(object), either mathematically or somewhere in the code?
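As a rough illustration of the formula quoted in this thread, the sketch below treats the objectness score as the sigmoid of a raw network output t_o. At inference there is no ground truth involved: the network simply emits t_o for every anchor, and the score is compared against a threshold. The names `objectness` and `is_confident` and the threshold value of 0.5 are assumptions for illustration, not taken from the video.

```python
import math


def objectness(t_o):
    """Map the raw objectness output t_o to a score p_o in (0, 1)."""
    return 1.0 / (1.0 + math.exp(-t_o))


def is_confident(t_o, threshold=0.5):
    """Keep a prediction only if its objectness score reaches the threshold."""
    return objectness(t_o) >= threshold


if __name__ == "__main__":
    for raw in (-4.0, 0.0, 4.0):
        print(raw, round(objectness(raw), 3), is_confident(raw))
```

During training, the target for this score is what ties it to the overlap between the predicted box and a ground-truth box; the sigmoid only guarantees that the prediction stays between 0 and 1.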

  • @merlinkurian719
    @merlinkurian719 4 years ago +1

    Thanks a lot. Please make a video on YOLOv4.

  • @akhilraj2091
    @akhilraj2091 4 years ago +1

    Great video, thanks for this.

  • @mohssineserraji1098
    @mohssineserraji1098 3 years ago +1

    Great presentation

  • @Stilbrech3rin
    @Stilbrech3rin 3 years ago +1

    I can just follow the others. This video is very helpful. Did you publish a paper? I would like to cite you for my project.

  • @travel7517
    @travel7517 4 years ago +1

    Nicely explained

  • @rlb5261
    @rlb5261 3 years ago +1

    Thanks. It is excellent!

  • @erack1
    @erack1 3 years ago +1

    New to machine learning and I'm wanting to create an object detection for video games. What are some good resources to start learning, I know the basics essentially of neural networks and their functions. I've bought your course and will be starting to learn that.

  • @kondurusrikanth7620
    @kondurusrikanth7620 4 years ago +1

    Nice explanation

  • @pulkitverma1507
    @pulkitverma1507 2 years ago +1

    Very helpful thanks!

  • @ahhhwhysocute
    @ahhhwhysocute 3 years ago +2

    Amazing explanation !! Thank you

  • @Илья96-с7б
    @Илья96-с7б 3 years ago +1

    Simply top-notch. Everything became clear right away. At least now it is clear what these anchors are.

  • @muhammad_talha
    @muhammad_talha 4 years ago +1

    ~perfacto!

  • @valentinfontanger4962
    @valentinfontanger4962 4 years ago +1

    perfect !

  • @fadouaamraniidrissi819
    @fadouaamraniidrissi819 4 years ago +1

    thank you

  • @JNK369
    @JNK369 4 years ago +1

    Thanks sir

  • @fatiah541
    @fatiah541 3 years ago +1

    Thanks 🌹🌹🌹🌹

  • @fatiah541
    @fatiah541 3 years ago +2

    🍀🍀🍀🍀🍀🇮🇶

  • @listenbyheart5552
    @listenbyheart5552 3 years ago +1

    really awesome explanation it was!
    thanks a lot

  • @SM--wb4vg
    @SM--wb4vg 3 years ago +1

    Very well explained