C4W4L07 What are deep CNs learning?

แชร์
ฝัง
  • เผยแพร่เมื่อ 17 ก.ย. 2024

ความคิดเห็น • 19

  • @aayushpaudel2379
    @aayushpaudel2379 4 ปีที่แล้ว +1

    my logic of understanding- the further you are from images- the more (wider) you see!! :D

  • @sandipansarkar9211
    @sandipansarkar9211 3 ปีที่แล้ว

    great explanation

  • @MG5350
    @MG5350 4 ปีที่แล้ว +3

    I can't understand how these visualizations can be real if images are progressively being convoluted and have less and less resolution throughout the convnet

    • @JIMMYLIU5
      @JIMMYLIU5 4 ปีที่แล้ว +9

      You go backwards to find the patch in original image that is giving maximum response at a certain layer.

  • @emmanueltd7628
    @emmanueltd7628 5 ปีที่แล้ว +1

    why does the first layer see only a little of the image? it convolves through the entire image right?

    • @lizeng489
      @lizeng489 4 ปีที่แล้ว +8

      You are thinking the deep CN learning as a "forwarding process", but it is actually a "backwarding process". Instead of thinking the first layer params are generated from a convolution of the image, better think the first layer params are told to be so by the second layer which is again told to be so by higher layers all the way to the loss function of input data. It is this structure that decided the first layer better only "see only a litter fraction of the image" will perform the best for the purpose.

    • @puneetsingh5219
      @puneetsingh5219 4 ปีที่แล้ว +1

      @@lizeng489 So, in a way, the CNN in breaking the image down moving from higher level to lower level layers?

    • @abdulmukit4420
      @abdulmukit4420 4 ปีที่แล้ว

      It is due to something called the receptive field. Look at this documentation from cs231n.
      cs231n.github.io/convolutional-networks/

    • @pablopicasso2208
      @pablopicasso2208 4 ปีที่แล้ว

      It convolves through the entire image but with it's filters i.e. the weights that are only looking for pattern in the whole image like a diagonal or a straight line we can only see the edges(Activation only values it as edge or not edge).Next is pooling, Imagine it as something that reduces the resolution but shows the same image(The image hear being the edges but just in a smaller resolution).We convolve it again and start looking for shapes like circle through the whole image features which we got from the previous result(i.e. the images) and it goes .LMK if it makes sense

    • @gorgolyt
      @gorgolyt 3 ปีที่แล้ว +1

      @@lizeng489 Not sure why so many upvotes, this answer is just wrong and irrelevant to the question.

  • @kihongkim8726
    @kihongkim8726 3 ปีที่แล้ว

    "Pick a unit in layer 1" means 110x110x1 from 110 x 110 x 96 ???

  • @ahmedyehia5553
    @ahmedyehia5553 6 ปีที่แล้ว

    3:00

  • @hirakmondal6174
    @hirakmondal6174 4 ปีที่แล้ว +3

    Who on earth gives picture of his wife in examples..?😂

    • @udaykadam5455
      @udaykadam5455 4 ปีที่แล้ว

      How is this a problem when our country is going bananas.
      Let's get to this when we solve like millions of actual problems in our lives.

    • @psibarpsi
      @psibarpsi 2 ปีที่แล้ว

      Andrew Ng.

  • @thovinh5386
    @thovinh5386 4 ปีที่แล้ว

    I understand every English words he said in this video, I just don't understand the content.

    • @xl0xl0xl0
      @xl0xl0xl0 2 ปีที่แล้ว +1

      Do you understand it now?