Heroes of Deep Learning: Andrew Ng interviews Andrej Karpathy

แชร์
ฝัง
  • เผยแพร่เมื่อ 24 พ.ย. 2024

ความคิดเห็น • 113

  • @maganaluis92
    @maganaluis92 7 ปีที่แล้ว +115

    Andrew Nj and Andrej in the same video, this will go down in history!

    • @selva279
      @selva279 4 ปีที่แล้ว +2

      It would have been complete if Chris olah also joined them

    • @jorjiang1
      @jorjiang1 8 หลายเดือนก่อน

      stop cheerleading and code yourself

  • @aitarun
    @aitarun 7 ปีที่แล้ว +306

    I have to listen Andrej part at 0.75 speed. :)

    • @gianluke
      @gianluke 7 ปีที่แล้ว +33

      I've checked a couple of time the youtube settings because I thought the video was accelerated :|

    • @quietbydayYT
      @quietbydayYT 7 ปีที่แล้ว +45

      Yes, the sign of a brain limited by the bandwidth of speech. lol

    • @SaiFi0102
      @SaiFi0102 7 ปีที่แล้ว +3

      Haha, it sounds much better 0.75 :'D

    • @itttottti
      @itttottti 7 ปีที่แล้ว

      hahaha, show and gone

    • @pakigya
      @pakigya 7 ปีที่แล้ว +3

      Thank you. I was watching at 2x speed before lol

  • @nickang6647
    @nickang6647 7 ปีที่แล้ว +137

    Prof Andrew is a really humble person! Thanks for taking the time to interview and share this.
    13:02 - Advice for people thinking about entering the field of AI, deep learning

    • @saikrishnaklu
      @saikrishnaklu 6 ปีที่แล้ว +2

      Thats an amazing reply #13:02

  • @Thegreatindiaexpedition
    @Thegreatindiaexpedition 7 ปีที่แล้ว +76

    Andrew Ng is my hero .... He motivated me the first time from his lecture series on Machine learning

  • @curioussand1339
    @curioussand1339 7 ปีที่แล้ว +39

    Andrej Karpathy talks in such a way that I briefly thought I had the clip running @ 1.25

  • @PrabinKumarRath-kf1rv
    @PrabinKumarRath-kf1rv 2 หลายเดือนก่อน

    Awesome to know that Andrej and OpenAI really made it happen! Some of the terms that Andrej mentions AGI, agents, end-to-end models, they were always on the right track! We realized all of this after ChatGPT in 2023

  • @blueborne4031
    @blueborne4031 4 ปีที่แล้ว +3

    Both of these people are my heroes. I would not have gone into deep learning without them

  • @bellaggio1770
    @bellaggio1770 6 ปีที่แล้ว +4

    Humbling to hear people who are way smarter than us

  • @morebaie3412
    @morebaie3412 5 ปีที่แล้ว +7

    What an amazing interview! Andrej Karpathy is making a great work intersecting NLP with computer vision, it's a huge move in the AI era.

  • @cupajoesir
    @cupajoesir 7 ปีที่แล้ว +15

    The energy present in this discussion is fantastic. Thanks for sharing.

    • @vladimirbosinceanu5778
      @vladimirbosinceanu5778 2 ปีที่แล้ว +1

      It's amazing how we can perceive honesty/passion and how we can resonate with it. Thank you Andrew and thank you Andrej!

  • @mdougf
    @mdougf 5 ปีที่แล้ว +9

    Thanks for this interview, Andrew; you're the man. And hello to my fellow learners! Is anyone interested in starting a weekly machine learning research paper reading and discussion group with me?

  • @fabianmarin8514
    @fabianmarin8514 ปีที่แล้ว +1

    The two folks from which I've learned the most about AI. Thanks so much!

  • @dciug
    @dciug 7 ปีที่แล้ว +24

    until 1:40
    YES! That is exactly how I felt during the AI class that I took. I really thought that those methods do not deserve to be named AI. NNs and Boltzmann Machines are what really got me started into this field. I can do this all day and not feel tired, and that's awesome.

  • @rajatrao5632
    @rajatrao5632 5 ปีที่แล้ว +3

    Important statement Andrej made was " we truly understand the library/things that abstract away many low level complex things..when we once are in a position to write something from scract low level and then we will be comfortable to use the libraries who are doing the same and modify " truly a great statement

  • @adityasoni121
    @adityasoni121 7 ปีที่แล้ว +8

    I wonder what will happen if Andrej would cite a story to a toddler...
    Great Lecturer!!(Really enjoyed CS231N)
    Thank you..

    • @guestimator121
      @guestimator121 6 ปีที่แล้ว +1

      +Aditya Soni "..Cooncretely, Hansel has put all the pebbles in his pocket in a way... well, you really don't need to know all of the details of how did he do it to understand the rest o the story, the important thing for you to understand was that he had pebbles in his pocket..."

  • @AnkitBindal97
    @AnkitBindal97 7 ปีที่แล้ว +12

    DFS, BFS, Alpha-beta pruning....... Exactly! Even undergraduates are taught these things. It's nowhere near what is actually happening in machine learning.

  • @kssreesha
    @kssreesha 3 ปีที่แล้ว +2

    This has some of the best insights !!

  • @sarahjamal86
    @sarahjamal86 5 ปีที่แล้ว +2

    Well he is my hero as well ... because of him I could understand the concepts and implement them before moving to use tensorflow and pytorch.
    Thanks Karpathy, your contributions to the CS community are so valuable. :-)

    • @phillaysheo8
      @phillaysheo8 ปีที่แล้ว

      Women who like ML are hot 🤩

  • @vq8gef32
    @vq8gef32 7 หลายเดือนก่อน

    He is a real hero, I am watching his lessons : Love + AI === Andrej

  • @YULi-qf1wq
    @YULi-qf1wq 7 ปีที่แล้ว +8

    Andrej is less confident than he was in cs231 class but cuter for his humbleness in this interview without any direct gaze to camera :D

  • @preethamgali3023
    @preethamgali3023 4 ปีที่แล้ว

    Exactly, implementing from scratch does help one to understand better.

  • @shubharthaksangharsha6248
    @shubharthaksangharsha6248 ปีที่แล้ว +1

    2 legends in one frame

  • @maciejbalawejder1819
    @maciejbalawejder1819 3 ปีที่แล้ว +3

    10:57 - but that's exactly tesla's approach to self-driving, creating separate models and merge them together

  • @inilahsaltakadnak
    @inilahsaltakadnak 7 ปีที่แล้ว +6

    Very insightful. At 10:15 the split of AI is interesting

  • @benitoteehankee3014
    @benitoteehankee3014 6 ปีที่แล้ว +7

    "... not decomposing but having a single neural network, a complete dynamical system, that you're always working with -- a full agent. The question is: 'How do you actually create objectives such that when you optimize over the weights to make up that brain, you get intelligent behavior out?' " Really interesting. That sounds a lot like the goal of teaching human beings, too. How do you teach without decomposing knowledge into subjects and teach from a holistic point of view?

    • @stock99
      @stock99 6 ปีที่แล้ว +2

      Benito Teehankee this question is the best part of the entire interview to me. Good question is half of the answer. Digging into it.... Very interesting..

  • @myspacetimesaucegoog5632
    @myspacetimesaucegoog5632 7 ปีที่แล้ว +3

    I'm super keen to hear how Andrei's ideas for an overall "just learn everything about everything" type AI progress. I kind of imagine a "baby" AI system following humans around watching imitating absorbing and learning - somehow., gradually growing up...

  • @mynameisZhenyaArt_
    @mynameisZhenyaArt_ 7 ปีที่แล้ว +3

    thanks for preserving knowledge :)

  • @bntagkas
    @bntagkas 4 ปีที่แล้ว +1

    i have to listen to this at 1.25 speed only instead of usualy 1.5 or 1.75, nice

  • @6thHorseMan
    @6thHorseMan 7 ปีที่แล้ว +9

    Start out with what is under the hood and build your knowledge from there.
    To fully understand ML you can't just be a library user.

  • @BrutalStrike2
    @BrutalStrike2 2 ปีที่แล้ว +1

    Now Andrej made own mini course on his TH-cam

  • @motiurrahman
    @motiurrahman 7 ปีที่แล้ว +6

    Such a cool interview - the mentor interviewing the mentee.

  • @5gururaj5
    @5gururaj5 6 ปีที่แล้ว +3

    I turned it to 1.25x as usual, and I had to switch back to 1x 😄

  • @MartinLichtblau
    @MartinLichtblau 6 ปีที่แล้ว +4

    Our biggest fallacy: if we model each human ability by hand we will have a AI.
    Same fallacy was committed before with feature modelling. Today we know better. Or at least we thought so..... unreflected we are!

  • @malikhamza9286
    @malikhamza9286 3 ปีที่แล้ว

    This is the first video I haven't watched in 1.25 or 1.5x

  • @phillaysheo8
    @phillaysheo8 ปีที่แล้ว +1

    "It's not rocket science or nuclear physics" 😀
    "You just need to know linear algebra and calculus" 😔

  • @iamr0b0tx
    @iamr0b0tx หลายเดือนก่อน +1

    Proof that Andrej is an LLM 1:00 😅

  • @jackxiao8140
    @jackxiao8140 5 หลายเดือนก่อน

    Love the little laugh at 12:58

  • @relganz4663
    @relganz4663 7 ปีที่แล้ว +1

    12:55 best part. Whatever his idea is, it's probably right. But why no question about Tesla? not even high level?

  • @ChandlerRandolph-yc5re
    @ChandlerRandolph-yc5re ปีที่แล้ว

    very informative!

  • @waynelau3256
    @waynelau3256 2 หลายเดือนก่อน

    The two gods

  • @randywelt8210
    @randywelt8210 7 ปีที่แล้ว

    Can please explain anyone ctc loss and beam search decoding in numpy? That is implemented in tensorflow, but it is really hard to understand what is going on.

    • @dgimop
      @dgimop 7 ปีที่แล้ว

      In case you have not yet figured this out: I skimmed over the CTC paper, cited by tensorflow, for a minute. Are you talking about how CTC works as a whole or only about how the cost/loss is calculated in the softmax (output) layer, as in how the loss function works for this classification algo? I can give some pointers on what I understood about the latter. My explanation might be either naive or complicated, depending on how deeply you understand ML.
      CTC calculates the cost of an error using the principles of maximum likelihood estimation (MLE). In particular, 'minimising it [the cost function] maximises the log likelihoods of the target labellings' - as the authors say. To label the output, it uses one extra unit in the softmax layer than the number of output labels, unlike traditional methods that use as many output units as there are labels to classify. The extra unit is reserved for observing a 'blank' or 'no label' class. If my understanding is correct, this gives the algorithm some breathing room to skip over labelling the data that is does not understand correctly and save it for later (?) rather than falsely classifying it as one of the labels because it was forced to do so.
      Couldn't get the time to learn about beam search decoding :)

  • @realGBx64
    @realGBx64 6 ปีที่แล้ว

    It is so weird for me when they emphasize the importance of knowing the basics. InEastern Europe we learned almost everything from bottom up. I had abstract maths before calculus, wrote algorithms on paper, calculated matrix determinants by hand, etc.

  • @markhofstede
    @markhofstede 6 ปีที่แล้ว +1

    Would love to see him speak with Elon!

    • @godspeed133
      @godspeed133 4 ปีที่แล้ว

      He now works for Elon (maybe he had started by then and you knew(?))

  • @omeryalcn5797
    @omeryalcn5797 6 ปีที่แล้ว +3

    Warning !! real time of video is 20.1333333333 :)

  • @hasnainabbasdilawar8832
    @hasnainabbasdilawar8832 7 ปีที่แล้ว +1

    This guy talks fast!

  • @michaellidster1389
    @michaellidster1389 5 ปีที่แล้ว +1

    Heroes hey

  • @ehfo
    @ehfo 6 ปีที่แล้ว

    he talks so fast!

  • @israel_abebe
    @israel_abebe 7 ปีที่แล้ว

    what course is he talking about?

    • @taylordelehanty8008
      @taylordelehanty8008 7 ปีที่แล้ว +3

      Israel Abebe they're talking about the Stanford course here cs231n.stanford.edu

  • @abdAlmajedSaleh
    @abdAlmajedSaleh 6 หลายเดือนก่อน

    I didn't know it was dog network.

  • @KaiyuZheng
    @KaiyuZheng 5 ปีที่แล้ว

    I actually didn’t set the speed to 2

  • @dvm509
    @dvm509 7 ปีที่แล้ว +2

    when AI god speaks ...

  • @rubixcom
    @rubixcom 6 ปีที่แล้ว

    hang on... but had he actually trained himself on that dataset, he would be performing better than ML

  • @BrianBull
    @BrianBull 6 ปีที่แล้ว

    Tesla AKnet

  • @-mwolf
    @-mwolf ปีที่แล้ว

    10:55

  • @Jerry-yy1qy
    @Jerry-yy1qy 4 ปีที่แล้ว

    说话速度有点快

  • @dexmoe
    @dexmoe 7 ปีที่แล้ว +1

    human benchmark lol

  • @tianshiliao5372
    @tianshiliao5372 7 ปีที่แล้ว +2

    Just not a big fan of udemy ML ads.. spent 20 hrs on it without learning the proper definition and math expression of cost function.. what a waste of time I have to say

    • @dgimop
      @dgimop 7 ปีที่แล้ว +3

      The course Andrew NG was talking about is in Coursera, not Udemy, if I understand your concern correctly. This is a brand new specialization. However, the best available Machine Learning course online, in my opinion, is Andrew NG's own course titled 'Machine Learning'. It's absolutely amazing, very detailed and free. It is probably the very first online ML course. I dropped out of a grad course at the university and spent that entire semester on this course. It eased me into my grad research.

  • @CorporateDrone
    @CorporateDrone 2 ปีที่แล้ว

    It isn’t obvious to me that Andrej is not a genius

  • @StanislavMasharsky
    @StanislavMasharsky 5 ปีที่แล้ว +1

    А почему такое всратое качество в 2017-м году?

  • @surfermx
    @surfermx 2 ปีที่แล้ว +1

    Mercedes-Benz is already level 3,
    while Tesla is just level 2,
    this weirdo seems has no noticed it yet

    • @Tom-ku8bu
      @Tom-ku8bu 2 ปีที่แล้ว

      Mercedes is only level 3 on certain situations on the highway but Tesla is on the way to be highest level on any road and every situation. The computer of the Tesla's are probably more powerful than of Mercedes. But why do you mention it on a video that is 5 years old? At that time Mercedes was no where with self driving and in Tesla's it was already an early not so good version available. Now fsd beta gets every update better and is already pretty amazing how it handles heavy traffic in cities which Mercedes can't.

  • @samahirrao
    @samahirrao 7 ปีที่แล้ว

    Andrew Ng does not feel like a good person.. Kind of started hating him. But his research is no doubt great.

    • @Samir_Zope
      @Samir_Zope 7 ปีที่แล้ว +5

      Why is he not good person?

    • @reetigarg7398
      @reetigarg7398 7 ปีที่แล้ว

      R1nz0R I think it's largely because of the way he interacts with others. But I think you're mistaken there, he might come across as not a good guy when he actually is.

    • @Samir_Zope
      @Samir_Zope 7 ปีที่แล้ว +4

      Reeti Garg imo he actually seems like a kind person but ok xD

    • @myspacetimesaucegoog5632
      @myspacetimesaucegoog5632 7 ปีที่แล้ว +6

      Gosh I thought Andrew seems an extremely good person, watching him in this video.