PyTorch at Tesla - Andrej Karpathy, Tesla

แชร์
ฝัง

ความคิดเห็น • 360

  • @buzz8545
    @buzz8545 4 ปีที่แล้ว +468

    How many of you checked the playback speed?

    • @carvalhoribeiro
      @carvalhoribeiro 4 ปีที่แล้ว +5

      looks like Abigail Doolittle from bloomberg 1.5x speed

    • @Wingman77tws
      @Wingman77tws 4 ปีที่แล้ว +10

      came here to say that. might be the only video ever worth watching at .75

    • @sebbecht
      @sebbecht 4 ปีที่แล้ว +7

      Dead serious, many videos I watch on youtube is on 1.25 or often 1.5 speed. but this guy, everything is 0.75, just to make sure I dont miss anything

    • @akarmdit2267
      @akarmdit2267 4 ปีที่แล้ว +3

      usually 1.5 but his 1.25

    • @pretamane9646
      @pretamane9646 4 ปีที่แล้ว

      yessss

  • @AgentOffice
    @AgentOffice 4 ปีที่แล้ว +530

    Karpathy, very fitting name

    • @ishunyu
      @ishunyu 4 ปีที่แล้ว +22

      Agent Office LOL true. Never realized. 🤣🤣🤣

    • @gridcoregilry666
      @gridcoregilry666 4 ปีที่แล้ว +2

      might explain pls?

    • @borisdemelo
      @borisdemelo 4 ปีที่แล้ว +12

      Julio Chao he has ‘kar’ in his name and works for a ‘car’ company. :)

    • @AllanSustainabilityFan
      @AllanSustainabilityFan 4 ปีที่แล้ว +77

      ​@@gridcoregilry666 Kar-Pathy -> Car + Path -> self driving cars.

    • @AgentOffice
      @AgentOffice 4 ปีที่แล้ว +5

      @@gridcoregilry666 car path

  • @la-civetta
    @la-civetta 4 ปีที่แล้ว +160

    Some people have great empathy, Andrej has great carpathy.

    • @lonnybulldozer8426
      @lonnybulldozer8426 2 ปีที่แล้ว +8

      *self-driving carpathy. (Some people are self-driven by empathy. Andrej is a self-driving carpathy.

    • @ashwithabanoth3520
      @ashwithabanoth3520 10 วันที่ผ่านมา

      it's karpathy actually

  • @thischannelhasaclevername5481
    @thischannelhasaclevername5481 4 ปีที่แล้ว +82

    11:02: "Thank you"
    11:03: Exit to work

    • @FireFly969
      @FireFly969 หลายเดือนก่อน

      😂😂😂😂

  • @cappuccinopapi3038
    @cappuccinopapi3038 4 ปีที่แล้ว +289

    *Me who’s struggling even with my basic calculus class:
    Fascinating

    • @alekseysoldatenkov5675
      @alekseysoldatenkov5675 4 ปีที่แล้ว +41

      All you need is curiosity and resilience! You got this dude!

    • @GRMREAP3R97
      @GRMREAP3R97 4 ปีที่แล้ว +4

      That's the mark of a true teacher, they can make the most difficult of concepts seem easy and fascinating

    • @Wulfcry
      @Wulfcry 4 ปีที่แล้ว +3

      Google Children wooden educational toy's Montessori math , Did you know we all played with those didn't sink in then, Well these structure are practically the basic shape which also to find in calculus. If you can't rotate them in you're head get them and visualize the image space.
      These shapes also hold structures done with programming. It might seem simplistic after all these years but breaking you're head to grasp something that has these shape why not.
      Circle -> loop , iteration, segmentation etc. Try fill in the blanks with Square, Triangle and Pentagon shape. and cube some. We all are well endowed with knowledge use it.

    • @WandererOfWorlds0
      @WandererOfWorlds0 4 ปีที่แล้ว +1

      There's very little calculus in neural networks besides differentials (gradients).

    • @kalp2586
      @kalp2586 3 ปีที่แล้ว

      You don't need to know calculus for deep learning. Like you don't need to learn assembly language for building web apps.

  • @TheNyatzAnger
    @TheNyatzAnger 4 ปีที่แล้ว +383

    Always great to hear Andrej's talks. He's left an indelible impression on my research career in deep learning through CS231N

    • @UmuroElema
      @UmuroElema 4 ปีที่แล้ว +3

      🙏

    • @sreekarnim163
      @sreekarnim163 4 ปีที่แล้ว +10

      CS231n is IMO the BEST online CS course on the internet.

    • @aishahsofea3128
      @aishahsofea3128 4 ปีที่แล้ว +2

      can i take the course if im not a stanford student?

    • @sreekarnim163
      @sreekarnim163 4 ปีที่แล้ว

      @@aishahsofea3128 yes

    • @MdSheraj
      @MdSheraj 3 ปีที่แล้ว +8

      @@aishahsofea3128 th-cam.com/play/PL3FW7Lu3i5JvHM8ljYj-zLfQRF3EO8sYv.html&feature=share
      There you go.

  • @ashh3051
    @ashh3051 4 ปีที่แล้ว +116

    That was awesome. Lots of new insights beyond what was presented at Autonomy Day. I wish he had 1 hour to talk.

  • @kintaro_f
    @kintaro_f 4 ปีที่แล้ว +785

    there should be a "game" in model 3 where people can tag things like traffic lights and other unsolved obstacles manually so the AI is learning from as many humans as possible. Maybe you reward them with free supercharging or something. 🤘😜

    • @cleberz8072
      @cleberz8072 4 ปีที่แล้ว +107

      I think we all do that manually every time we take over and correct the autopilot when driving a Tesla. I bet Tesla uploads the data and uses the action taking as the training feedback

    • @mattz2729
      @mattz2729 4 ปีที่แล้ว +5

      This is a great idea, you should somehow share this

    • @dougdstecklein
      @dougdstecklein 4 ปีที่แล้ว +18

      Cleber Zarate
      Disengaging autopilot does not label objects.
      If a deer runs across the road and the driver swerves to avoid hitting it, autopilot will disengage and the driver’s actions can be used to teach the neural net how to react in that situation.
      But driver input(steering, braking) does not label the object.
      You can teach a neural net how to properly recognize objects but it requires labeled data.
      So the OP is correct that this could be helpful.
      Whether Tesla needs help labeling objects 🤷‍♂️ .

    • @cleberz8072
      @cleberz8072 4 ปีที่แล้ว +11

      @@dougdstecklein you're right, it won't label it but it will vastly reduce the amount of data to go through, as you could, for instance, know all those pictures had a red light since the car had to step on the brake coming to an intersection when no objects were in front of it. See what I'm talking about?

    • @JordanPriede
      @JordanPriede 4 ปีที่แล้ว +17

      Many captchas on websites allow humans to label such data.

  •  4 ปีที่แล้ว +180

    Wow Karpathy's a fast talker, its like the video was sped up.

    • @Findalfen
      @Findalfen 4 ปีที่แล้ว +9

      I slowed the video down to 0.75 and it was finally intelligible. Speaking as fast as he is doing is not a good trait. If you want people to understand and remember what you said; you need to let them time to process it.
      Quite the irony for someone working in data processing. Does it believe we are all machines?

    • @cleberz8072
      @cleberz8072 4 ปีที่แล้ว +13

      It's interesting to hear your opinions - I've been living in the US for 14 years and have no problem understanding him, as he's still relatively slower than how people generally speak (perhaps not in presentations). He has an accent which makes slight harder to understand than a native but it's still not as fast as it appears. But yeah I definitely would not have kept up 14 years ago, the 0.75 is a nice trick you guys got there, I wish I had that back then

    • @m3po22
      @m3po22 4 ปีที่แล้ว +5

      Yeah I was like, "Did I already speed this up? Oh, it's just him."

    • @joelodlund6979
      @joelodlund6979 4 ปีที่แล้ว +4

      talking fast makes him able to make complex points while still keeping the listeners attention. I have no trouble following along, and I'm not a native speaker.

    • @cleberz8072
      @cleberz8072 4 ปีที่แล้ว +3

      @@joelodlund6979 I'm thinking the issue with the native speakers complaining here must be because most of the tech vocabulary he uses are unfamiliar terms to them. Complainers, please enlighten me. I'm used to most if not all of those terms given my professional background. How about you? My brain processing will certainly slowdown when I encounter a completely new subject regardless of the languages I'm fluent at

  • @DirtyTesla
    @DirtyTesla 4 ปีที่แล้ว +50

    Amazing stuff. I love seeing this stuff improve every few weeks first hand.

    • @RubenKelevra
      @RubenKelevra 4 ปีที่แล้ว +2

      I still think they currently hold back a huge step forward due to inconsistency. Probably navigation of roundabouts and automatic traversing of intersections with yield, stop signs and traffic lights straight.

    • @DirtyTesla
      @DirtyTesla 4 ปีที่แล้ว +1

      @@RubenKelevra I totally agree. We've seen Green use stop sign and traffic recognition on his S months ago and it was pretty damn good. Not flawless tho, and you can't mess around with a red light.
      I wish we could opt into some extreme beta program :)

    • @RubenKelevra
      @RubenKelevra 4 ปีที่แล้ว

      @@DirtyTesla Mapillary shows a bit more in depth what they are capable of in terms of detection.
      th-cam.com/video/3IIlc0HzES0/w-d-xo.html
      You can even go on their website and look at pictures other people has provided, so completely different angles, cameras, countries and climates and their detections are pretty much spot on when it comes to "where are cars? where is the road? Where are obstacles?"
      And the sign detection is able to identify the most important signs as far as a human would be able to, but with an average smartphone as camera.

  • @pks.
    @pks. 3 ปีที่แล้ว +16

    no LIDAR? just wowww
    As always, Andrej is the best in explaining Computer Vision :)

  • @JoseDiaz12
    @JoseDiaz12 4 ปีที่แล้ว +19

    High levels of intelligence and passion always make for an astounding presentation. Andrej is the f*ing man.

  • @MuscleTeamOfficial
    @MuscleTeamOfficial 4 ปีที่แล้ว +10

    I saw the name and I had to click this, Andrej Karpathy is one of the greats.

  • @DogaOzgon
    @DogaOzgon 3 ปีที่แล้ว +2

    This was very insightful, would love to get a follow up!

  • @handsonlabssoftwareacademy594
    @handsonlabssoftwareacademy594 4 ปีที่แล้ว +1

    Pytorch configuration in Linux environment can take up to 3hrs esp if ur building it freshly from source. However, it's one of my favorite deep learning framework's besides keras an Tensorflow. Great presentation pls keep it up n coming.

  • @Peter8831
    @Peter8831 4 ปีที่แล้ว +7

    I've used Andrej's RNN techniques to do significant work in medicinal chemistry - great stuff!!!

    • @rangv733
      @rangv733 4 ปีที่แล้ว +1

      Hey there. For what have you used it ?

    • @safekidda46
      @safekidda46 4 ปีที่แล้ว

      Humble brag

    • @Peter8831
      @Peter8831 4 ปีที่แล้ว +2

      @@rangv733 So, far I've used RNNs for De Novo Drug generation, ie. create new potential drugs from scratch. You can read more here - www.wildcardconsulting.dk/teaching-computers-molecular-creativity/
      Another team, which uses a very similar technique provided a good visualization-
      github.com/MarcusOlivecrona/REINVENT/blob/master/images/celecoxib_analogues.gif

  • @lukelukelukeluke
    @lukelukelukeluke 4 ปีที่แล้ว +7

    No need to set the speed to 1.25x when Andrej is doing a presentation

  • @nicop6750
    @nicop6750 4 ปีที่แล้ว +96

    Tesla is winning autonomy folks. Watch the stock price over next 2 years. Should look like a falcon heavy launch accelerating into atmosphere.

    • @cestlavieeee
      @cestlavieeee 4 ปีที่แล้ว +9

      Yup. Bought in june at 199USD. Thought I was late. Now in november its 350USD.

    • @CreativeBuilds
      @CreativeBuilds 4 ปีที่แล้ว +3

      Falcon heavy? I think it'll look more like Starship 😎

    • @nicop6750
      @nicop6750 4 ปีที่แล้ว +1

      @@CreativeBuilds Damnit you're right! It's gonna be epic bro

    • @__ihexx__5654
      @__ihexx__5654 4 ปีที่แล้ว

      I dunno, waymo isn't sleeping either

    • @SweatySockGaming
      @SweatySockGaming 4 ปีที่แล้ว

      Bump

  • @maverick3069
    @maverick3069 4 ปีที่แล้ว +4

    Brings back 231N memories!

  • @alexissuazo3122
    @alexissuazo3122 3 ปีที่แล้ว

    Brilliant session, thanks for the info.

  • @1989arrvind
    @1989arrvind 10 หลายเดือนก่อน

    Andrej Karpathy exquisite technical explanation on Tesla Autopilot 👍👍👍

  • @reportingavenues1975
    @reportingavenues1975 ปีที่แล้ว

    Anytime I think I have got a hang of how to use deep learning, get to see a video like this.

  • @MrNightLifeLover
    @MrNightLifeLover 4 ปีที่แล้ว

    Are the slides available anywhere?

  • @sagarmeena0210
    @sagarmeena0210 4 ปีที่แล้ว +2

    Great Presentation

  • @saminchowdhury7995
    @saminchowdhury7995 4 ปีที่แล้ว +51

    great talk.
    try to give the photo of the speaker on the thumbnail.
    thanks

    • @PyTorch
      @PyTorch  4 ปีที่แล้ว +15

      Hi Samin. Thanks for the feedback, we'll be sure to pass it along to our team!

  • @ABC2007YT
    @ABC2007YT 4 ปีที่แล้ว

    Amazing design and engineering!

  • @adriancabrera4870
    @adriancabrera4870 4 ปีที่แล้ว

    Where can I find a written version of this?

  • @0xNameless
    @0xNameless 4 ปีที่แล้ว +11

    He's breaking my neural net with his speech speed...

  • @eaglesofmai
    @eaglesofmai 4 ปีที่แล้ว +10

    This guy should really be appretiated...using State-of-the-art algorithms directly into Production, its a big risk but also a big achievement...plus Tesla's approach is safer to Human eyes as certain LiDars can cause blindness.

  • @akarmdit2267
    @akarmdit2267 4 ปีที่แล้ว

    indeed the right kind of business path and beyond kudos to ALL

  • @kuaranir2440
    @kuaranir2440 2 ปีที่แล้ว

    I love PyTorch and Tesla

  • @CabrioDriving
    @CabrioDriving 4 ปีที่แล้ว +2

    Is it speed up 3 times ? ;)

  • @gitc13
    @gitc13 4 ปีที่แล้ว +51

    6 people who have disliked the video are from Waymo :)

  • @LarryPanozzo
    @LarryPanozzo 4 ปีที่แล้ว

    I learned about machine learning from Andrej in a video 4 years ago whoa

  • @runvnc208
    @runvnc208 4 ปีที่แล้ว +19

    In my opinion, this roll-out without having injuries or fatalities was one of the greatest engineering accomplishments of the last decade. However, there have been some dangerous near-misses with recent versions, and I left a comment on my other account under this video about one of them. The comment was removed. I think suppressing critical comments is dangerous and is an abuse of TH-cam's moderation system.

    • @cleberz8072
      @cleberz8072 4 ปีที่แล้ว +3

      The current generation of autopilot relies on the owner paying attention and be ready to intervene so I would like to know more about how these near-misses were so terrible.

    • @runvnc208
      @runvnc208 4 ปีที่แล้ว +5

      @@cleberz8072 th-cam.com/video/fKyUqZDYwrU/w-d-xo.html Its not that they are "so terrible". Its that its a bad idea to pretend they don't happen and hide comments about them. What I was asking was for Karpathy to address that particular near miss. One big thing is actually that the owner in the video says that he believes that Tesla will automatically receive a bug report. But I have a feeling there is actually not an automatic way for Tesla to know that this disengagement was a bug rather than a normal disengagement. So at the very least, there needs to be an easy way to report these "life on the line" bugs and all Tesla owners need to be properly informed about it if/when that exists.

    • @RichOrElse
      @RichOrElse 4 ปีที่แล้ว +2

      @@runvnc208 according to Elon during autopilot all driver input are considered errors, which is an automatic bug report.

    • @joythought
      @joythought 4 ปีที่แล้ว +2

      Losin TH-cam comments on random videos is not a conspiracy to silence you...

    • @runvnc208
      @runvnc208 4 ปีที่แล้ว

      @@RichOrElse There is nothing to distinguish driver input in a minor situation or from a driver that likes to give unnecessary input from a life-threatening situation.

  • @Mrwiseguy101690
    @Mrwiseguy101690 4 ปีที่แล้ว +23

    It's crazy how the human brain can perform the task so easily, yet state of the art computers and algorithms find it very difficult.

    • @HimanshuAroraa
      @HimanshuAroraa 4 ปีที่แล้ว +2

      That is kind of obvious. Technology is nothing in front of nature. This is why AI is so hyped up right now even though it is good only in a few very specific tasks.

    • @leonardselksnis4326
      @leonardselksnis4326 4 ปีที่แล้ว +13

      Just remember that vision had 543 million years to evolve. Computer vision algorithms are here only for 55 years. Also extremely impressive how far we have come in such a short time.

    • @TheZeeray
      @TheZeeray 4 ปีที่แล้ว +3

      This statement is true yet misleading, we can perform driving, but we don't do it well. We screw up often in many ways every time we drive a car, we all get some form of road rage and some point, we all have attention issues when it comes to driving, we're terrible at staying centered in our lanes and following the rules of the road. Driving is easy, but we're relatively bad at it. Autopilot however has none of these issues, it never gets tired, or needs a break, nor has road rage nor attention issues. It is trained and drives the car while having vastly better vision and situational awareness of the cars environment. All it takes is improving the software and convolutional neural networks as described in this video as well as a few others from Andrej

    • @JohnDoe-xo2yf
      @JohnDoe-xo2yf 4 ปีที่แล้ว +1

      @@TheZeeray and it can use the turning signal! Way ahead of my fellow drivers

    • @stevethompson210
      @stevethompson210 4 ปีที่แล้ว

      They used to say the same about adding big numbers. That day has long passed. Soon it will be the same with driving a car.

  • @anubhavanand6573
    @anubhavanand6573 4 ปีที่แล้ว +1

    At 8:26, when he says predictions can't regress, what does he mean ?,
    Any explanations/links ?

    • @ryman1
      @ryman1 4 ปีที่แล้ว +9

      Basically, when adding new functionality to the autopilot, it should be tested to ensure that the existing functionality doesn't break/get worse.

  • @matteovalenza
    @matteovalenza 4 ปีที่แล้ว +2

    wow well done !

  • @Saad-mh8rb
    @Saad-mh8rb 4 ปีที่แล้ว +2

    i am proud to be a python machine learner prodigy after seeing this video

  • @bidhanmajhi
    @bidhanmajhi 4 ปีที่แล้ว +4

    Is he talking really fast or there is a playback speed bug in TH-cam?

  • @Suro_One
    @Suro_One 4 ปีที่แล้ว +5

    Does Tesla train using their own hardware or in the cloud? And if so, how long does training take with these methods?

    • @DevonHensley211
      @DevonHensley211 4 ปีที่แล้ว +13

      They use GPU's (it was in presentation). And it takes a lot of time. 70.000+ GPU hours for full stack (1 nod with 8 GPU would take more then a year. My guess is they have many nods with lots of GPU's but not sure how many. If they would have 70.000 GPU's that means they can train full stack in 1 hour (70.000 GPU's x 1 hours = 70.000 GPU hours), but that would be huge super computer. You can put around 20-ish nods in one server rack (42U) so that means one rack would have around 160 GPU cards. In order to train this network in relative fast time, lets say you have 20 rack servers that would give you 20 racks x 160 GPU's = 3200 GPU's x 24 hours per day = 76.000 GPU hours. So every 24 hours they can train network again. Network = each time they want to train / upgrade network, they would need to wait 24 hours to see if new network is better then older one. In short, they use a lot of resources to make this work, and he also talked about Dojo project. Tesla Dojo is super powerful training computer that would replace GPU's (this is my best guess). It's dedicated hardware and Dojo can possible improve performance bt factor x10-20-ish so that would means that if they need now 24 hours to train full network, it would only take 2.4 hours. This will speed up things and they can test more variations and what not.

    • @ipconfigrenew
      @ipconfigrenew 4 ปีที่แล้ว +2

      In the past Tesla has used AWS to host a lot of their backend services. I know at one point it was reported that some AWS instances were mentioned being used for Autopilot training, but that was a few years ago and I haven't heard anything new since then. I suspect they are still using cloud services for now - with the plan being to move things to their new Dojo hardware once it's up and ready. They showed the development timeline for the FSD computer in their Investor Autonomy event, and if Dojo is following a similar timeline it will probably be up and ready in the next year which will line up nicely with their plans for ramping up autopilot capabilities (like the Taxi network).

    • @rkan2
      @rkan2 4 ปีที่แล้ว

      It seems they need about one nuclear plant's hourly production to train the network once... That is quite the electricity cost. (250W*70000 hours)

    • @DevonHensley211
      @DevonHensley211 4 ปีที่แล้ว +2

      @@rkan2 3200 GPU cards would be around 1 MW (give or take few, since you need to power servers as well, network hardware and everything in between, not just GPU's). 1 MW peak usage is a lot, but nuclear plants can do anywhere from 550-ish megawatts (MW) up to 4 GW-ish (that's 4000 MW). Even so 1 MW is huge amount of power usage and it makes sense for them to try to find better way to do the processing, ergo Dojo project.

    • @DevonHensley211
      @DevonHensley211 4 ปีที่แล้ว

      @@ipconfigrenew I honestly dont know if they using AWS or dedicated clusters, I was just doing math based on some simple numbers. I dont have any inside Tesla information :) I work in industry (servers, cloud etc) so I just run the basic numbers for fun! But for sure Dojo project could make a lot of difference for them if they can build it cost effective.

  • @PiduguSundeep
    @PiduguSundeep 4 ปีที่แล้ว

    Fascinating

  • @RishabhGKoenigseggRegera
    @RishabhGKoenigseggRegera 4 ปีที่แล้ว

    Is that Hwy 403 on the way to Hamilton?

  • @SaurabhGuptacurious
    @SaurabhGuptacurious 2 ปีที่แล้ว

    the number of knowledgeable folks in comments section is just overwhelming

  • @SourceCodeProjects
    @SourceCodeProjects 4 ปีที่แล้ว

    Nice tutorial, *Car pathy*

  • @Anonymous-nj2ow
    @Anonymous-nj2ow 4 ปีที่แล้ว +27

    this would be a dream job, damn working on AI at Tesla..

    • @gregh5061
      @gregh5061 3 ปีที่แล้ว

      The pay and work hours are crap though

    • @gametony947
      @gametony947 3 ปีที่แล้ว

      @@gregh5061 how?

    • @gregh5061
      @gregh5061 3 ปีที่แล้ว

      @@gametony947 ive had some developers talk about it to me. mechanical engineers and a few computer science professionals.

    • @pauljnellissery7096
      @pauljnellissery7096 3 ปีที่แล้ว

      @@gregh5061 what did they say

    • @gregh5061
      @gregh5061 3 ปีที่แล้ว

      @@pauljnellissery7096 that they're terrified of musk showing up at the office because he fires people on a whim, the work hours are way too long and the pay is way lesser than their counterparts in other companies like google, apple, microsoft etc, which have more flexible work hours and better pay ( and pretty much have the same hiring standards). Although working at tesla would look really good on my resume so i'd probably take it up if i had a chance lol

  • @vishwanath-ts
    @vishwanath-ts 4 ปีที่แล้ว +2

    PyTorch is really cool. 😎 😎 😎

  • @Eminosrrr
    @Eminosrrr 4 ปีที่แล้ว +8

    He is speaking like x1.5. So I reduced the playback speed to 0.75 and it makes more sense now.

  • @SAINIVEDH
    @SAINIVEDH 3 ปีที่แล้ว

    Whoa, epic !!!!

  • @dzerres
    @dzerres 4 ปีที่แล้ว +2

    I wish my Tesla had a "learn/train" button when I'm driving around and I KNOW that the car won't be able to handle the upcoming traffic circle, for example. I would hit "train" in advance and gather data for the next 5 minutes to be sent to Tesla for their database to watch and learn how I drove the car around the circle, dodged the incoming cars from the left and then from the right, and maneuvered the car over to take the correct exit. I was wondering if Tesla remembers or builds up a view from my car and other Teslas driving around that particular traffic circle? If not, why not?

    • @TheZeeray
      @TheZeeray 4 ปีที่แล้ว +1

      The cars are "learning" whether autopilot is engaged or not, every mile you drive is being recorded by the cameras for the company to fetch thousands of different specific events and study how the car behaves in those

    • @jumpingblue1623
      @jumpingblue1623 3 ปีที่แล้ว

      I thought there was a button exactly for this. You press it when your car didn't drive ideally.

  • @CHAITHANYAkitta
    @CHAITHANYAkitta 4 ปีที่แล้ว +8

    144 TOPS is 144 trillion operations per second! it is an astronomical figure that even nvidia doesnt have at that watt hours! it deserves title "insane". Imagine when you get a 300TOPS chip on a phone, laptop, watch, ipads! that is godly power..

    • @myRed8Rain
      @myRed8Rain 4 ปีที่แล้ว +5

      to check out instagram :D

    • @tiro0oO5
      @tiro0oO5 4 ปีที่แล้ว

      Jap, that is impressive

    • @TheLastCrankers
      @TheLastCrankers 4 ปีที่แล้ว

      I would like to see how they achieved that given NVidia's best accelerators right now are 47 TOPS while consuming 2,5 times more power. It's either a breakthrough or a lie.
      Edit: ah, nevermind, I was looking at 5 year old gpu accelerators. NVidia doesn't say how do modern cards do in int8 TOPS, but they have around 130 TFLOPS Tensor-wise

  • @BlackHermit
    @BlackHermit 3 ปีที่แล้ว +2

    PyTorch is quite good.

  • @mosesindecks
    @mosesindecks 4 ปีที่แล้ว +1

    Wow.

  • @Dr-Asim
    @Dr-Asim 4 ปีที่แล้ว +9

    There must something wrong with the editing of this video. Is the speed set higher? Had to set the video speed to 0.75 to watch and understand.

    • @JohnDoe-xo2yf
      @JohnDoe-xo2yf 4 ปีที่แล้ว

      I heard him in other videos, this is how he talks

  • @leixun
    @leixun 4 ปีที่แล้ว +12

    *My takeaways:*
    1. They use shared backbone network because if each task has its own neural network, the computation is not affordable 3:00
    2. Their inference hardware 9:00

  • @badrekb5175
    @badrekb5175 4 ปีที่แล้ว +6

    to understand this guy, i had to put the vid on 0.5 speed :P

  • @ivankuljismusic8895
    @ivankuljismusic8895 4 ปีที่แล้ว +2

    MAGICAL!!!!! Elon, Andrej, Pete Bannon, etc. etc. etc., OUR ONE WORLD LOVES YOU!
    See you on Mars___'One World 2.0'

  • @vancekang
    @vancekang 2 ปีที่แล้ว +1

    Tell me you are smart without telling me you are smart: I watch Andrej at 1.25x speed

  • @DanFrederiksen
    @DanFrederiksen 4 ปีที่แล้ว +5

    Is int8 good enough?

    • @Lord2225
      @Lord2225 4 ปีที่แล้ว +2

      Yes. While high accuracy is required during training, on prediction you can round the calculation to 16 bits or even 8.

    • @DanFrederiksen
      @DanFrederiksen 4 ปีที่แล้ว

      @@Lord2225 I wonder if there is a point in that if a classifier net relies on finer precision than 8bit that it's too fragile. Maybe sigmoid invites fine balances and some things need threshold.

    • @Lord2225
      @Lord2225 4 ปีที่แล้ว +2

      ​@@DanFrederiksenIt makes sense. The average activation of (neurons or layers) is on close to zero and there is no large standard deviation. even using better functions than sigmoid (elu, relu). In general, you can do 8 bit multiplication and 16 bit sum if someone is worried about a problem.
      heartbeat.fritz.ai/8-bit-quantization-and-tensorflow-lite-speeding-up-mobile-inference-with-low-precision-a882dfcafbbd ~ you can get better results by comparing time
      petewarden.com/2016/05/03/how-to-quantize-neural-networks-with-tensorflow/ ~ tricks removing problems with low precision.

    • @Lord2225
      @Lord2225 4 ปีที่แล้ว

      Only in bad models weights explode to huge numbers.

    • @eugenedsky3264
      @eugenedsky3264 4 ปีที่แล้ว

      @@Lord2225 Also there are other ways to compress learned data: twitter _ com/NENENENENE10/status/1151530562844332033

  • @michaellidster1389
    @michaellidster1389 4 ปีที่แล้ว +33

    Runs away after finishing his talk

    • @youtubehelge5049
      @youtubehelge5049 4 ปีที่แล้ว +1

      These are probably lightning talks.

    • @Allumik
      @Allumik 4 ปีที่แล้ว +4

      It is just to keep up with the talking speed.

    • @AnyFactor
      @AnyFactor 3 ปีที่แล้ว

      Doesn't want to answer questions

  • @mmanuel6874
    @mmanuel6874 4 ปีที่แล้ว +9

    So pytorch >> tensorflow?

  • @abdoulayediallo3777
    @abdoulayediallo3777 4 ปีที่แล้ว

    And what about if you use tensorflow.

    • @jumpingblue1623
      @jumpingblue1623 3 ปีที่แล้ว

      It would probably run too hot on nvidia.

  • @nycandre
    @nycandre 4 ปีที่แล้ว

    Any thoughts about using extra data, like from V2X /V2M sources? It would be like cheating, I know, BUT why not use what is available to train the NN even faster? I would imagine even adding V2X /V2M hardware in large cities like New York, LA, San Francisco might be cost effective.

  • @lauriekane772
    @lauriekane772 4 ปีที่แล้ว +2

    Tesla paygrade based on the number of times you can drop "order of magnitude" into your presentations. Paygrade escalation rate is of course an order of magnitude greater than order-of-magnitude-isms / hour * base-pay-rate

  • @kesavae9552
    @kesavae9552 3 ปีที่แล้ว +1

    Watch it at 0.75x

  • @MrKeilstrup
    @MrKeilstrup 3 ปีที่แล้ว

    The nerdiest presentation ever! Love it

  • @starshipcaptain4753
    @starshipcaptain4753 3 ปีที่แล้ว

    Another reason why Tesla will completely dominate, glad he is on our side.

  • @DougGrinbergs
    @DougGrinbergs 4 ปีที่แล้ว

    9:01 FSD computer discussion

  • @TaylorAlexander
    @TaylorAlexander 4 ปีที่แล้ว

    Does anyone have any papers or examples of “hydra nets” like this? I want to implement a system with a few hydra heads.

    • @akhilkatpally4188
      @akhilkatpally4188 4 ปีที่แล้ว +2

      I guess they have came up with that term. Look for Feature Pyramid Networks, concept seems to be same.

  • @user-tr9yx2qk5m
    @user-tr9yx2qk5m 3 ปีที่แล้ว

    FUCKING AWESOME

  • @user-hs5qf2ij1m
    @user-hs5qf2ij1m 4 ปีที่แล้ว

    Visualization about Recurrent Network can be referred here: vision.stanford.edu/pdf/KarpathyICLR2016.pdf

  • @zoemayne
    @zoemayne 3 ปีที่แล้ว +1

    This is probably the first time i've had to slow a video down to 0.75 instead of speed it up

  • @tanyouliang
    @tanyouliang 4 ปีที่แล้ว +4

    The end game: Operation Vacation

  • @murtazanazir9997
    @murtazanazir9997 4 ปีที่แล้ว +13

    Who's here after lecun roasted Elon?

  • @josy26
    @josy26 4 ปีที่แล้ว +1

    8:27 what does he mean by "make sure that none of this 1000 predictions that we make, can regress"?

    • @jacobholloway7653
      @jacobholloway7653 4 ปีที่แล้ว +7

      There are 1000 things the full network is trying to do (such as label curbs, lights, other cars, is the car going to cut me off, etc.)
      Each of these 1000 things will have their own accurcy across their test data (labeling a light a light and not labeling a stop sign a traffic light, etc).
      When you regress, you are losing accuracy. So they might be at 99% accuracy in labeling stoplights, but as they train to recognize curbs for the Smart Summon feature, the network might forget something about recognizing stoplights, and that 1 prediction now has regressed to 98.5% accuracy.
      They want to make sure they gather more information into the network without losing anything they previously learned.

    • @josy26
      @josy26 4 ปีที่แล้ว

      @@jacobholloway7653 Thanks Jacob! That makes sense. In this context, regress = loosing accuracy.
      I looked up the definition just to add: return to a former or less developed state.

  • @Takeonm
    @Takeonm 3 ปีที่แล้ว

    Super Interesting. If you follow his speech by mimicking it in your own thought speech mind it’s actually easy to keep up with him while taking all of this information in. Great technique

  • @AmCanTech
    @AmCanTech 4 ปีที่แล้ว

    Amazing technology, apple also uses python a lot for their ml projects.

  • @user-ke2ug1dk2s
    @user-ke2ug1dk2s 4 ปีที่แล้ว +7

    Look where our badmephisto is now

    • @petko4733
      @petko4733 4 ปีที่แล้ว

      Wait... This is badmephisto?

    • @mirguitarist6656
      @mirguitarist6656 4 ปีที่แล้ว

      @@petko4733 yaa, he is the one that teach us on tubing, can watch him at his old tube 'badmephisto'

  • @yosanmelese2094
    @yosanmelese2094 4 ปีที่แล้ว

    1:28 "we dont use lidar...evrything comes from the 8 cameras" if i am right, he seems to be saying the autopilot uses only cameras as input. am i interpreting this right? because it conflicts with other information i have abt the sensors that autopilot uses like radar. what am i missing?

    • @V4ker
      @V4ker 4 ปีที่แล้ว

      Other companies do use lidar, but Tesla only has forward radar, which is capable of scanning ~160-170m in front of the vehicle depending on h/w version. Not sure why it wasn't mentioned here tho

  • @Level6
    @Level6 3 ปีที่แล้ว +1

    1. 브스스 + 브오스 + 뎁스 구현
    2. OTA 구현
    3. 운전자 정보로 검증 shadow mode 구현

  • @alanzom1503
    @alanzom1503 3 ปีที่แล้ว +1

    I usually watch at x1.25 or even x1.5 but I had to watch x0.75 for this

  • @ka9dgx
    @ka9dgx 4 ปีที่แล้ว +2

    I'm interested in this subject, but something is wrong with the compression... the audio sounds like it's been fed through a time compression algorithm.... makes it unwatchable.

    • @Findalfen
      @Findalfen 4 ปีที่แล้ว +1

      Unfortunately no. It looks like he just speaks that fast. But possibly the video editing made it worse, I don't know.

    • @MoeSalih
      @MoeSalih 4 ปีที่แล้ว +2

      Try watching at 0.75x speed. Might help

    • @bidhanmajhi
      @bidhanmajhi 4 ปีที่แล้ว

      0.75 x

  • @StEvUgnIn
    @StEvUgnIn 2 ปีที่แล้ว

    *Switch speed from normal to 0.75*

  • @rkan2
    @rkan2 4 ปีที่แล้ว +1

    It seems they need about one nuclear plant's hourly production to train the network once... That is quite the electricity cost. (250W*70000 hours)

    • @cleberz8072
      @cleberz8072 4 ปีที่แล้ว +6

      if that was done in a day, the math has it would require 17MWh. According to the EIA website the smallest American nuclear plant (R.E. Ginna) with a 584MW capacity would generate over 13GWh in 24hours so seems like you're off by a few zeros. 17MWh is the equivalent to 170 Model S P100D batteries and the nuclear power plant would be able to supercharge them all in 4h using only 4.5MW, which is less than 1% of such power plant capacity.
      That said, they probably use way less than that thanks to the fact the multitasking he refers to in the 48GPU system likely parallelizes tasks heavily so the max power they'll need at a given time is 250W*48=12kW which is the equivalent of a basic Tesla Solar Roof.

  • @vikaskrishnan4018
    @vikaskrishnan4018 4 ปีที่แล้ว +6

    He speaks so fast, I thought my video speed was 1.5x

  • @prashantgupta6885
    @prashantgupta6885 3 ปีที่แล้ว

    When your brain is faster than normal...you speak faster than normal

  • @ahh8895
    @ahh8895 2 ปีที่แล้ว

    I'm from twitter

  • @nanosmart2553
    @nanosmart2553 4 ปีที่แล้ว

    @schräg schau dir das mal an als Autopilot-Tester ;)

  • @dakshbhatnagar
    @dakshbhatnagar 9 หลายเดือนก่อน

    Wondering how many cars got smashed in the process of actually getting this to work initially.

  • @MarushDenchev
    @MarushDenchev 4 ปีที่แล้ว +2

    Here is a job I can never get ... head of AI at Tesla

  • @HariKishanAlluri
    @HariKishanAlluri 4 ปีที่แล้ว +6

    Hail Hydra!! 😂

  • @rashmiabbigeri3268
    @rashmiabbigeri3268 4 ปีที่แล้ว

    The fact that Elon Musk companies manufacture their own components is how they can price their products a little bit lower and also have control and provide quality. Ex : SpaceX

  • @aimatters5600
    @aimatters5600 ปีที่แล้ว

    never knew pytorch is used in tesla

  • @NRV44
    @NRV44 4 ปีที่แล้ว

    Please make Sentry Mode on CLOUD. Tesla can make money and owners can subscribe

  • @k.chriscaldwell4141
    @k.chriscaldwell4141 3 ปีที่แล้ว

    Now do one called _Rent-Seeking at Tesla._

  • @hole62
    @hole62 ปีที่แล้ว +1

    I am humbled and beyond thankful to the Andrej Karpathy, the Ai team, and Elon Musk for providing the service of uploading human consciousness into electronics such as Teslas. #ForeverGrateful

    • @akzsh
      @akzsh 6 หลายเดือนก่อน

      ???

  • @harryandriyan6430
    @harryandriyan6430 4 ปีที่แล้ว

    playback_speed = 0.75

  • @alluprasad5976
    @alluprasad5976 3 ปีที่แล้ว

    I thought Andrej Karpathy is 70+ years old person !

  • @tacitlytesla7912
    @tacitlytesla7912 4 ปีที่แล้ว

    Andrej sounds so nervous

  • @ondrejtkadlec6807
    @ondrejtkadlec6807 4 ปีที่แล้ว +1

    this guy has been watching too much 0.75 x speed videos on TH-cam