Let's reproduce GPT-2 (124M)

Andrej Karpathy

มุมมอง 536 633

เพิ่มลงใน
- เพลย์ลิสต์ของฉัน
- ดูภายหลัง
แชร์

แชร์

ฝัง

ขนาดวิดีโอ:

แสดงแผงควบคุมโปรแกรมเล่น

เล่นอัตโนมัติ

เล่นใหม่

เผยแพร่เมื่อ 28 ก.ย. 2024

ความคิดเห็น • 958

@kiw2535 3 หลายเดือนก่อน ⁺⁸¹¹
It’s rare to find such high-quality, free resources that make complex topics accessible and engaging!
@xspydazx 3 หลายเดือนก่อน
i think you will find out that some people know that they wil be taking this technology away from the public ....... same as google glass .... as soon as america find out they cannot sread dissemination of knowledge they will cuts all internet cables !
@Handlebrake2 3 หลายเดือนก่อน ⁺³⁰
Jesus - that's some tip!
@bbcc2960 3 หลายเดือนก่อน ⁺¹
❤
@unclecode 3 หลายเดือนก่อน ⁺¹⁰
@@Handlebrake2 :)) Jesus - That's some 4hrs of brilliant content!
@fallingexistence 2 หลายเดือนก่อน ⁺¹²
Dude this guy's net worth is $50M, you could've bought yourself 10 burritos at chipotle.
@carlosgermosen1103 3 หลายเดือนก่อน ⁺¹²⁸¹
Do not ever look at how long your videos are. Your content is perfect and you should keep explaining things step by step. You are doing a great job. I believe you will be remembered in history as one of the pillars of AI.
@pin65371 3 หลายเดือนก่อน ⁺¹⁴
I think one cool thing with videos like this is once Google implements their AI into youtube anyone will be able to watch this and just start asking questions. I've been learning a lot by watching videos like this and just copying parts of the transcript into ChatGPT to ask questions when I dont understand something.
@sohambit9393 3 หลายเดือนก่อน ⁺¹⁰
I was excited by how long it was instead 😂😂
@quackwilliams5933 3 หลายเดือนก่อน ⁺²
@@pin65371 jesus christ what a thought...and Andrej just starts talking back to you, answering your exact questions...
@barackobama4552 3 หลายเดือนก่อน
@@pin65371 What other videos do you recommend that have helped you?
@xspydazx 3 หลายเดือนก่อน ⁺³
Yes a great sharer of the important information and implementation. As well as to let you know it's in your hands to make your own before the internet is closed down or restricted in your areas or your cable gets cut !!
Great work ❤
@neilamrathod122 3 หลายเดือนก่อน ⁺³²⁸
Sorry , i love your videos and what you doing for me. I couldn't attend Stanford or get into openai but learning from you is blessing to me.. i would pay you back 100times in coming years. And i was watching your git repository last two months, i could see many git code push in private ,but i was confused what he is working on.. this is he was working on. To provide quality pratical knowledge to us all on youtube.
@shutup1209 3 หลายเดือนก่อน
Hey, the webside for the GPT2 is down, is there anyway to dowload it ?
@neilamrathod122 3 หลายเดือนก่อน
Sorry i will have look up to it ..i will do today night will reply to you after that @@shutup1209
@DingLi-hw4ul 3 หลายเดือนก่อน
hf@@shutup1209
@hobbytan6841 3 หลายเดือนก่อน
@@shutup1209 you should follow the video and make it from scratch! 😀
@dinorossi6611 3 หลายเดือนก่อน
Those are generous tips :). I wanna learn basics so I can understand this first.
@Doggi2dog 3 หลายเดือนก่อน ⁺¹¹⁶⁸
My life is simple;
Andrej drops GPT-2 The Movie, I watch.
@AndrejKarpathy 3 หลายเดือนก่อน ⁺⁴⁷¹
"GPT-2 The Movie" 😅
@DamianReloaded 3 หลายเดือนก่อน ⁺¹⁶
The movie and the sequel. I had to force myself to stop watching after I realized an hour had passed.
@asatorftw 3 หลายเดือนก่อน ⁺¹⁶
@@AndrejKarpathy The sequel "GPT The Movie" will be a old Hong Kong style "martial arts" movie about GPT getting beaten up by the Loss Function, then entering his training phase with Gradient Descent Sensei and the final showdown vs the big evaluation boss.
@georgiandanciu3482 3 หลายเดือนก่อน ⁺²
You know you gotta bring the popcorn
@AbhigyanKeshav169 3 หลายเดือนก่อน ⁺¹
Andrej poster video on your biography@@AndrejKarpathy
@unclecode 3 หลายเดือนก่อน ⁺⁵⁵
Thanks! 4 hours of decoding a "Decoder-Transformer", Kudos and appreciate your existence in this field.
@chenmarkson7413 3 หลายเดือนก่อน ⁺¹⁸
I am an undergraduate student. This is the lost lecture that professors never touched upon but absolutely crucial, thank you!!
I especially love how you start from the basics for so many notions, and I really learned a lot.
@ppyogesh7394 3 หลายเดือนก่อน
Which year in you are and which country
@chenmarkson7413 3 หลายเดือนก่อน
@@ppyogesh7394 I am at the University of Toronto, going to the third year this September
@waytolegacy 3 หลายเดือนก่อน ⁺²¹
This guy is "the one" in the industry, who has helped me understand the LLMs. I respectfully love this man. Hats off.
@Khobalt664 3 หลายเดือนก่อน ⁺¹⁹
You are the Excalibur of cutting through the hype. Thank you so much. Your ethics are inspiring, and your educational materials priceless.
@themenon 2 หลายเดือนก่อน ⁺⁶
Many Thanks to Andrej for making this tutorial available to everyone! I have never seen a clearer explanation of a nn before stumbling upon this zero to hero series. This will help all the people articulate the inner workings of neural net and help people understand deeper concepts, that is hard to understand. Looking forward to learning more with Andrej!
@dylan_curious 3 หลายเดือนก่อน ⁺¹⁵
I like when you add comments/metaphors about your intuition for how and why it works. Thanks you.
@pxbroccoli 3 หลายเดือนก่อน
Checkout this man here, he got the best Ai news
@tanaysood 3 หลายเดือนก่อน ⁺³⁶
Thanks AK, appreciate you sharing your knowledge with the world!
@C0D3633K 3 หลายเดือนก่อน ⁺²⁰³
Andrej is doing himself what OpenAi was supposed to do in the early days - make AI open. Thank you, Andrej!
@rainwang77 3 หลายเดือนก่อน ⁺²⁸
Hello Andrej, thank you so much for the sharing and effort! Really appreciate it!
@JT-mr3db 3 หลายเดือนก่อน ⁺¹³
The intellectual generosity of this man is of the highest standard.
@qiuchenguo2788 3 หลายเดือนก่อน ⁺⁴
Simply the best deep learning and LLM series online! Please keep making more videos and I'd love to be part of the journey!
@somdubey5436 3 หลายเดือนก่อน ⁺¹⁵
The longer your videos are, the better it is for humanity. I think you are such a wonderful person and providing this stuff for everyone for free, can't thank you enough.
@forrestye2194 3 หลายเดือนก่อน ⁺¹³
Finally, finished watching such a long video. Thank Andrej for sharing so many details of your knowledge. Like your teaching style so much since Tesla AI day. You are the best AI teacher!
@lorenzos-g9o 17 วันที่ผ่านมา ⁺²
Thanks Andrej! Tons of stuff in the video explained in simple terms, I learned a lot from it.
@gromeronaranjo 25 วันที่ผ่านมา ⁺¹
I rarely comment on videos, but I had to here I had to. Your in-depth high-quaity resources are something to talk about. You make very complicated topics easy and engaging, your provide the knowledge for anyone to learn these highly-regarded concepts. Fruthermore, you are truly advancing the general knowledge of the public by providing these powerful videos. I would just like to express my gratitude for your videos, and how they really are making a positive impact. Thank you for dedicating many hours of work to upload these videos.
@jstello 3 หลายเดือนก่อน ⁺⁶
Haven't been this excited about a TH-cam video since makemore! Your videos are like an antidepressant. Such a joy to watch and follow and completely send contained. It's like having Mozart explain his art note by note
@Themojii 3 หลายเดือนก่อน ⁺¹⁶
I've learned a lot from your Neural Network video playlist. Thank you
@souravzzz 3 หลายเดือนก่อน ⁺¹⁴
🤗What an absolutely fantastic explanation! Every minute is filled with nuggets of deep insights!
@Ip_man22 2 หลายเดือนก่อน ⁺⁴
Thanks! Really appreciate the effort you put into making these high quality educational videos!
@Alex-qz4nk 3 หลายเดือนก่อน ⁺³⁸
That’s cool how Andrej explains right after releasing code
@XuanThao23 10 วันที่ผ่านมา
The fact that this is free is so incredible. A perfect content. For those looking for something industry-specific, Immersive Translate now allows you to customize your own AI expert, it also allows translations in the technology field become more accurate and professional.
@frodo114 3 หลายเดือนก่อน ⁺³
Hi Andrej, just wanted to thank you. You are a truly inspiration. Thanks for all the effort you put in this videos and all the tremendous value they offer when being publicly spread
@IY-0219 2 หลายเดือนก่อน ⁺¹
Thank you for doing this Andrej❤ As an undergraduate student I really appreciate having access to such incredible contents. Best of luck to your startup! Also looking forward to some computer vision related videos.
@SpenserFL 3 หลายเดือนก่อน ⁺¹¹
Thanks very much Andrej! Your videos are real gifts to the whole world.
@zeweichu550 2 หลายเดือนก่อน ⁺¹
This is an unbelievably high quality lecture! I always learn a ton of new things from Andrej Karpathy. Actually I believe if I have to rank the amount of knowledge I learned from a single person, Andrej would easily rank as #1.
@IgorTsvetkov 3 หลายเดือนก่อน ⁺⁹
Thanks for your Zero-to-hero series!
@AndrejKarpathy 2 หลายเดือนก่อน
wow you're very thankful ty! :)
@Jonathan-ru9zl 3 หลายเดือนก่อน ⁺⁶
We are living in great times, where geniuses like Karpathy offers their invaluable knowledge for free, and people are rewarding him with the sum of money they can afford 🎉
@coolarun283 3 หลายเดือนก่อน ⁺⁹
To anyone looking for the possible cause of the error in the parameter count: It is due to the vocabulary size. In GPT-1 it was around 40000, whereas in GPT-2 the vocab_size is around 50000. So, with 40K we will get 117M and with 50K we will get 124M.
@saurabhchalke 3 หลายเดือนก่อน ⁺¹
Thank you ser, this is priceless. Felt sad that it had to end at some point. Please cover more topics like mech interp, fine-tuning, mixture models, etc.
@hengry2 3 หลายเดือนก่อน ⁺³
You are the reason I got interested in neural networks, thank you for being a great teacher.
@veluvishwa6915 3 หลายเดือนก่อน
Hii bro, can i get roadmap for ML an deep learning please
@SaulRamirez-x6e 3 หลายเดือนก่อน
This is one of the best overviews I've seen not just on LLMs, but on the entire Deep Learning process. Thank you for going into so much detail, you're expertise really shows through your explanations.
Would I watch another 4 hour video from you? Absolutely, any day!
@nchahine 3 หลายเดือนก่อน ⁺⁶
Having to work when you just want to watch Andrej's videos is like being invited to an open buffet but you're on a diet :)
@andreyashgaliev9372 3 หลายเดือนก่อน ⁺¹
Currently, I'm just watching your videos. They makes me calm and happy. Hope to continue studying later this year.
@KapilSharma-lt4gm 2 หลายเดือนก่อน
Thanks for this incredible resource.
For anyone wondering about the transposes in the parameter copying from HF GPT2 model to implemented one.
HF model uses nn.Conv1d for qkv projection while Andrej uses nn.Linear. The weights dimensions in Conv1d are transposed. Hence, we need to transpose some of these weights before copying them over to Andrej's model.
@davidlyng2485 2 หลายเดือนก่อน
This video is absolutely brilliant! Thank you so much Andrej for taking the time to share your knowledge with us!
@nickbrooks5684 3 หลายเดือนก่อน ⁺³
Thank you for contributing to Open Source models! And not just open weights!
@AIForHumansShow 3 หลายเดือนก่อน
So thrilled to have you making stuff on here. It's the best version of what TH-cam can be.
@tijm6140 3 หลายเดือนก่อน ⁺¹
Thanks for the video. I like your intuition for weight decay. Since the decay is proportional to the value, it encourages the contributions to the residual stream to be spread over more neurons.
@hipotures 3 หลายเดือนก่อน ⁺⁶
Thanks for sharing your knowledge!
@CarlosReyes-ku6ub 3 หลายเดือนก่อน ⁺¹
Kind remainder that GOOD videos are NEVER too long
@rohollahhosseyni8564 3 หลายเดือนก่อน
I just watched your video about 'Let's Build GPT from Scratch' yesterday. You are a great teacher and clearly explain complicated concepts. Thanks!
@rolandrobertsons3069 หลายเดือนก่อน
Thank you andrej! I have watch all your videos about gpt and learn a lot! As a poor college student, It's your videos that leading me to the road of llm.
@김화겸-y6e หลายเดือนก่อน
I've never seen and experienced like you teaching me making me think i can learn everything with your teaching
@mohammedjaddoa9783 3 หลายเดือนก่อน ⁺²
your explanation is really amazing, please keep fulfilling the gap >>>> build things from scratch
@zendr0 3 หลายเดือนก่อน ⁺¹
Huge respect for Andrej🤗. Sharing knowledge for free is incredible.
@Issam0hm 3 หลายเดือนก่อน ⁺¹⁶
Another piece of art 🔥
@aliyovic10 3 หลายเดือนก่อน
You made me a bit emotional, knowing how much impact your videos will have for some people who will use this knowledge to make a living and improve their lives and the lives of those around them...Thank you!
@chuckchen 3 หลายเดือนก่อน ⁺¹
Another epic tutorial to build models from scratch. Thank you, Andrey!
@r0f115L4m 3 หลายเดือนก่อน
I’m so thankful and grateful that these videos are available to view for free. Thank you Andrej!
@nitinnilesh 2 หลายเดือนก่อน
The whole optimisation part in this video is something incredible. It is just impossible to find out these optimisation techniques on internet for DL models. Andrej doesn't have much research papers, but I believe that each one his videos is equivalent to a research paper having equal impact as of the original transformer paper.
@WannabeALU 3 หลายเดือนก่อน
I hope people realise how impactful what you are doing really is. This channel, this level of content, is empowering tens of thousands of smart, motivated, people in changing the World. Thank you.
@IrisSees 3 หลายเดือนก่อน ⁺⁶
Thanks Andrej!
@burakkurt3027 2 หลายเดือนก่อน
how is it Andrej, being one of those legends that will always be remembered? Your art will be viewed by generations!
@colinzhou9560 3 หลายเดือนก่อน ⁺⁶
OMG a 4hr movie!
@smitshukla6077 3 หลายเดือนก่อน
As a student, you have been my biggest inspiration and the best mentor in the field of NLP and Computer vision. Will forever be grateful!
@webgpu 3 หลายเดือนก่อน ⁺³
YOU are Awesome, Andrej!! 🥂🤖
@shristikedia9983 2 หลายเดือนก่อน
Thanks for the detailed video and explanation Andrej, have really learnt a lot watching your videos and the Makemore series!
@PopescuAlexandruCristian 16 วันที่ผ่านมา
This is the best learning resource on language models bar none.
@tempestuousfabe 3 หลายเดือนก่อน ⁺²
Love your content, thanks!
@antmantan 2 หลายเดือนก่อน
This is amazing! I've learned so much practical knowledge about how to build these models and it's helped me for my Machine Learning Engineer interview
@nothing_is_real_0000 3 หลายเดือนก่อน
@Andrej Karpathy, Awesome! You don't know how much this means to so many people around world! Thank you so much! You are our hero!
@aureliencobb199 3 หลายเดือนก่อน
Towards the end I thought, what, is that it? But many thanks for the effort and for bringing the material in such a clear and cohesive way.
@bobC-f5x 3 หลายเดือนก่อน ⁺³
Hi Andrej, what's the difference between this one and your "Let's build GPT" video? Which one should one learn first/which one is preferred?
@muhammadharris4470 3 หลายเดือนก่อน
Was wondering the same 😅
@hengry2 3 หลายเดือนก่อน
Use the "lets build" first, then this one; it goes over the understanding of it first, like the tokenization one as well.
@shairuno 3 หลายเดือนก่อน
When I see this video, I know I need to make time for this.
There is a huge difference between watching someone work out and workout by myself !
@user-yw5me7pb2x 3 หลายเดือนก่อน ⁺⁴
the GOAT has returned!
@EpicGamer-ux1tu 3 หลายเดือนก่อน
I love you andrej, this video is incredibly useful, because you show many many different parts of the model training phase in high quality and explain everything really well. Personally I've learned so much from just the first half of the video. Thank you so much and be well ❤
@BunyaminCIFTCI-c6i หลายเดือนก่อน ⁺³
if i know everthing taught in that tutorial with details and i am also able to apply them by myself, can i count myself as an advanced AI developer?
@allvods1385 18 นาทีที่ผ่านมา
no, you are like a beginner+. all the data handling and data cleaning that is necessary + bug fixing u have not had to deal with
@SLAM2977 3 หลายเดือนก่อน ⁺¹
Andrej releasing such unique super high quality content and for free, I am speechless.
@riverland0072 3 หลายเดือนก่อน ⁺³
Thanks!
@natebrake4114 3 หลายเดือนก่อน ⁺¹
Thank you Andrej for the lecture, enjoyed every minute of it! I especially found the discussion about torch compile to be helpful and interesting. I had been doing some experiments on how to speed up Mistral 7B inference in huggingface and was not seeing any improvement from torch compile. This is motivating for me to go back and try to understand what might be going wrong 😅. Thanks!
@GiuseppeRomagnuolo 2 หลายเดือนก่อน ⁺¹
thank you for yet another amazing video Andrej!
@akarshrastogi3682 3 หลายเดือนก่อน
Never stop. Keep publishing videos like these please. What a delight.
@denisroghelia 3 หลายเดือนก่อน
Thanks Andrej, your upload always gives me some motivation to study and understand these topics properly, in addition you have very amazing mentor skills, it's always a pleasure to see a new Andrej Karpathy upload, I appreciate all these lectures, thank you very much for all of them.
@ExploringandCoding 3 หลายเดือนก่อน
Videos like these is the reason, I am here at YT
and rarely leave comment
Thanks Andrej for this! Massive respect.
@flikwonda 3 หลายเดือนก่อน
Absolute gem of a video. Although one thing which became pretty evident to me whilst watching this is that if Andrej doesn't even understand some of the PyTorch docs and internals, that when you know the library is a bit of a clusterfuck and just shows how novel a lot of this new AI technology is.
@lorenzoleongutierrez7927 3 หลายเดือนก่อน
Simply I can't believe how generous this man is...saludos gran Andrej !, you are the best human being of your generation. !
@riochuong105 3 หลายเดือนก่อน
legend !!! all I can do is buy you a thank to show my appreciation. your videos changes the way I learn deep learning
@MichaelKleyn 3 หลายเดือนก่อน ⁺³
Legend
@ShadKhan 2 หลายเดือนก่อน ⁺¹
Thanks. Eagerly waiting for your LLM 101 course🤟🤟
@satyamgupta3456 3 หลายเดือนก่อน
Thanks for making such a quality-content available to the every corners of the world!!!
@adityagulati1540 20 วันที่ผ่านมา
Andrej Karpathy is like the Roger Federer of AI - makes the hardest stuff look easy!
@bald_agent_smith 3 หลายเดือนก่อน
That's just a ton of insights and tips for anyone who getting into LLMs.Thank you for your work Andrej!
@hoz85 3 หลายเดือนก่อน
Your videos are just perfect for those who wants to go deeper in this field. Are there any other guys like you in yt?
@shuozhang429 3 หลายเดือนก่อน
You're saving my master's thesis, I have to pay back some! 😂Thank you for doing the 'open' part of 'open'AI, it will help a lot of people!
@eliahmbwilo1312 3 หลายเดือนก่อน ⁺¹
Hi Andrej, Thank you so much for these videos. LLMs would be a black box without them. By any chance when you find some time please also include in this series the RLHF part of model fine tuning.
Thanks 🙏
@ZeParagon 3 หลายเดือนก่อน
Here in the first hour of the video release. Came here while I was studying for my Security+ Exam; was looking into the Sandbox environment topic, asked ChatGPT, for set-up possibilities, realized it was down, went to Twitter and I see this video.
3 หลายเดือนก่อน
This video can open a new era and close an old one for many topics. Thanks, dude.🤗
@amitabhachakraborty497 3 หลายเดือนก่อน
Learning from you is blessings to me. Thank you sir. Please upload such contents more.
@04maj 2 หลายเดือนก่อน
The most impressive thing about Andrej is that unlike the rest of us, with every minute of coding he looks more refreshed and clean shaved 😂 Joking, of course this is the second most impressive thing about him!
@varunjain8981 3 หลายเดือนก่อน
Thanks from the bottom of my heart. It is amazing to see such a high quality content.
@1ShoopManySheep 3 หลายเดือนก่อน ⁺¹
This was awesome! Was glued to the screen for the whole video.
Could you implement RLHF? I've never heard any in-depth explanation of it.
@TheLokiGT 3 หลายเดือนก่อน
Andrej, we sorely miss the little bunch of flowers from the early, glorious days of NN-zero-to-hero..
@minh-dungdang7388 หลายเดือนก่อน
Very in details, but with the highest level of overview. "Machine Learning", "Deep Learning Specialization" on Coursera of Andrew Ng. and this series zero-2-hero of Andrej are really special.
@mdeasy 3 หลายเดือนก่อน
Andrej man, thank you, really appreciate you gifting this to the world. Great content and presentation!!!
@taido4883 3 หลายเดือนก่อน
Thanks Andrej for the incredible video. Actually all your videos are incredible. I have learnt a ton!!!
@azizmugayel3025 22 วันที่ผ่านมา
Thanks Andrej for this awesome knowledge

ต่อไป

เล่นอัตโนมัติ

[1hr Talk] Intro to Large Language Models