This is a really high quality video, on par with 2 minute papers but with a more detail oriented approach. Also you have a lovable vibe king, keep it up
I used to love 2 Minute Papers. But it's become very repetitive now, and just too fluffy. Probably I'm not in the target audience anymore.
I absolutely hate 2 minute papers. It's all hype and no substance. I physically cringe every time I hear the guy say "now hold onto your papers everybody! this is gonna be crazy!" and then he tells you the most boring anti-climactic shit possible.
Yeah, but how come your stinky doo doo though…
@@herpderp728 Yeah, but how come your stinky doo doo though…
edan bro makes my dopamine policy gradients high every time. fingers crossed we get open RL foundation models.
Thanks for your videos, but at 7:44, EfficientZero and MuZero do not reconstruct the raw observation/image. MuZero learns its latent representation based on value equivalence only, while EfficientZero also cares about temporal consistency, so it takes the next observation to supervise the representation and dynamics parts of the model in a self-supervised manner (SimSiam).
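Schematically, the temporal-consistency objective described above looks like a SimSiam-style cosine loss between the dynamics model's predicted next latent and the encoder's embedding of the real next observation. A minimal NumPy sketch (all shapes and names are hypothetical, random vectors stand in for the network outputs):

```python
import numpy as np

def cosine_sim(a, b):
    """Cosine similarity between two vectors."""
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))

# EfficientZero-style consistency target, schematically: the dynamics
# model's predicted next latent is pulled toward the encoder's embedding
# of the actual next observation, with a stop-gradient on the target
# branch (SimSiam-style, so the representation cannot collapse trivially).
rng = np.random.default_rng(0)
z_pred = rng.normal(size=8)    # predictor output on the dynamics branch
z_target = rng.normal(size=8)  # stands in for stop_grad(encoder(next_obs))
loss = -cosine_sim(z_pred, z_target)  # minimize negative cosine similarity
```

This is only the shape of the loss, not the actual EfficientZero architecture, which adds projector/predictor heads on both branches.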
Another way to frame the problem of neural network representations becoming “too specific” to learn new tasks at 25:59 is to consider exactly how the gradient of weights is computed.
It’s the matrix multiplication between the directional error after a layer and the directional values before the layer. When the values become totally orthogonal to the error (they contain no information relative to the error), then it’s impossible to reduce the error by changing the weights in that layer.
The reason weight randomization helps with this problem is that it introduces new values after the layer that was randomized. However, a much more efficient way to do this is to instead reduce the existing weights in a layer with linear regression over a representative sample of data to “pack” the good information into fewer existing neurons. Then you’re free to randomly initialize the remaining neurons, or even better, to initialize weights that produce values already aligned with the directional error! I’ve got some ongoing research in this area if anyone is interested in collaborating. 🤓
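A minimal sketch of the gradient computation this comment is describing, in plain NumPy with hypothetical shapes: for a linear layer the weight gradient is literally the product of the pre-layer values and the post-layer error, so a feature that carries no signal gets exactly zero gradient.

```python
import numpy as np

# For a linear layer y = X @ W, the weight gradient is dW = X.T @ delta,
# where delta is the error signal arriving after the layer.
rng = np.random.default_rng(0)
X = rng.normal(size=(100, 4))
X[:, 3] = 0.0                      # the 4th input feature carries no signal
delta = rng.normal(size=(100, 2))  # directional error after the layer

dW = X.T @ delta
# Weights fed by the dead feature receive exactly zero gradient: no matter
# how large delta is, changing that row of W cannot reduce the error.
print(np.abs(dW[3]).max())  # → 0.0
```

A dead feature is the extreme case; the commenter's point is the softer version, where the values merely contain no component useful for the error direction.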
sounds pretty badass. might be easier to do a backward pass through lin-reg as well
I'd be interested! How do I get in contact?
@@jadenlorenc2577 my TH-cam profile has links to different places, whatever is easiest for you!
Just give this environment to speed runners, watch the true potential of what humans can do with games.
Thanks for the video!
At 7:10, the first pronunciation of Muesli is right: German Müsli. "Muesli" may be the Swiss German spelling.
amazing breakdown, thank you for making this paper accessible to me!
22:55 uhh 5 x 300 isn't 1800 lmao
Do you think the approaches here could be applied to Dreamer V3?
If you're ever interested in collaborations, let me know. I'd love to have you on my newsletter to cover some of your most interesting ideas.
I wonder if there is any benefit to be had at all from, like, across multiple full training iterations, distill a large model into a smaller one and then distill the small one back into a larger one (vs. *just* repeatedly distilling a large model into a model of the same size)
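For reference, the core of any such distillation step (large→small, small→large, or same-size) is the same soft-target loss; only the student changes. A minimal sketch of the standard temperature-softened KL objective (names and temperature are illustrative, not from any specific paper):

```python
import numpy as np

def softmax(logits, T=1.0):
    """Numerically stable softmax at temperature T."""
    z = (logits - logits.max()) / T
    e = np.exp(z)
    return e / e.sum()

def kd_loss(teacher_logits, student_logits, T=2.0):
    """KL(teacher || student) on temperature-softened distributions."""
    p = softmax(teacher_logits, T)
    q = softmax(student_logits, T)
    return float(np.sum(p * (np.log(p) - np.log(q))))
```

Whether the small→large "re-inflation" step buys anything over repeated same-size self-distillation is exactly the open question the comment raises.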
sounds like RL is progressing? maybe I should jump back in !
since wandb doesn’t work for me i will actually try clearml thanks to you
I really liked vscode theme on the clear ml section. Can you share it?
Community Material Theme ocean high contrast
Been thinking about this for some time
7:10 Myu-slee. It's a quick, easy and tasty breakfast so that you, too, can be reinforced!
Lmao I don’t think I could have been any further from the mark
@@EdanMeyer no worries -- it was incredibly entertaining XD
I wonder if you could train a model that could beat a human in Rock Paper Scissors, but with retained memory in a best of 7 or so. That would only require it to train on human behavior episodes, which would be hard to acquire. But if this was possible with synthetic games, this would be the best party trick ever.
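Far short of an RL agent, but a toy sketch of the "retained memory" idea: even a frequency count over the current best-of-7 exploits human non-randomness. All names here are hypothetical:

```python
import random
from collections import Counter

BEATS = {"rock": "paper", "paper": "scissors", "scissors": "rock"}

class FrequencyBot:
    """Toy opponent model for a best-of-7: remember the human's moves
    so far and counter their most frequent one. Not a trained model,
    just the memory idea from the comment above."""
    def __init__(self):
        self.seen = Counter()

    def move(self):
        if not self.seen:
            return random.choice(list(BEATS))  # no memory yet: play randomly
        predicted = self.seen.most_common(1)[0][0]
        return BEATS[predicted]

    def observe(self, human_move):
        self.seen[human_move] += 1
```

A learned model would presumably replace the counter with a sequence model over move histories, which is where the human-behavior data problem kicks in.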
Really love it !
My 2nd petition on this matter. Please make a video of how you read and implement papers. Thank you **kiss**
Still considering. Part of the issue is that every paper is so different when it comes to this, and a lot of the background is going to depend on the paper. I still might try, as I can probably extract some general guidelines from my process.
@@EdanMeyer Where to start would be a pretty good help
Why did they have to choose the same name as the Ada programming language ._.
They did the same thing with MLKit, which was a suite of tools for the ML language, until Google decided it should instead be a machine learning kit.
I’m pretty sure every short name in ML papers shares a name with something else at this point lol
The hell, we have the same name!
Coffee is culture too!
Good stuff
"ADA" and "Muesli"
Thought this was about the Cardano ecosystem, lol.
I worry that independent agents will make mistakes faster than we can realign their goals.
An army of GPUs? Time to break open the piggy bank.
Wow x)
Muesli is pronounced "MEW-zlee" HTH
i can't even train CIFAR-10 in 15 mins
20:25 I laughed
I cried
I sobbed
Like for a cultured matcha enjoyer
first