this video is super underrated. In fact, the whole channel is underrated.
Maybe I should follow the channel then 😅.
This was my first vid, and the explanation was really well simplified
Your channel IS SO GREAT, I share it with all my eng friends so you get more visibility!
This video is amazing. You explained everything in such a simple manner. I am feeling really motivated to learn more about reinforcement learning and neural networks after watching this.
I don't know how I stumbled upon this video but that was very interesting and intuitive to understand. Thank you.
Too beautiful, you could watch this kind of video all day without getting bored
This was so much easier to understand than the other RL videos that came up when I searched this topic
Can we have the code for this
Lol😅😅😅😅😅😅
Very very underrated channel
Underrated, two Rs
@@benc7910 thank ya sir
This is really awesome! It's the best video that explains DRL in such an easy to understand way!
I really like the way you visualize what you are talking about. Thank you for putting in the effort!
This is a super underrated video
Your videos are great. Looking forward to more!
This was so surprisingly great :3
agi: 1. ai develops an understanding of win-loss conditions and sets policy params (inputs & actions) accordingly. 2. ai creates (= designs & builds) training env(s). 3. ai iterates, evals & adjusts policy parameters accordingly 4. done (or validation run(s) w/ human(s))
Great video, very helpful, easy to understand.
I agree, once you see how it all works it seems like 1s and 0s. Give me some feedback on r/grand unified theory or cosmo knowledge
Excellent. Congratulations ❤
Excellent content!
Amazing video as always :)!
thx and god bless u, regards from hong kong, china, merry christmas... ^__^
Thanks a lot for this one! 😊
What a great series! I have a question for the experts... was it necessary to map velocity as an input? I'm guessing it's not absolutely necessary and was done to make the training faster? My guess is based on the assumption that the timing of the ball x/y changes to the inputs have an effect, but I may be wrong.
Can I speed up training if I know the exact pixels that have high importance and are static?
Can you playlist each one of your topics plz?
I wanted to post on Twitter(X) your video topics but could only post a single video at a time.
Great content by the way. Ty very much.
Your perspective on some topics helped me a lot to get a more intuitive understanding.
Good idea! Here's one on generative AI:
th-cam.com/play/PLWfDJ5nla8UoR8P7AGqVw7ZPjXajUFLMo.html
Here's one on reinforcement learning
th-cam.com/play/PLWfDJ5nla8UoexEaLqVMw7q3Ft0vRYscL.html
Here's one on LLMs + text-to-image
th-cam.com/play/PLWfDJ5nla8UoG2mvvHs_OS0asAKC5HJeu.html
@@g5min Ty!
Excellent
Superb
Super helpful! Thank you 🙏🏽
Thank you!
What is your reward function for the pong game? I did a similar pong game and I couldn't get it to learn.
I get how the model can see moves and output the up or down action. But I don't get how the model tracks the score for rewards etc.
Can someone explain how the reward is fed into the model?
how many layers should such a network have?
i just have a question, what is that thing at 6:20? it's like a worm?
like, i didn't take it in my math class.... i'm 16 btw
i mean the one u added
Sigmoid function. Basically squishes the results from what the ReLU spits out to a number between 0 and 1
@@insecureprince120 like this?:
A = 5
B = 10
[this thing] 0.5 = 7.5 ?
if yes, it's the same as the "lerp node" in unreal engine
@ not quite, let's say A=5 and B=10 are outputs from 2 nodes after ReLU. sometimes these outputs are multiplied by weights going to the next layer, maybe not in his example, not sure. but all the outputs after ReLU are summed together, THEN squished with this sigmoid function. formula: S(x) = 1/(1+e^-x)
where e is Euler's number (a constant) and x is the sum of the outputs of the previous nodes (so in your example x = 5+10 = 15). this would give the output S(15) = 0.999… (very close to 1 in this example)
if you look up the graph you'll see it quite literally looks like that squiggle worm he showed. but yeah, sigmoid pretty much just normalizes (squishes) the sum of the outputs to fit between 0 and 1
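In case it helps, here's a tiny Python sketch of that squish (the 5 and 10 are just the made-up example numbers from above):

```python
# Minimal sketch of the sigmoid "squish" described above (example values only).
import math

def sigmoid(x):
    # S(x) = 1 / (1 + e^-x): maps any real number into the range (0, 1)
    return 1.0 / (1.0 + math.exp(-x))

a, b = 5, 10           # pretend outputs from two ReLU nodes
print(sigmoid(a + b))  # ~0.9999997 -- the summed input 15 lands very close to 1
print(sigmoid(0))      # 0.5 -- the midpoint of the S-shaped "worm" curve
```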
@@insecureprince120 bro I wasted your time, idk what I'm gonna say, thank you.. ig? god bless you
thank you for this!
but by what number do you change the weights? like, you never told us
Facing the same problem
Thank you!!!
Brilliant
Simple Reinforcement learning is extremely dangerous in certain nonstationary environments 😅
what's the name of this video game?
that was good
Pls I want the code plsss
ich bin confuzzled (i dont actually speak german FYI)
Can you share the source code for this project
You can follow the link to the Karpathy site at the end of the video, repeated here:
karpathy.github.io/2016/05/31/rl/
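For the people asking above how the score/reward actually gets into the network: in that post the reward basically scales the policy-gradient update for the actions the agent took. A rough, simplified sketch of that idea (my own hypothetical code, not the exact code from the post):

```python
# Rough sketch of how a scalar reward can modulate the policy-gradient update,
# in the spirit of the Karpathy post linked above (simplified, not his exact code).
import numpy as np

def discount_rewards(rewards, gamma=0.99):
    # Spread each point's reward backwards over the frames that led up to it.
    discounted = np.zeros_like(rewards, dtype=float)
    running = 0.0
    for t in reversed(range(len(rewards))):
        if rewards[t] != 0:   # Pong: a point was just scored, reset the running sum
            running = 0.0
        running = running * gamma + rewards[t]
        discounted[t] = running
    return discounted

# Per-frame rewards for one episode: 0 until a point is scored (+1 we scored, -1 we lost).
rewards = np.array([0, 0, 0, 1, 0, 0, -1])
advantages = discount_rewards(rewards)
# Each frame's log-probability gradient is then multiplied by its advantage, so moves
# that preceded a win are reinforced and moves that preceded a loss are discouraged.
print(advantages)
```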
Imagine using reinforcement learning in quantitative finance 😊
ah yes, reinforcement learning. a fundamental computer graphics technology
I think that character/game-AI is pretty central to graphics
Why so negative?
@@g5min especially AI image generation or processing nowadays
he just HAD to mention his macbook air...