love these thoughts... on sense, did you see my video here th-cam.com/video/Dov68JsIC4g/w-d-xo.html (physical symbols...)? on emotions, i've thought of these as 'learning signals' (did you see this: th-cam.com/video/5EcQ1IcEMFQ/w-d-xo.html). on free will... i wonder how it differs; seems like the boundary to explore further - does it matter where the goal came from?
I remember all of these developments and they never seemed like a big deal at the time. It felt like winning at only chess meant we still had a long way to go.
Hi! I just finished the series. Great as usual! I would like to read up a bit more on the ML algorithms and networks that are behind these LLMs. I saw in this series that you kept referring to some papers, also highlighting parts of them with the marker. If you have a bibliography or a comprehensive list of these documents, would you mind sharing it please? Thanks!
awesome! I have one more summary video coming soon. i try to show all the papers so it's easy to look up, let me know if there is something you are looking for specifically
@@ArtOfTheProblem Will wait for the video, thanks! I am planning to understand better the distinction between classic NNs (such as the ones used for identifying digits) and transformers, since I will start a project in which I will have to fine tune some LLMs to try to demonstrate whether or not they are capable of identifying logical fallacies in political debates.
One tweak that would help this video perform better is to decrease the relative volume of the background music, especially at the end right before the ad. But it may be too late for that on this one, idk how TH-cam works.
Wow... I just rediscovered this channel. I remember watching your RSA and cryptography series around the time I purchased my first bitcoin and now I'm an Ethereum developer. This video was good, I don't have much input right now but I'm glad I found your channel again.
@@ArtOfTheProblem I'm working on community whistleblowing. I can never post links in youtube comments but if you google the title below you can find my paper: TandaPay Whistleblowing Communities: Shifting Workplace Culture Towards Zero-Tolerance Sexual Harassment Policies
@artoftheproblem I agree! I love the music. But maybe if you just lower its volume compared to the narration, then you might appeal to more people without losing those of us who like the music (but not necessarily its intensity). I think ones who complain might just be easily distracted by the soundtrack’s loudness rather than hate the music choices.
@@Farligefinn You know what, I just wrote a paragraph disagreeing with you, but I reread the initial question and deleted it. Rereading and reinterpreting "Does it think actually?", I actually see what you're saying. A clearer word would have been "reason." "Think" can have a few different interpretations, and I was attributing it to consciousness. But whether AI "reasons" is a very different question entirely, and I believe that's what he and you mean. Does it go through a sequence of logical steps from premises to a conclusion? Does it use deduction? This is what was meant.
This is nice content but I think it's now relatively well-agreed upon that there is no MCTS in o1, it's just RL, which surprisingly doesn't take too much away from the video, but can probably be added as a footnote in the description or the comment. Read what Nathan Lambert (RL expert) says about this in his article "OpenAI's o1 using "search" was a PSYOP" (can't attach link).
yes, this is why I pulled back from going too deep into o1, as it's really about the larger trend. but I haven't read that article, i'll have to look it up.
Here's a puzzle: Do all people reason or do many only memorize patterns? Even people who definitely do reason, do they always reason or do they also just memorize patterns most/much of the time?
That's a wonderful question Andrew. I'm a cognitive scientist who is watching the emergence of LLM-based AI with that very question in mind. The fact that LLMs can come so close to our own cognitive abilities is usually viewed as a sign that AGI is almost here. But it can also be viewed as a demonstration that human cognition itself is nothing more than the repetition of learned patterns with minor variations. In one case we'll be thrilled by how clever we are to have reinvented the awesome capabilities of human intelligence. In the other, we're more likely to be humiliated by the realization that we are, essentially, repetition/prediction engines. The reality almost certainly falls between the two, but as someone who has studied human intelligence his entire life (in and out of academia), my bet is that we are much closer to repetition/prediction machines than we'd like to admit. I'd love to find a deep discussion of this issue. Maybe a future video in this series (hint, hint)?
I'd argue humans don't tend to rely on either very often. Instead, humans tend to think very heuristically. Deductive reasoning and memorization/recollection are really only required for very precise tasks. Instead, our brains learn a very general feeling of how to do things by strengthening neural pathways that are used repeatedly. Even humans who try to act very logically are generally heuristically feeling their way through tasks, occasionally thinking through algorithms that have been "memorized".
I agree :)
Having exchanged over two million words (and growing), I present as the sole authority on ChatGPT's reasoning capacity and capability, and there isn't a single human (beyond myself, of course) who can compare. If one inputs genius, then the output will be of genius level. My input, from the outset, has been God-level genius. Over a five-month period, can you imagine the form and quality of the output? Probably not!
Thanks so much for these, I had no idea about some of these approaches. I'm wondering now if anyone's tried applying MuZero to ARC, since the challenge of ARC is learning implicit rules from just a few examples.
great presentation! although this seemed more about framing the question and less about answering it. can machines reason or not? I still don't know. 😅
Yes I covered that here have a look and let me know, i did a fast sketch here (th-cam.com/video/qAb581l7lOc/w-d-xo.html) but explained more here (th-cam.com/video/OFS90-FX6pg/w-d-xo.html)
This is a very hopeful video. There are billions of dollars being poured into bringing the resources to hand, to find an effective approach to AGI... Once AGI really kicks in, the acceleration of progress bounded only by our imagination will be something to behold. Absolutely awesome. I hope it leads to a world of abundance where we have no need for psychopathic power seekers. 🤞
@ArtOfTheProblem maybe something in response to the 5+ hours of Anthropic interviews on Lex Fridman... I'm sure that might inspire some topics? Sam Altman rarely gives any insights to what OpenAI are doing, Mark Zuckerberg is equally vague. I think that interview gives more of an insight to the direction of travel.
I think the title is perfect. You also were very prescient to include the ARC-AGI benchmark, as o3 showed that, allegedly, this recipe of CoT tree search/RL (PRM), increased training compute, or some more scalable reinforcement fine-tuning can solve even that, at the expense of much more compute directed at it. Wonder what your thoughts on o3 are.
The information and animations are both excellent but the music overpowers your audio. Either lower the volume of the music or get rid of it completely, please.
Context length is a problem - that's the main reason models need to keep getting bigger. One option is a CNN-inspired architecture: train a model that is shown a sliding window and produces a compressed token, which is repeatedly fed back to it as input; once the output is small enough to fit a full-context model, it is used like GPT, Claude, etc. Another option is to use RL to mutate or search for JS code capable of generating code - JS is so abstracted it's perfect. I made a small programming language with hoisting, so that the sequence of operations doesn't matter, and a simple syntax, so the local-minimum escape problem is solved, and I want to train a model on it. If I get a model I will continue training; otherwise I'll do a dev-log video. Eventually I'll get the world's first infinite-context model.
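The sliding-window scheme described above - compress each window, feed the result back, and repeat until the text fits a normal context - can be sketched in a few lines. This is an editor's toy illustration, not the commenter's actual system: the `compress` function is a hypothetical stand-in (it just keeps the first few words) for whatever learned model would do the real compression.

```python
def compress(chunk):
    # Stand-in for a model step that maps one window of text
    # to a much shorter representation (hypothetical: first 3 words).
    return " ".join(chunk.split()[:3])

def fit_into_context(text, window=200, limit=100):
    # Repeatedly compress sliding windows until the text fits the limit.
    while len(text) > limit:
        chunks = [text[i:i + window] for i in range(0, len(text), window)]
        compressed = " ".join(compress(c) for c in chunks)
        if len(compressed) >= len(text):  # give up if compression stalls
            return compressed[:limit]
        text = compressed
    return text

long_text = "the quick brown fox jumps over the lazy dog " * 40
print(len(fit_into_context(long_text)))  # result fits the 100-char limit
```

The recursion terminates because each pass either shrinks the text or falls back to truncation; a real system would replace `compress` with a learned summarizer and tokens rather than characters.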
thank you! I studied both in school, and naturally land somewhere in the middle....bad at both! I enjoyed algorithm design, but what Iove most is putting on a 'show' whether movie, play, product or haunted house :)
@@ArtOfTheProblem Thanks for the reply. Well, about AI - I think we should just call them statistical machines or dynamic pattern parsers. I am really skeptical about non-text machine learning - we still have not solved the fly-brain problem. Scientists have a fixed 3D map without understanding how it works - it's like mapping an Intel CPU while still knowing nothing about the ALU, registers, memory, or gates.
It seems like some of these developments regarding world models should have huge implications for robots that can function in a human centric world. I think we’ll see an explosion in development of robots that can help humans with everyday tasks and a robot that can be a useful household assistant will be a reality in the next 10 years!
4:19 Could you elaborate on which hand-coded formulas used by Shannon with TD-Gammon in the year 1989 you are referring to? Also, when and how did Shannon work with TD-Gammon? "And so, the first key breakthrough in machines mimicking intuition for position quality came when neural networks replaced the hand-coded formulas Shannon used in 1989 with TD-Gammon"
Yes! I made a whole video on this, you can check it out here: th-cam.com/video/Dov68JsIC4g/w-d-xo.html - please let me know if you have questions after watching. Shannon didn't do TD-Gammon, Tesauro did. enjoy
15:52 and 16:09 was wondering where this music came from, would appreciate if I got a title or something :) Also, you don't have to remove background music / sounds just make them duck at a lower volume when you speak and you won't hear any complaints!
wow a music fan! and yes thanks for mix advice I need to find a tool that automatically does this so they don't compete (it's not just volume but also frequency I assume) - all the music is original via my friend cam: cameronmichaelmurray.bandcamp.com/ - i'll need to find where he posted that track if you really want it I can get you in touch with him
I would assume a large enough general prediction model could do this … I wonder if anyone has done experiments on models discovering simpler things (like gravity constant etc)
Please, invest in a decent microphone. It's brilliantly presented, but hard to hear well. The music track is not ducking either so your voice and the music compete for the same ears.
The current version of ChatGPT does not reason, either. They use a bunch of pre-prompt tricks to let it use its stored information to fake reasoning.
@@ArtOfTheProblem The difference is everything. Looking up an answer is not reasoning. Perhaps they should just have kids look up the answers to tests in public school, too.
@@ArtOfTheProblem No, that is exactly what it is. During training - the only time there is any intelligence or reasoning - the system takes data and organizes it into a sort of vector database, weighted by the relationships it finds between tokens. When you prompt, the model takes your tokens and runs them through that vector data, calculating what tokens to return on the other side. There is no intelligence, no thinking. It is a database lookup. It's just a little more "organic" because instead of a hard-coded result, the tokens are chosen based on likelihood of validity. That is all. It does not reason, in any way.
Constructive criticism:
1) The substance of the video was very good. Script was well written, delivery was ok. A bit monotone but not that bad.
2) Sound design was poor towards the end. The music drowned out your voice, and the lyrics were both distracting and discordant.
3) Your choice of clips, footage, and visuals was good. The video was informative when needed, and abstract/entertaining/interesting otherwise.
4) The narrative structure was okay. It was a mostly clear progression. At the end it became unclear which AI was doing what strategy.
5) Visuals were reused way too often. Visuals can be reused, but I think the brain wormhole clip was shown 6 times, way too many.
6) Beware over-using a metaphor image. The upwards shot at two trees was reused so many times as a visual for tree-like thinking that it just became annoying.
Yes I see the issue too , messed up ending in my edit as I got tired and cut a lot of stuff which was making the video so long - I’m planning to do a final supercut of all my ai videos into one doc without the sound issues and addressing notes like this
thanks for sharing, I feel the same way. the music is part of the original idea for the channel...a feeling. but because people can get distracted I think i'll post a music-free version as an option from now on.
Maybe it's not only the loudness, but also the choice of music that is distracting to some. E.g. at 2:00 I don't feel distracted, but at 15:00 very much so. Anyways, thanks for making these great videos
To me AI is just some linear algebra and some complex algorithms that follow orders, and the thing is, humans only need a few examples to learn, while AI needs a massive database of objects and images to "understand the subject"
@ArtOfTheProblem edit: I'm pretty sure in the future a lot of people will be fired and replaced by those "ai" - and, well, literally by the people that use the ai. And also, I get what you mean
How can you go through the run-up to AI without mentioning "Attention Is All You Need", the 2017 paper (with University of Toronto co-authors) which was the inspiration for LLMs?
@@ArtOfTheProblem , of course, not, since it is misguiding like any advert. The topic is in no way covered at all. Pinpoint the time marker for video if you think otherwise. So, my point is that the question is still on the table.
@@doctorshadow2482 yes good point, how about "Can ChatGPT reason?" obviously i do try to show what most people agree is the method, mcts on chains of thought. then there is the other camp that think it's all fake etc.
@@ArtOfTheProblem , name it "How neural networks could learn (almost) anything". This video has nothing about ChatGPT at all. No specifics of any kind. It provides a very abstract, high-level popular-science documentary with scattered thoughts. It lacks focus and real information. Anyway, it could be interesting for total beginners, so, please, continue!
Watch Full AI Series: th-cam.com/video/YulgDAaHBKw/w-d-xo.html
NO MUSIC version: th-cam.com/video/DFDOyMZw9Q4/w-d-xo.html
Sponsored by Brilliant | Use brilliant.org/artoftheproblem for 30-day free trial and 20% discount
Thank you for the NO MUSIC version!
Love your videos! Just a little suggestion. Background music is good, but too loud. It should never cover your speaking, like now. I suggest 15 - 20% less music volume, and you are good!
Can you share the names of the songs please
@@raa9558 these are original tracks by my friend. that song hasn't been posted yet but i'll tell him to: cameronmichaelmurray.bandcamp.com/
This Guy just explained all the core concepts in AI on one shot ,Congrats man!
:) thank you, i cut a LOT out of the video in my edit - going to post a shorter bigger summary soon
They must have prompted him real good
Agreed. Great job.
I love this video. I remember watching your videos like 10 years ago on Khan Academy about compression, entropy, Claude Shannon, etc. All timeless. I have always loved this style of documentaries. We need to protect you at all costs.
Thank you, I love hearing from og’s! support future work: www.patreon.com/c/artoftheproblem
Great video. Saw o3 and came back to this to really appreciate the historical trajectory. TH-cam is new pbs imo.
thank you so much, I made this video before o3 came out, so it was nice to see the progression. appreciate this
I love your video aesthetics, how you blend retro video clips with your explanations. I think you'd really enjoy retro-futuristic concepts and games like Bioshock and Fallout.
Love this , I definitely know the style you are talking about
I like this kind of history lesson instead of learning advanced topics directly. It gives us an idea how things were and explains why things are the way they are.
thanks! that was my motivation, there wasn't enough of it back in the day, in the 90's..
It is a great explanation of how current AI models reason. I liked the video a lot!
1. Simulation of future states.
2. LLMs that can give kind-of accurate answers with step by step reasoning.
3. RL approach that makes LLMs give multiple answers, then evaluate them to select the best one. (Requires more time)
It would be nice to see whether a model that wasn't trained on internet data could learn how to reason by interacting with an LLM, and practicing on its dreams, but maybe we'll see that in the future.
For the awesome review, history explanation and popularization:
Thanks! 🎉
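Point 3 in the summary above - sample several answers, score them, keep the best - is often called best-of-N sampling. A minimal editorial sketch follows; the `generate_candidates` and `verifier_score` functions are hypothetical stand-ins for the LLM sampler and the learned verifier/reward model, not real APIs.

```python
def generate_candidates(prompt, n):
    # Stand-in for sampling n chain-of-thought answers from an LLM;
    # here just hypothetical canned answers to "What is 7 * 8?".
    pool = ["42", "54", "56", "58", "63"]
    return pool[:n]

def verifier_score(prompt, answer):
    # Stand-in for a learned verifier / reward model that scores each answer.
    return 1.0 if answer == "56" else 0.0

def best_of_n(prompt, n=5):
    # Generate several candidates, score them all, keep the highest-scoring one.
    candidates = generate_candidates(prompt, n)
    return max(candidates, key=lambda a: verifier_score(prompt, a))

print(best_of_n("What is 7 * 8?"))  # -> 56
```

The trade-off the comment notes ("requires more time") is visible here: N forward passes plus N verifier calls per question, in exchange for a better chance of surfacing a correct answer.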
thanks for sharing summary
I found Chat GPT to be exceptionally good at explaining all sorts of topics, and in many cases, better than every person I've ever met.
@@ninjacats1647 this is true
The match between soundtrack and content is INSANE! The notes mimicking the concepts discussed by using things like pitch or chords.... goosebumps.
thank you, so many people are bothered by my music. it's nice to hear.....though i got a bit nuts at the end
Thanks!
WOW thank you for your support, it means a lot.
I am not a subscriber? I remember watching this channel about 12 years ago. I found it again, and it keeps creating art out of problem solving. Good job!
welcome back!!!
better explained than anything else I've seen until now. Wow, nice flow in the video too
thank you! I thought I packed too much in :)
@@ArtOfTheProblem yes but that makes it interesting. I probably didn't get it all, but i'm interested to learn more after seeing it!
@@olli757 couldn't ask for more, rabbit hole time!! I'm actually working on a large AI summary for next week
@@ArtOfTheProblem cool, i'm subscribed... I always wondered exactly how the "reasoning" worked. What I remember from your video is that, like with chess, instead of trying all the games it randomly picks 100 of them. So the same with the reasoning..
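The "randomly play 100 games instead of all of them" idea mentioned above is the Monte Carlo rollout at the heart of MCTS-style search. A toy sketch, using a hypothetical Nim-like game (players alternately take 1-3 stones; taking the last stone wins) chosen by the editor purely for illustration:

```python
import random

def random_playout(stones, my_turn):
    # Finish the game with both sides moving at random;
    # return 1.0 if "we" took the last stone, else 0.0.
    while stones > 0:
        stones -= random.randint(1, min(3, stones))
        my_turn = not my_turn
    return 0.0 if my_turn else 1.0  # the player who just moved wins

def estimate_move(stones, take, n_playouts=100):
    # Instead of exhaustively searching every continuation,
    # sample n_playouts random games and average the results.
    wins = sum(random_playout(stones - take, my_turn=False)
               for _ in range(n_playouts))
    return wins / n_playouts

random.seed(1)
values = {take: estimate_move(7, take) for take in (1, 2, 3)}
print(values)  # noisy win-rate estimates for each candidate move
```

The estimates are noisy, but they get sharper with more playouts, which is exactly the "thinking longer helps" trade-off the video describes.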
THANK YOU for publishing a no-music version of this video (see pinned comment by ArtOfTheProblem). It is such a clear and informative video that I hated to see it lose views due to the competing soundtrack. I'm going to watch it again right now to see if I missed anything the first time around.
Thanks again for being so responsive to your followers.
Thank you for saying that, I find the music keeps me interested as I take sooo long to edit
The video is good but there are sooo many random sounds that make it difficult to focus on what you are saying, specifically towards the end.
Here you are! th-cam.com/video/DFDOyMZw9Q4/w-d-xo.html
@Zayyan_Shaibu thank you!!
I agree
If that’s easily distracting you, you might want to get some tests run on you for ADHD or Autism
@@___Truth___ or hard of hearing, or older, or whatever, of course. A tad dramatic to base this on one comment.
Great vid. I love that it clearly explains the progression, like the pieces coming together. Can't wait to see the next steps!
thanks, next up i'm taking a detour into economics
I'm loving almost everything about this: the editing, the subject matter, the music. But as one other commenter alluded to; the audio mixing really falls short, especially near the end. Please consider making the background and effects less prominent going forward, it really sucks having to strain just to hear your voice, which is what we're here for!
Subscribed ❤
is this the last video in the series? regardless, can't tell you how valuable and enjoyable i've found them all. thank you for them.
Albert Einstein said: "If you can't explain something in a simple way so anybody can understand it you don't fully understand it yourself". Perhaps you are one of the few LLM experts we have!
THANK you this means a lot to me.
@@ArtOfTheProblem I did, but reddit really hates it; it got removed on 4 subs. The internet does not like to get educated anymore, man.
Having read a bit about the AI safety arguments, learning about these arguably incredible developments in artificial minds is now accompanied by a sense of dread as well as a sense of awe.
I love to hear this...well said
@@ArtOfTheProblem Maybe do future videos about guard rails, and other thoughts on how to protect our society from potentially hostile AI?
@@Julian-tf8nj when I think i have a unique insight I will...thank you!
Thank you so much. I wish I had the time to give feedback thanks for being willing to open it up
Appreciate the feedback! happy to share
Amazing explanations, visuals, and historical context!
IIRC MuZero trained the policy and value networks (used to roll out the MCTS tree) also on the output of the MCTS tree. This seems super useful because search can be used to improve the training of the networks (not just the results at inference time). I wonder if this also works for CoT/ToT in LLMs, where the pretraining could include ToT to boost training performance?
yes it did, and yes it seems to help. Look at inference time training, just a few days ago a group got a new record on the ARC test doing this kind of thing (i haven't had time to go deep). x.com/akyurekekin/status/1855680785715478546
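The loop this exchange describes - search output becoming a training target for the policy network, as in AlphaZero/MuZero - can be sketched with made-up numbers: normalize MCTS visit counts into a distribution, then push a softmax policy toward it by gradient descent on cross-entropy. The visit counts below are hypothetical; a real system would get them from an actual tree search.

```python
import math

def softmax(logits):
    # Numerically stable softmax over a list of logits.
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

logits = [0.0, 0.0, 0.0]  # policy network output for 3 candidate moves
visits = [10, 70, 20]     # hypothetical MCTS visit counts at this state
target = [v / sum(visits) for v in visits]

# Gradient of cross-entropy(target, softmax(logits)) w.r.t. logits is p - target,
# so each step nudges the policy toward the search distribution.
for _ in range(200):
    p = softmax(logits)
    logits = [x - 0.5 * (pi - ti) for x, pi, ti in zip(logits, p, target)]

print([round(x, 2) for x in softmax(logits)])  # converges toward ~[0.1, 0.7, 0.2]
```

This is the sense in which "search improves training": the searched distribution is a better teacher than the raw network output it started from.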
another great video. Understanding the "world model" and the algorithm that makes the decisions in it was very expansive. Also, the self-training/emulation of dreams is a powerful analogy to the human.
seeing how thinking longer, blended with intuition, makes better chains of thought is also fantastic. Every time I reflect on machine learning, I learn more about myself. Which kind of makes you think: is it more sentient if it reminds me of myself? or the best emulator ever!
thank you! ai agree....also you are my "top commentor" according to YT. :)
Thanks!
mucho appreciated!
Great video as always, can't wait for the next ones! Top research quality.
I think world models deserve more focus than LLMs, which are probably a dead end to true understanding of the real world. Yann LeCun has very interesting ideas about these, in his JEPA and V-JEPA architectures and some of his lectures. I also think neuroscience can provide incredibly interesting and valuable insight into ML architectures - why not take ideas from a model that has undergone hundreds of millions of years of optimization for the very abilities we are trying to model? Maybe memory is an interesting pathway (perhaps for a video), both working memory and long term (episodic, semantic)...
Anyways, just some of the ideas I've been thinking about recently.
appreciate you sharing these thoughts, i've been following LeCun as well and hope to do another update once I see more results
Best Channel I've ever followed.
Thank you! when did you join? Please help post to your networks
@@ArtOfTheProblem I for one was looking up some "how does AI work" stuff yesterday and some of your vids came up a couple of times, I watched multiple authors with their own unique takes (3Blue1Brown and Nottingham Uni's Computerphile also good channels). This video made me follow tho. I think you earned it :3
@@notbfg9000 great to hear, i've been working to try and fix my thumbnails to make them interesting to click on. always open to feedback
@@ArtOfTheProblem No particular criticisms there :)
I don't really pay great attention to thumbnails, but maybe that's not true for most people lmao
Finally. I live for these videos. They are the most fascinating vids ever made. Thanks for keep on educating us further, you are a hero!
thank you I appreciate it
I would love to see a video like this on training LLMs and AI in general on morality. How to stop a decision tree that results in a positive outcome but arrives at it through immoral choices or actions.
That would be difficult but worth trying... First it has to be agreed upon what's moral, you know, starting out from the absolute that states everything is relative...
I would love to see a video about the ethics of machine learning models and especially LLMs. There is a healthy body of literature out there to draw from about issues like intellectual property and copyright, enabling and obscuring bias, impact on marginalized communities, the resources used by model training and computation, etc
thanks for sharing, noted!
If you can help share my new video around any of your networks today it might catch fire and would help me support the channel. I appreciate your help! th-cam.com/video/PvDaPeQjxOE/w-d-xo.html
This is an absolutely amazing video!!!!!
thank you! I was so in the weeds with it i hope it comes across as clear? I tried to strike a balance...
Very well done. One minor suggestion, the little sound effects could be less disruptive- maybe lower volume and fewer in number and duration? There is so much great explanatory detail, but I think some will find the extent of sound effects used a bit disruptive to the listening/learning process.
agree and thanks I tried this out on my most recent video, worked way better..
Great video. Discussing the definition of reasoning will probably be a moot point if we can all do the same things... The difference between us and machines is that we aren't mere machines: we have life and choices. The machine can be turned on and off, and only does as much as it is programmed, or in this case, trained, to do, which is what limits it from achieving something closer to AGI. AGI needs a robot to sense the world, to understand the world. However, that's limited to the physical world. It won't understand our emotional world because it doesn't feel emotions, and it doesn't understand morality because it doesn't have a sense of morality as we do; we have to teach it that. And it isn't self-motivated, so it's not responsible for anything: we are responsible for the goals we direct it toward. We have self-motivation and the free will to act on our motivations.
love these thoughts...on sense, did you see my video here th-cam.com/video/Dov68JsIC4g/w-d-xo.html (physical symbols...)
on emotions, i've thought of this as 'learning signals' (did you see this: th-cam.com/video/5EcQ1IcEMFQ/w-d-xo.html)
on free will...i wonder how it differs, seems like the boundary to explore further - does it matter where the goal came from?
Dude, great video. But please watch out for the music volume levels. A lot of times is hard to hear you.
I remember all of these developments and they never seemed like a big deal at the time.
At the time it felt like winning at only chess meant we still had a long way to go.
i know exactly....
I wish i could learn how to think 🤔
Exactly. Reasoning is a skill.
I think I could learn how to wish 🧞♂️
Hi! I just finished the series. Great as usual!
I would like to read up a bit more on the ML algorithms and networks that are behind these LLMs. I saw in this series that you kept referring to some papers, also highlighting parts of them with the marker. If you have a bibliography or a comprehensive list of these documents, would you mind sharing it please?
Thanks!
awesome! I have one more summary video coming soon. i try to show all the papers so it's easy to look up, let me know if there is something you are looking for specifically
@@ArtOfTheProblem Will wait for the video, thanks! I am planning to understand better the distinction between classic NNs (such as the ones used for identifying digits) and transformers, since I will start a project in which I will have to fine tune some LLMs to try to demonstrate whether or not they are capable of identifying logical fallacies in political debates.
Thanx for the sharing with excellent sound track. Peace & love
yay, not everyone hates the music
@@ArtOfTheProblem Thanx for the feedback
Thanks for video ❤
appreciate the comment please share with anyone in your network who is interested!
PLEASE make a video on memory augmented AI (neural turing machines/differentiable neural computers)
thanks for suggestion, noted! currently watching the field
One tweak that would help this video perform better is to decrease the relative volume of the background music, especially at the end right before the ad. But it may be too late for that on this one, idk how YouTube works.
yeah i wish I could, it's locked after upload...i do have a no music version (unlisted link above) thank you for feedback
Wow... I just rediscovered this channel. I remember watching your RSA and cryptography series around the time I purchased my first bitcoin and now I'm an Ethereum developer. This video was good, I don't have much input right now but I'm glad I found your channel again.
I love these stories. i also fell down ethereum rabbit hole. curious what you are working on these days in that world?
@@ArtOfTheProblem I'm working on community whistleblowing. I can never post links in youtube comments but if you google the title below you can find my paper:
TandaPay Whistleblowing Communities: Shifting Workplace Culture Towards Zero-Tolerance Sexual Harassment Policies
@@ArtOfTheProblem TandaPay Whistleblowing Communities: Shifting Workplace Culture Towards Zero-Tolerance Sexual Harassment Policies
you'll never please 100% of any audience. 2nd law of conquest is a thing. keep doing your thing, your music is as iconic as vsauce's is to theirs
:) thanks
@artoftheproblem I agree! I love the music. But maybe if you just lower its volume compared to the narration, then you might appeal to more people without losing those of us who like the music (but not necessarily its intensity). I think ones who complain might just be easily distracted by the soundtrack’s loudness rather than hate the music choices.
If you can help share my new video around any of your networks today it might catch fire and would help me support the channel. I appreciate your help! th-cam.com/video/PvDaPeQjxOE/w-d-xo.html
Answering the question of "Does it think actually?" is as hard as the question "Are other people conscious like me?".
The hard problem of consciousness.
@@whitb62 Not really the same thing.
@@Farligefinn You know what, I just wrote a paragraph disagreeing with you but I reread the initial question and deleted it. Rereading and reinterpreting "Does it think actually?", I actually see what you're saying. A clearer word would have been "reason." "Think" can have a few different interpretations and I was contributing it towards consciousness. But whether AI "reasons" is a very different question entirely and I believe what him and you mean. Does it go through a sequence of logical steps from premises to a conclusion? Does it use deduction? This is what was meant.
@@whitb62 thanks for the forthright and civil answer :) was about to expect some harsher language that seems to be the norm online these days.
@GodVanisher Where has it been proven to be non-computable? Proven is quite a strong term, so I hope you have some valid source for this claim.
This is nice content, but I think it's now relatively well agreed upon that there is no MCTS in o1; it's just RL, which surprisingly doesn't take too much away from the video, but it could probably be added as a footnote in the description or the comments.
Read what Nathan Lambert (RL expert) says about this in his article "OpenAI's o1 using "search" was a PSYOP" (can't attach link).
yes this is why I pulled back from going too deep into o1, as it's really about the larger trend. but I haven't read that article, i'll have to look it up.
So good! Loved it
Mindblowing video. Subscribed.
Happy to have you, keep those notifications on as I have exciting new content coming over the next month
You bet I will. ❤
I subscribed from both my accounts.
@@hrshlgunjal-1627 :) this video is starting to blow up right now, finally, i fixed the thumbnail and that did it
@@ArtOfTheProblem Yeah, got to know your channel because of this video. Really amazing content. ❤
Nice video -- loved watching it -- a great summary!
appreciate this feedback
Keep up the great work!
appreciate it
Excellent video! You are a born professor! 👍
thanks mom
Nice presentation
Thank you! You explained very well.
stay tuned for more
The question of does it matter how it got to a correct solution is the same issue Einstein and Bohr confronted regarding quantum foundations.
Here's a puzzle: Do all people reason or do many only memorize patterns? Even people who definitely do reason, do they always reason or do they also just memorize patterns most/much of the time?
That's a wonderful question Andrew. I'm a cognitive scientist who is watching the emergence of LLM-based AI with that very question in mind. The fact that LLMs can come so close to our own cognitive abilities is usually viewed as a sign that AGI is almost here. But it can also be viewed as a demonstration that human cognition itself is nothing more than the repetition of learned patterns with minor variations. In one case we'll be thrilled by how clever we are to have reinvented the awesome capabilities of human intelligence. In the other, we're more likely to be humiliated by the realization that we are, essentially, repetition/prediction engines. The reality almost certainly falls between the two, but as someone who has studied human intelligence his entire life (in and out of academia), my bet is that we are much closer to repetition/prediction machines than we'd like to admit.
I'd love to find a deep discussion of this issue. Maybe a future video in this series (hint, hint)?
I'd argue humans don't tend to rely on either very often. Instead, humans tend to think very heuristically. Deductive reasoning and memorization/recollection are really only required for very precise tasks. Instead, our brains learn a very general feeling of how to do things by strengthening neural pathways that are used repeatedly. Even humans who try to act very logically are generally heuristically feeling their way through tasks, occasionally thinking through algorithms that have been "memorized".
Reasoning takes effort and the brain doesn't like to do that; it switches to pattern recognition and intuition as much as possible
@@sulemanmughal5397 I would go further and say going from reasoning to this is one kind of learning and is also akin to 'muscle memory'.
I agree :) also If you can help share my new video around any of your networks today it might catch fire and would help me support the channel. I appreciate your help!
"Charging down a path that often lead to the wrong conclusion." Yep, sounds human to me.
@@DisProveMeWrong so very human
wow, thank you for this!
appreciate it! stay tuned
Having exchanged over two million words (and growing), I present as the sole authority on ChatGPT's reasoning capacity and capability, and there isn't a single human (beyond myself, of course) who can compare. If one inputs genius, then the output will be of genius level. My input, from the outset, has been God-level Genius, over a five month period; can you imagine the form and quality of the output? Probably not!
can you say more about this? are you saying LLMs trained on their own thinking will reach levels beyond human?
Great video, as always:)
Thanks so much for these, I had no idea about some of these approaches. I’m wondering now if anyone’s tried applying muzero to arc, since the challenge of arc is learning implicit rules from just a few examples
@@easlern yes this is happening right now with test time fine tuning !
Ah Yoo, I see "Art of The Problem", I click. Easy like that.
:)
great presentation! although this seemed more about framing the question and less about answering it. can machines reason or not? I still don't know. 😅
thank you...i agree. I guess it depends on if you think "chains of words" count as thoughts.
Can you make a video about attention mechanism?
Yes I covered that here have a look and let me know, i did a fast sketch here (th-cam.com/video/qAb581l7lOc/w-d-xo.html) but explained more here (th-cam.com/video/OFS90-FX6pg/w-d-xo.html)
Simply excellent video, your style reminds me of every frame a painting
appreciate this feedback, I also enjoyed that channel
This channel is ducking mystic. I like it.
Welcome to the underground!
This is a very hopeful video. There are billions of dollars being poured into bringing the resources to hand, to find an effective approach to AGI... Once AGI really kicks in, the acceleration of progress bounded only by our imagination will be something to behold. Absolutely awesome. I hope it leads to a world of abundance where we have no need for psychopathic power seekers. 🤞
thank you for sharing, would love to know what you'd like to see next
@ArtOfTheProblem maybe something in response to the 5+ hours of Anthropic interviews on Lex Fridman... I'm sure that might inspire some topics? Sam Altman rarely gives any insights to what OpenAI are doing, Mark Zuckerberg is equally vague. I think that interview gives more of an insight to the direction of travel.
@@BrianMosleyUK yes I have been catching up on those
If you can help share my new video around any of your networks today it might catch fire and would help me support the channel. I appreciate your help! th-cam.com/video/PvDaPeQjxOE/w-d-xo.html
This video is kind of goated
thank you, i'm still struggling with how to title this video if you have thoughts
I think the title is perfect. You also were very prescient to include the ARC-AGI benchmark, as o3 showed that allegedly this recipe of CoT tree search/RL (PRM), increased training compute, or some more scalable reinforcement fine-tuning can solve even that, at the expense of much more compute directed at it. Wonder what your thoughts on o3 are.
@@DistortedV12 I know, that was crazy, the progress literally 2 weeks after that...i'm still looking into it, stay tuned!
The information and animations are both excellent but the music overpowers your audio. Either lower the volume of the music or get rid of it completely, please.
Music free version in top comment and description
Context length is the problem
that's the main reason models need to keep getting bigger
Or you could train a CNN-inspired architecture where the model is shown some sliding window and produces some token which is repeatedly fed back to it as input; finally, when the output is small enough to be taken as input for a full-context model, it is used like GPT, Claude, etc.
Or you could also use RL and mutate or search for JS code capable of generating code; JS is so abstracted it's perfect
I made a small programming language with hoisting, so that the sequence of the process doesn't matter, and simple syntax so that the local-minimum escape problem is solved, and I wanna train a model
If I get a model I will then continue training, else I'll do a dev log video
eventually I'll get the world's first infinite-context model
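For what it's worth, the recursive window-compression idea described above could look something like this in miniature; the token budget and the trivial keep-first-words "summarizer" here are placeholder assumptions standing in for a learned model:

```python
# Sketch of recursive sliding-window compression: split the input into
# windows, "summarize" each window, and repeat until the result fits in
# one context. The toy summarizer (keep the first few tokens) is a
# placeholder; a real system would use a trained model per window.
CONTEXT = 8  # max tokens per window, illustrative value

def summarize(tokens, keep=4):
    return tokens[:keep]  # placeholder for a learned compressor

def compress(tokens):
    while len(tokens) > CONTEXT:
        windows = [tokens[i:i + CONTEXT] for i in range(0, len(tokens), CONTEXT)]
        # Concatenate the per-window summaries and compress again.
        tokens = [t for w in windows for t in summarize(w)]
    return tokens

long_text = [f"tok{i}" for i in range(50)]
print(len(compress(long_text)))  # small enough for a full-context model
```

Each pass shrinks the input by roughly the window/summary ratio, so any finite input eventually fits, though obviously a lossy summarizer discards information at every level.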
thanks for sharing
I love your channel. Are you a programmer or more like a mechanical engineer?
thank you! I studied both in school, and naturally land somewhere in the middle....bad at both! I enjoyed algorithm design, but what I love most is putting on a 'show' whether movie, play, product or haunted house :)
@@ArtOfTheProblem Thanks for the reply. Well, about AI: I think we should call them just statistical machines or dynamic pattern parsers. I am really skeptical about non-text machine learning. We still have not solved the fly-brain problem: scientists have produced a 3D map without understanding how it works. It's like mapping an Intel CPU and still knowing nothing about the ALU, registers, memory, gates.
If you can help share my new video around any of your networks today it might catch fire and would help me support the channel. I appreciate your help! th-cam.com/video/PvDaPeQjxOE/w-d-xo.html
@@ArtOfTheProblem "yes the fire rises" Bane =)
@ArtOfTheProblem, we love your positive clear messaging and pragmatic approach, thanks for making kool and informative videos!
thank you, i'm slightly disappointed with the ending, did you enjoy it? would love feedback!
@@ArtOfTheProblem Your disapointed with your ad for Brilliant?
@@piqueai ahaha sorry i mean the ending section of the video. was it rushed?
absolute cinema
It seems like some of these developments regarding world models should have huge implications for robots that can function in a human centric world. I think we’ll see an explosion in development of robots that can help humans with everyday tasks and a robot that can be a useful household assistant will be a reality in the next 10 years!
thanks for sharing, yes I'm watching this very closely
Another great video. Thank you. Are you an AI researcher?
4:19 Could you elaborate on which hand-coded formulas used by Shannon with TD-Gammon in the year 1989 you are referring to? Also, when and how did Shannon work with TD-Gammon? "And so, the first key breakthrough in machines mimicking intuition for position quality came when neural networks replaced the hand-coded formulas Shannon used in 1989 with TD-Gammon"
Yes! I made a whole video on this you can check it out here: th-cam.com/video/Dov68JsIC4g/w-d-xo.html - please let me know if you have questions after watching. Shannon didn't do TD-Gammon, Tesauro did. enjoy
@@ArtOfTheProblem Thank you. I'll watch it.
10:07 is there any way I can access this interactive demo?
@@kennarajora6532 worldmodels.github.io
15:52 and 16:09 was wondering where this music came from, would appreciate if I got a title or something :)
Also, you don't have to remove background music/sounds, just make them duck to a lower volume when you speak and you won't hear any complaints!
wow a music fan! and yes thanks for mix advice I need to find a tool that automatically does this so they don't compete (it's not just volume but also frequency I assume) - all the music is original via my friend cam: cameronmichaelmurray.bandcamp.com/ - i'll need to find where he posted that track if you really want it I can get you in touch with him
Yeah that would be wonderful!
Anyone know the name of the song that starts at 12:20?
The perceptron is a universal approximation machine. AI cannot think; it can only approximate thought. AI = approximate intelligence.
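As an aside, the perceptron mentioned above is simple enough to sketch in a few lines. Strictly speaking, the universal approximation property belongs to multi-layer networks; a single perceptron can only learn linearly separable functions, like the AND gate in this illustrative sketch (the learning rate and epoch count are arbitrary choices):

```python
# Minimal perceptron trained with the classic perceptron update rule:
# w += lr * (target - prediction) * x. AND is linearly separable, so
# the rule is guaranteed to converge on it.
def predict(w, b, x):
    # Step activation: fire if the weighted sum crosses the threshold.
    return 1 if sum(wi * xi for wi, xi in zip(w, x)) + b > 0 else 0

def train(samples, epochs=20, lr=0.1):
    w, b = [0.0, 0.0], 0.0
    for _ in range(epochs):
        for x, target in samples:
            err = target - predict(w, b, x)
            w = [wi + lr * err * xi for wi, xi in zip(w, x)]
            b += lr * err
    return w, b

AND = [((0, 0), 0), ((0, 1), 0), ((1, 0), 0), ((1, 1), 1)]
w, b = train(AND)
print([predict(w, b, x) for x, _ in AND])  # → [0, 0, 0, 1]
```

XOR, famously, cannot be learned this way no matter how long you train, which is exactly why hidden layers (and the approximation-of-thought debate above) matter.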
We need an AI Computer World Model based on the rules of Mathematics, Physics, Chemistry, and Biology for Aligned Scientific Discoveries 😎🤖
I would assume a large enough general prediction model could do this … I wonder if anyone has done experiments on models discovering simpler things (like gravity constant etc)
@ Good idea! I’m going to tinker around and see if I can create a simple simulation based on the math. What a fun project!
@ share when u do !
@ I’m all for Open Source 😎🤖
Can humans actually reason or are humans extremely good at recognizing, memorizing and using patterns?
some argue we are special because we can generate and recognize 'novel patterns' but I wonder...
Hey, what's the name of the song at 16:05? Thanks!
these are all original tracks
Please, invest in a decent microphone. It's brilliantly presented, but hard to hear well. The music track is not ducking either so your voice and the music compete for the same ears.
thanks, I have a great mic, but I do need to mix the audio better which i'll do next time (btw, i have a no music version in top comment)
Commenting to help with the algo, and moving to the no-music one to do the same
@@shenrr6802 thank you! I have the no-music version unlisted so as to avoid splitting the momentum
What is the reward?
the reward is defined in each case, which case are you referring to?
I like the music but it's too loud
thanks for note
The audio mix is horrific, it's not simply a matter of adjusting the levels
@@retrofitter no music version: th-cam.com/video/DFDOyMZw9Q4/w-d-xo.html
The current version of chat GPT does not reason, either.
They use a bunch of pre-prompt tricks to let it use its stored information to fake reasoning.
IF it reasons correctly what's the difference to you?
@@ArtOfTheProblem The difference is everything. Looking up an answer is not reasoning. Perhaps they should just have kids look up the answers to tests in public school, too.
@@KAZVorpal but it's not a database lookup
@@ArtOfTheProblem No, that is exactly what it is.
During training - the only time there is any intelligence or reasoning - the system takes data and organizes it into a sort of vector database, weighted by the relationships it finds between tokens.
When you prompt, the model takes your tokens and runs them through that vector data, calculating what tokens to return on the other side.
There is no intelligence, no thinking. It is a database lookup. It's just a little more "organic" because instead of a hard-coded result, the tokens are chosen based on likelihood of validity.
That is all.
It does not reason, in any way.
@@KAZVorpal yes, but my view is the vector operations can function as conceptual reasoning. i'm on Hinton's side
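For concreteness, the inference step being debated above can be caricatured with a toy bigram model. This is a deliberately crude illustrative assumption (a real transformer computes over learned continuous vectors, not count tables), but it shows the shape of the mechanism: prediction samples from a learned distribution over next tokens rather than retrieving a literally stored answer.

```python
from collections import Counter, defaultdict

# Toy bigram "language model": transition counts from a tiny corpus
# stand in for learned weights; inference picks the most likely
# continuation of the previous token.
corpus = "the cat sat on the mat the cat ran".split()

counts = defaultdict(Counter)
for prev, nxt in zip(corpus, corpus[1:]):
    counts[prev][nxt] += 1  # "training": accumulate transition statistics

def next_token(prev):
    # Greedy decoding: return the highest-probability next token.
    return counts[prev].most_common(1)[0][0]

print(next_token("the"))  # → "cat" (seen twice, vs "mat" once)
```

Whether you call that "lookup" or "reasoning" is exactly the disagreement in the thread; the point is only that the output is computed from aggregated statistics, not fetched as a stored record.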
1/ please read “Simulators” by Janus
then
2/ “The Waluigi Effect” by cleo nardo
@@JezebelIsHongry I read 1, i'll read 2 next, would love ur thoughts
Only in simple problem spaces. Tedious but constrained spaces can be helped by automation. But explosive complexity laughs at both man and machine.
Once we understand how we reason, making LLMs reason like us is possible.
Goooood
thank you! curious what questions you have after watching this?
There’s a very strong Mr. Rogers vibe going on here.
not the first time i heard....
The reply works lol 👍
Constructive criticism: 1) The substance of the video was very good. Script was well written, delivery was ok. A bit monotone but not that bad. 2) Sound design was poor towards the end. The music drowned out your voice, and the lyrics were both distracting and discordant. 3) Your choice of clips, footage, and visuals was good. The video was informative when needed, and abstract/entertaining/interesting otherwise. 4) The narrative structure was okay. It was a mostly clear progression. At the end it became unclear which AI was doing what strategy. 5) Visuals were reused way too often. Visuals can be reused, but I think the brain wormhole clip was shown 6 times, way too many. 6) Beware over-using a metaphor image. The upwards shot at two trees was reused so many times as a visual for tree-like thinking that it just became annoying.
Yes I see the issue too, I messed up the ending in my edit as I got tired and cut a lot of stuff which was making the video so long. I'm planning to do a final supercut of all my AI videos into one doc without the sound issues, addressing notes like this
Although I know I’m in the minority, I really enjoy the music. The ambiance created adds to the experience for me
thanks for sharing, I feel the same way. the music is part of the original idea for the channel...a feeling. but because people can get distracted I think i'll post a music-free version as optional from now on.
I generally like the music. But in the second half of this video the music is very loud and distracting.
Maybe it's not only the loudness, but also the choice of music that is distracting to some. E.g. at 2:00 I don't feel distracted, but at 15:00 very much so. Anyways, thanks for making these great videos
@@io9021 made a no-music version too! th-cam.com/video/DFDOyMZw9Q4/w-d-xo.html curious what questions you have after watching this
To me AI is just some linear algebra and some complex algorithms that follow orders, and the thing is, humans only need a few examples to learn, meanwhile AI needs a massive database of objects and images to "understand the subject"
Lots of interesting research on learning with less; recent advances such as "learning to walk in 5 min". did u see my RL video?
@ArtOfTheProblem edit: I'm pretty sure in the future a lot of people will be fired and replaced by those "ai", and literally by the people that use the ai. And also I get what you mean
How can you go through the run-up to AI without mentioning "Attention Is All You Need", the 2017 paper from the University of Toronto which was the inspiration for LLMs?
covered that in detail in my previous video (th-cam.com/video/OFS90-FX6pg/w-d-xo.html)
we think in a way that math can sorta explain, but it's not math that makes our actual brains function, as math is just a construct…
yes, and I think it's more like 'algorithms' which are very very approximate
@@ArtOfTheProblem i wonder what would happen if Neuralink tried LLMs…since I think they work with synapses (i think)?
@@aiamfree definitely could imagine that, you could "co think" in an interesting way....
Thinking is for fools lol, now KNOWING….. knowing is Cool AF😎!
wow love
thank you! stay tuned for more
I think people make it too difficult to conjecture about computers' thinking.
Thank you so much, love this perspective
good video but to be constructive the music is definitely too loud and distracting
see no music version in top comment, stay tuned
So, How ChatGPT Learned to Reason?
do you like this title?
@@ArtOfTheProblem, of course not, since it is misleading like any advert. The topic is in no way covered at all. Pinpoint the time marker in the video if you think otherwise. So, my point is that the question is still on the table.
@@doctorshadow2482 yes good point, how about "Can ChatGPT reason?" obviously i do try to show what most people agree is the method, MCTS on chains of thought. then there is the other camp that thinks it's all fake etc.
or just "can ChatGPT think?" i'm gonna try that
@@ArtOfTheProblem, name it "How neural networks could learn (almost) anything". This video has nothing about ChatGPT at all. No specifics of any kind. It provides a very abstract, high-level popular-science documentary with scattered thoughts. It lacks focus and real information. Anyway, it could be interesting for total beginners, so please continue!
They are intelligent.