Why The Next AI Breakthroughs Will Be In Reasoning, Not Scaling

Y Combinator

มุมมอง 25 597

เพิ่มลงใน
- เพลย์ลิสต์ของฉัน
- ดูภายหลัง
แชร์

แชร์

ฝัง

ขนาดวิดีโอ:

แสดงแผงควบคุมโปรแกรมเล่น

เล่นอัตโนมัติ

เล่นใหม่

เผยแพร่เมื่อ 14 พ.ย. 2024

ความคิดเห็น • 80

@chapterme 20 ชั่วโมงที่ผ่านมา ⁺¹²
Chapters (Powered by ChapterMe) -
0:00 Intro
1:15 The intelligence age
4:18 YC o1 hackathon
12:09 4 orders of magnitude
14:42 The architecture of o1
21:52 Getting that final 10-15% of accuracy
32:06 The companies/ideas that should pivot because of o1
34:44 Outro
@scienceinc.9453 19 ชั่วโมงที่ผ่านมา
@chapterme great marketing
@winddude9 20 ชั่วโมงที่ผ่านมา ⁺⁷³
are they not allowed to talk about claude or something?
@brucebain7340 16 ชั่วโมงที่ผ่านมา ⁺⁶
They mentioned claude several times in the last episode genius
@vvmm3712 15 ชั่วโมงที่ผ่านมา ⁺⁸
@@brucebain7340Yeah they did but I think they are quite biased towards OpenAI. I have used premium versions of Gemini, OpenAI and Claude. Claude is still neck to neck, if not ahead of OpenAI. Also YC needs to be more objective in their POV. I think it happens because most of them talk to founders and read articles etc. rather than using the models extensively themselves to build stuff. Makes a lot of difference in how one perceives technology.
@brucebain7340 15 ชั่วโมงที่ผ่านมา
Of course they are biased. They're human.
Whoever expects them to be impartial will be disappointed.
@vvmm3712 15 ชั่วโมงที่ผ่านมา
@@brucebain7340The bias is not about what to order for dinner. The bias is that company 'A' is going to unlock AGI, Company 'A's models are going to be orders of magnitude better because Mr. Sam told us so. This bias can lead to misjudgements on YC's part which can impact its own investments which run into millions. I mean won't it be sad that someone can simply walk into your office and skew your own vision of something.
@vvmm3712 14 ชั่วโมงที่ผ่านมา ⁺³
@@brucebain7340 The bias is not about what to order for dinner. The bias is also not from a colleague of mine who thinks that Apple built the Apple Intelligence(AI)!
The bias is from a group of people running one of the topmost accelerators in the world.
The bias is that company 'A' is going to unlock AGI, Company 'A's models are going to be orders of magnitude better because Mr. S told us so. This bias can lead to misjudgements on YC's part which can impact its own investment which runs into millions.
@counterfeit25 16 ชั่วโมงที่ผ่านมา ⁺⁵
4:30 Diode Computers is doing PCB (printed circuit board) design, not chip design (like NVIDIA)
@GNARGNARHEAD 15 ชั่วโมงที่ผ่านมา ⁺⁴
those hackathon results were really impressive 🤯
@princee9385 15 ชั่วโมงที่ผ่านมา ⁺⁷
Given all these developments, I need a remote job. PhD since 2022.
@ZevUhuru 12 ชั่วโมงที่ผ่านมา ⁺⁵
LOL. What is Gary's obsession with 'Raw Dogging', he's said this on multiple videos. Bro, stay safe out there 😂
@soyhenryxyz 16 ชั่วโมงที่ผ่านมา ⁺¹⁵
"If an LLM task is hallucinating, it’s likely doing too much. Break it down into steps"
@pin65371 10 ชั่วโมงที่ผ่านมา
The mistake people make when using these tools is they are not specific enough. Treat is like a human or team of humans and your results improve.
@ccash3290 9 ชั่วโมงที่ผ่านมา ⁺¹
@@pin65371 Treat it like a child; Adults are smart and can understand ambiguity and ask for clarity.
@cmw3737 ชั่วโมงที่ผ่านมา
It usually means that there are missing parameters that it has to guess. Instead it should understand ambiguity and be more interactive and say "I don't know" or "it depends" and ask for what it needs to answer the request.
@Manwith6secondmemory 10 ชั่วโมงที่ผ่านมา ⁺⁴
Bro omg, are we hitting a wall or not
@varunvummadi350 ชั่วโมงที่ผ่านมา
Thanks a lot for shout out and adapting us Garry 😂
@jamesulan1 ชั่วโมงที่ผ่านมา
Yeah, AI for customer support is solving something like 20-35% of most customers' support tickets. Tickets are usually the easier ones that come up most frequently.
@gpshangari 19 ชั่วโมงที่ผ่านมา ⁺¹¹
The people in chat against OpenAi have no idea what's going on. It's too late to stop AI, it's not a fad and it is not going away. There is no new fad. The only thing that kept humanity together was intelligence, now things are going to accelerate. But the benefits will only reach a few of course.
@realbillnye 12 ชั่วโมงที่ผ่านมา ⁺¹
It's more so that LLM cannot truly reason. The reasoning from o1 is chain of thought using the same LLM backend. This will likely not scale to AGI. I'm here for the ride, let's see
@drichards4426 8 ชั่วโมงที่ผ่านมา ⁺¹
Who’s saying it’s a fad ?
@alonroth11 20 ชั่วโมงที่ผ่านมา ⁺³
Can you show the info you are looking at....
@Arcticwhir 12 ชั่วโมงที่ผ่านมา ⁺¹
very inspiring and exciting podcast to look forward to the future.
@NilsWestgardh 18 ชั่วโมงที่ผ่านมา ⁺³
The AI bot beating Dota 2 pros at The International put OpenAI on the map for me.
@vivek-singh-se 18 ชั่วโมงที่ผ่านมา ⁺⁷
Please share link to Sam Altman's essay referred early in the video.
@aaithubarla 8 ชั่วโมงที่ผ่านมา
Bruh, Google it.
@jean-phil 5 ชั่วโมงที่ผ่านมา
google The Intelligence Age
@aaithubarla 3 ชั่วโมงที่ผ่านมา
can you not google it? Is it that hard?
@Daniely-z2k ชั่วโมงที่ผ่านมา
I love the girl. she's so authentic. When she speaks, it feels like you're right there with her, having a conversation as if you're sitting together in a cozy coffee shop, where everyone feels like friends.
@yadniksable ชั่วโมงที่ผ่านมา
Advice to everyone building a product. Ai is for scaling your solution your insight in a particular domain. So work on insight building and use ai model to scale that solution to solve the problem at a deeper level for target user.
@Analyse_US 17 ชั่วโมงที่ผ่านมา ⁺⁹
Feels like alot of coolaid being chugged back here. I am personally becoming more skeptical by the day. Microsoft probably has invested the most money and time in this space and their AI driven products are under whelming. I am still wanting and waiting to see a compelling AI LLM driven productivity tool. All I am seeing is demos that include a high degree of deception. For example, the first Gemini duck demo, Devin Upwork demo, Tesla robots with human controllers etc. I am at the stage where I need to actually have access to the product, to believe any claims.
@vvmm3712 15 ชั่วโมงที่ผ่านมา ⁺⁴
Kinda agree..
@vdimension6300 12 ชั่วโมงที่ผ่านมา ⁺³
Same
@jean-phil 4 ชั่วโมงที่ผ่านมา
hmm there are already products using AI .. just look at google already generating 25% of their code by AI, think of all the AI features in Adobe Lightroom and other Adobe products, AI features in phones, self driving cars, AI finding 0 day hack .. I even used co-pilot this week at work to generate some reports .. and this is just the beginning
@winnerswritethestory3370 4 ชั่วโมงที่ผ่านมา
you must be living under a rock then
@Analyse_US 3 ชั่วโมงที่ผ่านมา
@@winnerswritethestory3370Name a successful commercial LLM product that isn't a base model.
@elliptictree 18 ชั่วโมงที่ผ่านมา ⁺⁴
Interesting
@Dom-zy1qy 4 ชั่วโมงที่ผ่านมา
You want AGI? Heres how:
1. Make an LLM good enough to create and implement highly accurate environments for use in RL. This should work with any arbitrary task.
2. Train a good policy
3. Profit?
@netsurfer256 16 ชั่วโมงที่ผ่านมา ⁺¹
lbh current models and APIs already can handle the scaling cases of use
@parvbhullar 18 ชั่วโมงที่ผ่านมา ⁺¹
Garry is obsessed with evals 😊
@sudheerkumarme 19 ชั่วโมงที่ผ่านมา ⁺⁵
I would like to see more Indian startups funded by YC. Please consider opening an YC India to invest in Indian startups.
@CardboardBoxed 16 ชั่วโมงที่ผ่านมา ⁺²
Indian startups and have some of the worst ROI. They’re usually not internationally trusted so many VCs won’t fund them. You’re better off looking for domestic investors.
@michaelocean4788 18 ชั่วโมงที่ผ่านมา ⁺³
I get confused when Gary uses 'eval' to refer to testing, as it means something different in Python. Terms like 'metrics' or 'benchmarks' are more common in the LLM context and feel more precise.
@Dom-zy1qy 5 ชั่วโมงที่ผ่านมา
I see "eval" more commonly used than "metrics".
@qet-lab 19 นาทีที่ผ่านมา
Tic Tac Toe . Its become difficult to build AI applications. Open AI keeps wrapping them up every 6 months.
@jonathanedwardgibson 20 ชั่วโมงที่ผ่านมา ⁺¹
The mind expands, not stacks.
@hongyihuang3560 10 ชั่วโมงที่ผ่านมา
I do not believe that AI will be able to do chip design better than humans. PCBs maybe, chips no. It took the brightest people around the globe (literally any good chips you see today touches at least decades of research from Japan, Europe, US IP). There are so many specialties in chip design including testing, production ready, simulation, RF, documentation, security, compiler that’s beyond the capabilities of a closed loop LLM.
If LLMs PCB design capability transfers to SoCs, China would have already made and beat Apple & NVIDIA.
The first half is absolutely make believe, solve physics, nuclear power, climate… seriously? AI will solve societal issues? I find it hard to believe that next gen data centers for AI will need nuclear power.
The second half I agree more on the real progress of AI: do tests, create a moat by building agents and accumulate proprietary data.
Speculation is dangerous, I hope people can think for themselves.
@JS-mj9en 18 ชั่วโมงที่ผ่านมา ⁺¹
AI should eliminate the need for customer support, so an AI customer support solution seems destined to fail
@tonypeng8792 18 ชั่วโมงที่ผ่านมา
Amazing!
@no-wai 20 ชั่วโมงที่ผ่านมา ⁺³
I hope you guys are not blinded by Openai
@vladimirbosinceanu5778 2 ชั่วโมงที่ผ่านมา
more biomimicry --> intelligence on tap
@Mayeverycreaturefindhappiness 16 ชั่วโมงที่ผ่านมา
didn't Orion disappoint? I don't think we can just assume it will keep scaling. I am excited for the o series.
@DakshGuptaCuriosium 20 ชั่วโมงที่ผ่านมา
shoutout atopile
@thehappydaysapp 19 ชั่วโมงที่ผ่านมา ⁺⁴
All this advancement is amazing but what I do not understand is, how is most of this advancement actually helping humanity? How is it helping the majority of humanity and not the one percent of investors?
@scienceinc.9453 19 ชั่วโมงที่ผ่านมา
There's no law in physics to ensure that
@sprobertson 19 ชั่วโมงที่ผ่านมา ⁺¹
That's not what companies are for
@williamliu796 15 ชั่วโมงที่ผ่านมา ⁺¹
you can now talk to one of these LLMs and learn almost anything / ask questions about anything. Every kid with access to the internet can has a personal tutor for every subject for free or $20/month. I’d say humanity is being helped.
@zipytshorts 14 ชั่วโมงที่ผ่านมา ⁺¹
you know when companies are more efficient that means you have cheaper products/services or any other goods
@AlexWilkinsonYYC 13 ชั่วโมงที่ผ่านมา
Capitalism ... is for capitalists. 😉 It's in the name.
@AlexWilkinsonYYC 13 ชั่วโมงที่ผ่านมา
4 people agreeing on everything is boring. Get a homeless guy in there or something.
I can't wait for them to replace all the customer service agente with AI because ill make an AI call center that calls all their customer service centers and tries to manipulate them.
Adoption isnt going to be super quick because having a surface area for attack thay massive is a huge liability.
@jvijayavallabh5869 20 ชั่วโมงที่ผ่านมา ⁺⁵
First
@Drackomass 20 ชั่วโมงที่ผ่านมา
I was so close!
@ammarkov 19 ชั่วโมงที่ผ่านมา
hahahah
@AntonioLaPlaca 15 ชั่วโมงที่ผ่านมา
lol
@Төлеби 13 ชั่วโมงที่ผ่านมา ⁺¹
Diana Hu beautiful
@rustamkhujarustamov2929 20 ชั่วโมงที่ผ่านมา
third
@Drackomass 20 ชั่วโมงที่ผ่านมา
Second
@seanlive6975 16 ชั่วโมงที่ผ่านมา ⁺¹
Still not seeing these tools achieve anything that humans are not capable of. Some efficiency gains maybe and useful as a learning tool. I think it's a limitation of the stochastic parrot from training data approach, it's never going to be creative and bring new innovations. That will need a new approach entirely.
@AlexWilkinsonYYC 13 ชั่วโมงที่ผ่านมา
No way. I can't put bugs all over my code base nearly as fast as Cursor can.
@Carthodon 3 ชั่วโมงที่ผ่านมา
@@AlexWilkinsonYYC You surprised a lol out of me.
@XX-pl9wp 18 ชั่วโมงที่ผ่านมา
You should listen to the Marvin Minsky conversations from 80s AI Winter. Please stop talking about AGI until then. You're clueless!!!
@man4hire 20 ชั่วโมงที่ผ่านมา
Fifth
@rkara2 17 ชั่วโมงที่ผ่านมา
AGI means you have solved the horizontal scalability problem. Which means having the ability to access any companies database public or private.
So can one of you geniuses explain how exactly that is going to happen??
My guess is that all of you are being duped by Altman because he has to say s**t like that to keep his investors satisfied and feeling good about themselves lol 😂

ต่อไป

เล่นอัตโนมัติ

The 10 Trillion Parameter AI Model With 300 IQ