These are the topics I like to see, awesome.
More to come!
The title of the last two papers combined should be "Test Time Turbo Is All You Need".
Smile.
It would be interesting to see when an LLM can solve sudoku (even an easy one). LLMs seem to struggle with those (tested with the newest DeepSeek DeepThink), even if you give them a little advice in the prompt. Of course there might be other AI that solves them better, but if LLMs are the path to AGI, that could be one potential benchmark for logical thinking. Also word puzzles, "Sanaristikko" in Finnish, are quite easy for people but difficult for LLMs.
You just have to pre-train your model on this particular task, in your case a visual and geometric word pattern, and the performance will go up. Fine-tuning will not deliver results if the model was not pre-trained on the task. It is all a question of the pre-training datasets, their complexity and geometry (and more ...).
Well, I cannot see why this would be required at all. All the model has to do is recognize that this is a sudoku, generate the algorithm in Python to solve it (that is in the dataset for sure), and execute it.
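For reference, the kind of standard solver an LLM could plausibly generate and then run through a code tool is only a few lines. A minimal backtracking sketch (illustrative only; the grid is assumed to be a 9x9 list of lists with 0 for empty cells):

```python
def valid(grid, r, c, v):
    # Check row, column and 3x3 box for the candidate value v.
    if any(grid[r][j] == v for j in range(9)):
        return False
    if any(grid[i][c] == v for i in range(9)):
        return False
    br, bc = 3 * (r // 3), 3 * (c // 3)
    return all(grid[br + i][bc + j] != v for i in range(3) for j in range(3))

def solve(grid):
    # Find the first empty cell and try all candidates recursively.
    for r in range(9):
        for c in range(9):
            if grid[r][c] == 0:
                for v in range(1, 10):
                    if valid(grid, r, c, v):
                        grid[r][c] = v
                        if solve(grid):
                            return True
                        grid[r][c] = 0
                return False
    return True  # no empty cells left: solved
```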
I have one question.
In the TTT+TTC combination (from the last video), isn't it possible to just not reset the parameters in the TTT part, or to do some kind of selective resetting or re-LoRA training after solving the problem (roughly the flow in the sketch below)? That would result in persistent continuous learning.
Won't this drive down computational costs dramatically, since retraining will only happen in novel situations, while for problems that get more and more repetitive it will just run like a traditional LLM to reach the solution, without the same extensive thinking time?
Also, intelligence will grow like a virus.
Thoughts?
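Purely as an illustration of the control flow I have in mind (a rough sketch; is_novel, train_lora_adapter and solve are hypothetical placeholders, not anything from the papers):

```python
# Sketch of the proposed "keep the adapter instead of resetting" loop.
# All functions are stand-ins so the flow is runnable end to end.

def is_novel(problem, memory):
    # Placeholder novelty check: have we seen a similar problem before?
    return problem not in memory

def train_lora_adapter(model, problem):
    # Placeholder for test-time training (e.g. fitting a LoRA adapter and keeping it).
    return model + [f"adapter({problem})"]

def solve(model, problem):
    # Placeholder for ordinary inference / test-time compute.
    return f"solution({problem}) using {len(model)} adapters"

memory = set()
model = []  # base model plus accumulated adapters

for problem in ["A", "B", "A", "A"]:
    if is_novel(problem, memory):
        model = train_lora_adapter(model, problem)  # expensive path, only for novel tasks
        memory.add(problem)
    print(solve(model, problem))  # repeated tasks reuse what was already learned
```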
If you do not reset the parameters, you encounter memorization.
@code4AI Is that bad?
Don't we want it to memorize what it has learned, and go through the process of learning again only when we need it to?
You don't understand. We want the LLM to understand the problem and the solution on a generic level, so it can be applied continuously to all problems. This is what we call "learning the solution".
If the LLM just memorizes one solution string, that knowledge is not inherent in what it has learned, and therefore it will fail with the slightest variation of the given task. It is like with humans: if I just memorize the answers to a test, without understanding the underlying methodology or why this is the solution, I will pass this single test but fail right at the next one, because I haven't learned anything. I just memorized a string.
18:04: I found it surprising that so many people felt that this seemed like cheating.
To my mind, if it is just doing computations on the input to get the output, and it stays within the compute requirements of the challenge, why would it be cheating?
Solomonoff induction presumably isn't cheating, except that it would take too much compute time and memory, so why would this be cheating?
Because you have a permutation training sequence that includes the solution to the test.
To *a* test, yes, but only an answer that was part of the question one was actually given. There is no outside help involved, no extra information given about the answer to the actual question one is expected to answer.
The task is, "given these input/output pairs, determine the answer for this other input". Training on (a processed version of) the given input/output pairs is just doing a computation on part of the input one is given.
To think of it as cheating, I think one would have to confuse different levels of abstraction about the problem.
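To make the point concrete, here is a rough sketch of the setup as I understand it: everything trained on at test time is computed from the task's own demonstration pairs, and the test output never enters the process. The names augment, fine_tune and predict are hypothetical placeholders, not the actual method from the paper:

```python
# Sketch: the test-time "training data" is derived entirely from the task's
# own demonstration pairs; the test output never appears anywhere.

def augment(pairs):
    # Stand-in for real augmentations (e.g. permutations of the given demos).
    return pairs + [(y, x) for (x, y) in pairs]

def fine_tune(model, pairs):
    # Stand-in for gradient updates on the augmented demonstrations.
    return model + list(pairs)

def predict(model, test_input):
    # Stand-in for inference on the held-out test input.
    return f"guess for {test_input} from {len(model)} examples"

task = {"demos": [("in1", "out1"), ("in2", "out2")], "test_input": "in3"}

model = fine_tune([], augment(task["demos"]))  # computation on the given input only
print(predict(model, task["test_input"]))      # the test output is never used
```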