Generative AI in a Nutshell - how to survive and thrive in the age of AI

Yoshua Bengio Doesn’t Think We’re Ready for Superhuman AI. We’re Building it Anyway.

The Future Mark Zuckerberg Is Trying To Build

IR Iran v Morocco | FIFA Futsal World Cup 2024 | Round of 16 | Highlights

อะไรในกล่อง #challange #gamechallenge #thesnack #จีโน่ #ปอนด์ #friendchallenge #mysterybox #shorts

BUS ชื่อกรุ๊ปไลน์ จดหมายอนาคต และความรู้สึกที่ไม่เคยบอกกัน | Chairs to Share EP.57

22 - Shard Theory with Quintin Pope

AXRP

มุมมอง 570

เพิ่มลงใน
- เพลย์ลิสต์ของฉัน
- ดูภายหลัง
แชร์

แชร์

ฝัง

ขนาดวิดีโอ:

แสดงแผงควบคุมโปรแกรมเล่น

เล่นอัตโนมัติ

เล่นใหม่

เผยแพร่เมื่อ 27 ก.ย. 2024
What can we learn about advanced deep learning systems by understanding how humans learn and form values over their lifetimes? Will superhuman AI look like ruthless coherent utility optimization, or more like a mishmash of contextually activated desires? This episode's guest, Quintin Pope, has been thinking about these questions as a leading researcher in the shard theory community. We talk about what shard theory is, what it says about humans and neural networks, and what the implications are for making AI safe.
Patreon: patreon.com/axrpodcast
Ko-fi: ko-fi.com/axrpodcast
Episode art by ‪@hamishdoodles‬
Topics we discuss, and timestamps:
- 0:00:42 - Why understand human value formation?
- 0:19:59 - Why not design methods to align to arbitrary values?
- 0:27:22 - Postulates about human brains
- 0:36:20 - Sufficiency of the postulates
- 0:44:55 - Reinforcement learning as conditional sampling
- 0:48:05 - Compatibility with genetically-influenced behaviour
- 1:03:06 - Why deep learning is basically what the brain does
- 1:25:17 - Shard theory
- 1:38:49 - Shard theory vs expected utility optimizers
- 1:54:45 - What shard theory says about human values
- 2:05:47 - Does shard theory mean we're doomed?
- 2:18:54 - Will nice behaviour generalize?
- 2:33:48 - Does alignment generalize farther than capabilities?
- 2:42:03 - Are we at the end of machine learning history?
- 2:53:09 - Shard theory predictions
- 2:59:47 - The shard theory research community
- 3:13:45 - Why do shard theorists not work on replicating human childhoods?
- 3:25:53 - Following shardy research
The transcript
: axrp.net/episo...
Shard theorist links:
- Quintin's LessWrong profile: www.lesswrong....
- Alex Turner's LessWrong profile: www.lesswrong....
- Shard theory Discord: / discord
- EleutherAI Discord: / discord
Research we discuss:
- The Shard Theory Sequence: www.lesswrong....
- Pretraining Language Models with Human Preferences: arxiv.org/abs/...
- Inner alignment in salt-starved rats: www.lesswrong....
- Intro to Brain-like AGI Safety Sequence: www.lesswrong....
- Brains and transformers:
- The neural architecture of language: Integrative modeling converges on predictive processing: www.pnas.org/d...
- Brains and algorithms partially converge in natural language processing: www.nature.com...
- Evidence of a predictive coding hierarchy in the human brain listening to speech: www.nature.com...
- Singular learning theory explainer: Neural networks generalize because of this one weird trick: www.lesswrong....
- Singular learning theory links: metauni.org/slt/
- Implicit Regularization via Neural Feature Alignment, aka circles in the parameter-function map: arxiv.org/abs/...
- The shard theory of human values: www.lesswrong....
- Predicting inductive biases of pre-trained networks: openreview.net...
- Understanding and controlling a maze-solving policy network, aka the cheese vector: www.lesswrong....
- Quintin's Research agenda: Supervising AIs improving AIs: www.lesswrong....
- Steering GPT-2-XL by adding an activation vector: www.lesswrong....
Links for the addendum on mesa-optimization skepticism:
- Quintin's response to Yudkowsky arguing against AIs being steerable by gradient descent: www.lesswrong....
- Quintin on why evolution is not like AI training: www.lesswrong....
- Evolution provides no evidence for the sharp left turn: www.lesswrong....
- Let's Agree to Agree: Neural Networks Share Classification Order on Real Datasets: arxiv.org/abs/...

ความคิดเห็น • 4

@Dan-dy8zp 7 หลายเดือนก่อน
There's a lot more than heartbeat and such we are born with. We expect 3 dimensions of space and one of time and that we are agents with preferences for future states of the world. We expect other things that move or have what appear to be 'eyes', to be other agents. We try to figure out what those agents 'want' soon after birth. We can exhibit jealousy by 3 months of age. We can recognize some facial expressions instinctively. People can have phantom limb syndrome who never had the limb, so there is a mental map of a normal human body. Probably many many more things.
@BrainWrinklers ปีที่แล้ว
Hey Quinten, wanna come on our show? We talk rationality and AI safety+alignment.
@Dan-dy8zp 7 หลายเดือนก่อน
Also, it seems relevant that bilateral anterior cingulate cortex destruction produces a psychopath, at any age. That seems a very important point. We don't really learn moral behavior, not from a blank slate exposure to our world.
We learn morality the way we learn to walk. Its perfecting our strategy for doing something we basically already know how to do instinctively.
@Dan-dy8zp 7 หลายเดือนก่อน
Religiosity is considered heritable 25%-60% .

ต่อไป

เล่นอัตโนมัติ

Generative AI in a Nutshell - how to survive and thrive in the age of AI

Generative AI in a Nutshell - how to survive and thrive in the age of AI

Yoshua Bengio Doesn’t Think We’re Ready for Superhuman AI. We’re Building it Anyway.

Yoshua Bengio Doesn’t Think We’re Ready for Superhuman AI. We’re Building it Anyway.

The Future Mark Zuckerberg Is Trying To Build

The Future Mark Zuckerberg Is Trying To Build

IR Iran v Morocco | FIFA Futsal World Cup 2024 | Round of 16 | Highlights

IR Iran v Morocco | FIFA Futsal World Cup 2024 | Round of 16 | Highlights

อะไรในกล่อง #challange #gamechallenge #thesnack #จีโน่ #ปอนด์ #friendchallenge #mysterybox #shorts

อะไรในกล่อง #challange #gamechallenge #thesnack #จีโน่ #ปอนด์ #friendchallenge #mysterybox #shorts

BUS ชื่อกรุ๊ปไลน์ จดหมายอนาคต และความรู้สึกที่ไม่เคยบอกกัน | Chairs to Share EP.57

BUS ชื่อกรุ๊ปไลน์ จดหมายอนาคต และความรู้สึกที่ไม่เคยบอกกัน | Chairs to Share EP.57

Officer Rabbit is so bad. He made Luffy deaf. #funny #supersiblings #comedy

Officer Rabbit is so bad. He made Luffy deaf. #funny #supersiblings #comedy

24 - Superalignment with Jan Leike

24 - Superalignment with Jan Leike

Style Theory: School Dress Codes Will RUIN Your Life!

Style Theory: School Dress Codes Will RUIN Your Life!

How are memories stored in neural networks? | The Hopfield Network #SoME2

How are memories stored in neural networks? | The Hopfield Network #SoME2

What Shakespeare's English Sounded Like - and how we know

What Shakespeare's English Sounded Like - and how we know

How Heathcliff Lost His Cool

How Heathcliff Lost His Cool

Has Generative AI Already Peaked? - Computerphile

Has Generative AI Already Peaked? - Computerphile

Geoffrey Hinton | On working with Ilya, choosing problems, and the power of intuition

Geoffrey Hinton | On working with Ilya, choosing problems, and the power of intuition

Film Theory: Bluey is MUCH Darker Than You Realize!

Film Theory: Bluey is MUCH Darker Than You Realize!

The Physics and Philosophy of Time - with Carlo Rovelli

The Physics and Philosophy of Time - with Carlo Rovelli

50 ชั่วโมง ภารกิจด่วน!! ตะลุยช่วยผู้ประสบภัยน้ำท่วม!!

50 ชั่วโมง ภารกิจด่วน!! ตะลุยช่วยผู้ประสบภัยน้ำท่วม!!

ATLAS - ฉันคนเก่า ( Let Me Try Again ) | Official MV

ATLAS - ฉันคนเก่า ( Let Me Try Again ) | Official MV

Liverpool 5-1 West Ham | Carabao Cup Highlights

Liverpool 5-1 West Ham | Carabao Cup Highlights

เกว็นมาอุดหนุนร้าน BEN 10 ตามสั่ง #ตลก #ละครสั้น #ben10 #บ้านกูเอง

เกว็นมาอุดหนุนร้าน BEN 10 ตามสั่ง #ตลก #ละครสั้น #ben10 #บ้านกูเอง

เมื่อเพื่อนผมว่างจนเกินไป เลยโดนตอกฟรี...| Minecraft #minecraft #มายคราฟ #fypシ #minecraftmemes #ตลก

เมื่อเพื่อนผมว่างจนเกินไป เลยโดนตอกฟรี...| Minecraft #minecraft #มายคราฟ #fypシ #minecraftmemes #ตลก

LIVE : Buriram United vs Kaya FC-Iloilo | SHOPEE CUP 2024/25 | 26.09.24

LIVE : Buriram United vs Kaya FC–Iloilo | SHOPEE CUP 2024/25 | 26.09.24

Please Help This Poor Boy 🙏

Please Help This Poor Boy 🙏

We finally APPROVED @ZachChoi

We finally APPROVED @ZachChoi