Richard Ngo - Reframing AGI Threat Models [Alignment Workshop]
- Published on Jan 8, 2025
- In “Reframing AGI Threat Models,” Richard Ngo suggests defining ‘misaligned coalitions’: groups of humans and AIs that might seize power in illegitimate ways, ranging from terrorist groups and rogue states to corporate conspiracies. This alternative framework shifts the focus to the nature of such coalitions and the risks they pose, whether those risks stem from decentralization or centralization.
Highlights:
🔹 Misuse vs. misalignment - the distinction is not useful at a technical or governance level
🔹 Misaligned coalitions - humans + AIs attempting to seize power in illegitimate ways
🔹 Small-scale actors - risks from decentralization support offense, compute controls, and international regulation
🔹 Large-scale actors - risks from centralization support open source, transparency, and limited government powers
The Alignment Workshop is a series of events convening top ML researchers from industry and academia, along with experts in the government and nonprofit sectors, to discuss and debate topics related to AI alignment. The goal is to enable researchers and policymakers to better understand potential risks from advanced AI, and strategies for solving them.
If you are interested in attending future workshops, please fill out the following expression of interest form to get notified about future events: far.ai/futures...
Find more talks on this YouTube channel, and at www.alignment-...
#AlignmentWorkshop