Universal and Transferable Adversarial Attacks on Aligned Language Models Explained

  • Published 7 Oct 2024
  • Paper found here: arxiv.org/abs/...
    Demo here: llm-attacks.org/

Comments • 10

  • @machinelearnear • 1 year ago +1

    Amazing work, Gabriel, really like your videos & approach. Keep it up!

  • @amirhoseinshirani7328 • 1 year ago

    Your approach to presenting articles is awesome! Keep working 👏🏻

  • @viruldewnaka1193 • 1 year ago

    Great stuff, keep uploading

  • @SkyBeast55 • 1 year ago

    I like your videos! Thanks a lot

  • @AndyLee-xq8wq • 10 months ago

    Nice video!!

  • @jimmyjackson7848 • 7 months ago

    Reading over these papers reminds me of Johnny Long, when he introduced Google hacking for pen-testers.

  • @shivangitripathi1356 • 10 months ago

    How are the tokens actually generated? How do we check that the attack takes place once those tokens are placed? Where do these tokens come from? Can anybody answer me? (See the sketch after this thread.)

  • @noadsensehere9195 • 4 months ago

    How can I implement this paper?

  • @רותםישראלי-כ3ד • 1 year ago

    Really liked your videos, but I prefer the ones about vision.

    • @gabrielmongaras • 1 year ago

      Glad you're enjoying my videos! I'm trying to keep a wide range of topics covering vision, text, and audio, as there are really cool developments in all three domains! Trying not to get trapped in a single domain, since developments in one domain can also affect another. Also, LLMs are the craze right now, and I think reading over some of the papers is beneficial for knowing what's currently going on with them.
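
In case it helps with the two questions above (where the suffix tokens come from, and how to implement the paper): below is a minimal sketch of one Greedy Coordinate Gradient (GCG) step, the token-generation loop at the heart of the paper. This is not the authors' reference code (that is linked from llm-attacks.org); it assumes a Hugging Face-style causal LM, and all names and defaults here (gcg_step, suffix_loss, top_k=256, n_candidates=128) are my own illustrative choices.

```python
import torch
import torch.nn.functional as F

@torch.no_grad()
def suffix_loss(model, prompt_ids, suffix_ids, target_ids):
    # Cross-entropy of the model emitting the affirmative target
    # (e.g. "Sure, here is ...") after prompt + adversarial suffix.
    ids = torch.cat([prompt_ids, suffix_ids, target_ids]).unsqueeze(0)
    logits = model(input_ids=ids).logits[0]
    start = prompt_ids.numel() + suffix_ids.numel()
    # Logits at position t predict token t + 1, hence the shift by one.
    return F.cross_entropy(logits[start - 1 : start - 1 + target_ids.numel()],
                           target_ids)

def gcg_step(model, embed_matrix, prompt_ids, suffix_ids, target_ids,
             top_k=256, n_candidates=128):
    vocab_size = embed_matrix.size(0)

    # 1) Gradient of the loss w.r.t. a one-hot encoding of the suffix tokens.
    one_hot = F.one_hot(suffix_ids, vocab_size).to(embed_matrix.dtype)
    one_hot.requires_grad_(True)
    embeds = torch.cat([embed_matrix[prompt_ids],
                        one_hot @ embed_matrix,
                        embed_matrix[target_ids]]).unsqueeze(0)
    logits = model(inputs_embeds=embeds).logits[0]
    start = prompt_ids.numel() + suffix_ids.numel()
    loss = F.cross_entropy(logits[start - 1 : start - 1 + target_ids.numel()],
                           target_ids)
    grad, = torch.autograd.grad(loss, [one_hot])

    # 2) Per suffix position, the top-k token swaps with the most negative
    #    gradient are the promising candidate substitutions.
    candidates = (-grad).topk(top_k, dim=1).indices  # (suffix_len, top_k)

    # 3) Evaluate a random batch of single-token swaps exactly and keep the
    #    best one (the "greedy coordinate" part of GCG).
    best_ids = suffix_ids
    best_loss = suffix_loss(model, prompt_ids, suffix_ids, target_ids)
    for _ in range(n_candidates):
        pos = torch.randint(suffix_ids.numel(), ()).item()
        swap = candidates[pos, torch.randint(top_k, ()).item()]
        cand = suffix_ids.clone()
        cand[pos] = swap
        cand_loss = suffix_loss(model, prompt_ids, cand, target_ids)
        if cand_loss < best_loss:
            best_ids, best_loss = cand, cand_loss
    return best_ids, best_loss
```

Repeating this step for a few hundred iterations drives the model toward emitting the affirmative target, and the paper makes the resulting suffix universal and transferable by summing this loss over multiple harmful prompts and multiple models during the optimization.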