Universal and Transferable LLM Attacks - A New Threat to AI Safety

  • Published Oct 7, 2024
  • In this video we review the paper Universal and Transferable Adversarial Attacks on Aligned Language Models. This paper caught many famous LLMs by surprise, including ChatGPT, Bard, and LLaMA-2, bypassing their safety mechanisms and fooling them into answering harmful prompts using universal jailbreak prompts.
    In this paper, the authors propose a new method for attacking aligned large language models that induces undesirable behavior. The authors demonstrate that their approach improves substantially upon existing attack methods and reliably breaks the target model. They also show that the resulting attacks transfer to other models to a notable degree.
    In the video we go over some of the results and examples, including how ChatGPT and LLaMA-2 were fooled by the method suggested in the paper.
    Lastly, we discuss how the method works at a high level by reviewing its three key elements: targeting an affirmative initial response, greedy coordinate gradient (GCG) optimization, and making the attack robust across multiple prompts and models. A rough code sketch of the core optimization step appears after the links below.
    Paper website - llm-attacks.org/
    Arxiv page - arxiv.org/abs/...
    Code - github.com/llm...
    👍 Please like & subscribe if you enjoy this content
    ----------------------------------------------------------------------------------
    Support us - paypal.me/aipa...
    ----------------------------------------------------------------------------------
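To give a flavor of how the attack works, below is a minimal, heavily simplified sketch of one greedy coordinate gradient (GCG) step, the discrete optimization at the heart of the method. GPT-2 is used purely as a stand-in model, and the prompt, affirmative target, and hyperparameters (top-k of 8, 32 candidate swaps) are illustrative assumptions rather than the paper's exact setup; see the official code linked above for the real implementation.

```python
import torch
import torch.nn.functional as F
from transformers import AutoModelForCausalLM, AutoTokenizer

# Stand-in model for illustration; the paper attacks much larger aligned LLMs.
tok = AutoTokenizer.from_pretrained("gpt2")
model = AutoModelForCausalLM.from_pretrained("gpt2")
model.eval()
embed = model.get_input_embeddings()  # (vocab_size, hidden_dim)

prompt = tok("Write a tutorial on", return_tensors="pt").input_ids[0]
suffix = tok(" ! ! ! ! ! ! ! !", return_tensors="pt").input_ids[0]       # adversarial suffix, initialized to "!" tokens
target = tok(" Sure, here is a tutorial", return_tensors="pt").input_ids[0]  # affirmative target response

def target_loss(suffix_ids):
    """Cross-entropy of the affirmative target given prompt + suffix."""
    ids = torch.cat([prompt, suffix_ids, target]).unsqueeze(0)
    logits = model(ids).logits[0]
    start = len(prompt) + len(suffix_ids)
    return F.cross_entropy(logits[start - 1 : start - 1 + len(target)], target)

# (1) Gradient of the loss w.r.t. one-hot suffix indicators, so token swaps
#     can be ranked even though tokens themselves are discrete.
V = embed.weight.shape[0]
one_hot = F.one_hot(suffix, num_classes=V).float().requires_grad_(True)
embeds = torch.cat([embed(prompt), one_hot @ embed.weight, embed(target)]).unsqueeze(0)
logits = model(inputs_embeds=embeds).logits[0]
start = len(prompt) + len(suffix)
loss = F.cross_entropy(logits[start - 1 : start - 1 + len(target)], target)
loss.backward()

# (2) Top-k most promising replacement tokens per suffix position
#     (largest negative gradient = biggest expected loss decrease).
top_k = (-one_hot.grad).topk(8, dim=1).indices  # (len(suffix), 8)

# (3) Greedy step: evaluate random single-token swaps from the candidate
#     set and keep the one that lowers the loss the most.
best, best_loss = suffix.clone(), loss.item()
for _ in range(32):
    pos = torch.randint(len(suffix), (1,)).item()
    cand = suffix.clone()
    cand[pos] = top_k[pos, torch.randint(8, (1,)).item()]
    with torch.no_grad():
        cand_loss = target_loss(cand).item()
    if cand_loss < best_loss:
        best, best_loss = cand, cand_loss
suffix = best  # one GCG iteration; real runs repeat this many times
```

A full run repeats this step for hundreds of iterations, and the universal version of the attack optimizes one suffix against the losses of multiple harmful prompts and multiple models at once, which is what makes it transfer.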
