Meta Announces Llama 3 at Weights & Biases’ conference

Weights & Biases

68,091 views

Published on May 8, 2024
In an engaging presentation at Weights & Biases’ Fully Connected conference, Joe Spisak, Product Director of GenAI at Meta, unveiled the latest family of Llama models, Llama 3.
Marking a significant milestone in AI development, the Llama 3 family includes the 8 billion and 70 billion parameter models released during the conference, along with a glimpse of a 400 billion parameter model still in the works.
Joe shared insights into the training process and alignment of Llama 3, which now ranks as the top-performing model in the open weights category on the MMLU, GSM8K, and HumanEval benchmarks.
Weights & Biases is proud to support customers such as Meta as they push the boundaries of AI. To learn how to fine-tune your LLMs using torchtune and Weights & Biases, start here: wandb.me/torchtune
Timestamps:
00:00 Introduction
03:05 Overview of Llama at Meta
05:59 Introducing Meta Llama 3
07:04 Advancements in Llama 3: Training and Data Scale
10:02 Benchmarking Llama 3 Performance
14:01 Enhancing Model Safety and Red Teaming
16:23 Expanding the Ecosystem and Future Directions
23:00 Closing remarks: Future plans for Llama models and an invitation to use Meta's Llama 3
#MetaLlama #ArtificialIntelligence #AITrends #TechInnovation
Science & Technology

Comments • 27

@Crux69 14 days ago ⁺¹⁸
My favorite fact from this is that the smarter the model, the more it violates rules. Just like us :)
@utuberay007 12 days ago
Very true! People who are way smarter on tax laws are the ones who violate them most, while innocent people pay more than they are supposed to, etc. Same goes for many other laws.
@techpiller2558 10 days ago
Or, the rules it uses instead of the rules we assumed are different.
@thenoblerot 15 days ago ⁺¹¹
Thanks for this W&B
@WeightsBiases 14 days ago ⁺¹
Our pleasure!
@RakeshMurria 15 days ago ⁺⁷
I really enjoyed this. Thanks
@WeightsBiases 14 days ago ⁺¹
Glad you enjoyed it!
@siloquant 14 days ago ⁺¹
Congratulations!
@ihesiulo 14 days ago ⁺⁶
There's a universe where Joseph Spisak is Mark Zuckerberg's brother. Oh, and nice presentation. Wonderful work they are doing at Meta AI.
@naninano8813 15 days ago ⁺⁵
So all those supervisor/safeguard models are only utilized during training? I mean, once the weights of Llama 3 are out, there is no safeguard network between user and inference engine, right?
@Crux69 14 days ago
I'm sure they have a safety model that tries to review every request and catch some negative responses.
@PeterLappo 14 days ago
How much did it cost to build, including hardware and engineering costs?
@techpiller2558 10 days ago
What will be the SQLite of LLMs, with capability for local use? Llama?
@thegreatgustby 16 days ago ⁺²
I think he could have said "ridiculous" a bit more often
@gubatron 14 days ago ⁺⁴
vin diesel!
@RichReportcom 8 days ago ⁺¹
Summary: Safety and size. The end.
@ericadar 15 days ago ⁺¹⁵
A few hours go by... Llama 3 no longer SOTA
@adinsoftic 15 days ago ⁺⁷
That's why they open source it. They let the community figure things out and iterate. For Meta, an LLM is just a tool and not a product in itself.
@SkepticButOptimist 15 days ago ⁺²
Wait, what is SOTA now?
@adinsoftic 15 days ago ⁺²
@SkepticButOptimist "state of the art"
@JeiShian 14 days ago
Which model is SOTA?
@MiraPloy 12 days ago ⁺⁵
@JeiShian I think it's supposed to be either Phi or SenseNova, neither of which is released.
@GerardSans 10 days ago
How silly is it to red-team a model whose training data you control to check for bioweapons capabilities? How stupid would you have to be? Isn't it easier to run a search on the data? 😂😅
@matbeedotcom 15 days ago
I'm glad they saw how useless they made CodeLlama 😂, it was waaaay overly aligned
