1st Multilingual Model Workshop - Continued Pre-training of LLMs

  • Published on 11 Feb 2024
  • Large language models (LLMs) are routinely pre-trained on billions of tokens, only for the process to be restarted from scratch once new data becomes available. A much cheaper and more efficient solution would be to enable the continual pre-training of these models, i.e., updating pre-trained models with new data instead of re-training them from scratch. However, the distribution shift induced by novel data typically results in degraded performance on past data.
    This talk discusses the vision of developing methods that can efficiently update pre-trained models with new knowledge while preventing the forgetting of past knowledge. As a step towards efficient continual pre-training, we examine the effect of different warm-up strategies and of replay when continuing to pre-train models on new data and new languages (a minimal sketch of this setup follows the list below).
  • Science & Technology
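
The abstract names two levers: learning-rate warm-up and replay of past data. Below is a minimal sketch, not the speakers' code, of what continued pre-training with learning-rate re-warming and a small replay fraction can look like in PyTorch; the toy model, random token batches, 5% replay ratio, and schedule lengths are all illustrative assumptions.

```python
# Hypothetical sketch of continued pre-training with LR re-warming and replay.
# The model, corpora, and hyperparameters are placeholders, not the talk's setup.
import random
import torch
from torch import nn
from torch.optim import AdamW
from torch.optim.lr_scheduler import LambdaLR

# Placeholder "LLM": anything that maps token ids to next-token logits.
vocab_size = 100
model = nn.Sequential(nn.Embedding(vocab_size, 32), nn.Linear(32, vocab_size))
optimizer = AdamW(model.parameters(), lr=3e-4)

# Re-warm the learning rate: ramp from 0 back to the peak over warmup_steps,
# then decay, instead of resuming at the tiny final LR of the original run.
warmup_steps, total_steps = 100, 1000
def lr_lambda(step):
    if step < warmup_steps:
        return step / warmup_steps
    return max(0.0, (total_steps - step) / (total_steps - warmup_steps))
scheduler = LambdaLR(optimizer, lr_lambda)

# Toy token batches standing in for the old (replay) and new (e.g. new-language) corpora.
def make_batch():
    return torch.randint(0, vocab_size, (8, 16))
old_corpus = [make_batch() for _ in range(50)]
new_corpus = [make_batch() for _ in range(50)]

replay_fraction = 0.05  # assumed ratio: 5% of batches drawn from past data
loss_fn = nn.CrossEntropyLoss()

for step in range(total_steps):
    # Mix a small number of replay batches from the old distribution into the new data.
    batch = random.choice(old_corpus if random.random() < replay_fraction else new_corpus)
    inputs, targets = batch[:, :-1], batch[:, 1:]
    logits = model(inputs)
    loss = loss_fn(logits.reshape(-1, vocab_size), targets.reshape(-1))
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    scheduler.step()
```

The two design choices this illustrates are re-warming the learning rate from zero rather than continuing at the small final value of the first pre-training run, and drawing a small fraction of updates from the old data distribution to mitigate forgetting.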

Comments • 1

  • @kalilinux8682 (several months ago)

    This is gold. Thanks guys for sharing your findings!