How to Train Your Own Large Language Models

  • Published Jun 25, 2024
  • Given the success of OpenAI’s GPT-4 and Google’s PaLM, every company is now assessing its own use cases for Large Language Models (LLMs). Many companies will ultimately decide to train their own LLMs for a variety of reasons, ranging from data privacy to increased control over updates and improvements. One of the most common reasons will be to make use of proprietary internal data.
    In this session, we’ll go over how to train your own LLMs, from raw data to deployment in a user-facing production environment. We’ll discuss the engineering challenges and the vendors that make up the modern LLM stack: Databricks, Hugging Face, and MosaicML. We’ll also break down what it means to train an LLM on your own data, including the various approaches and their associated tradeoffs.
    Topics covered in this session:
    - How Replit trained a state-of-the-art LLM from scratch
    - The different approaches to using LLMs with your internal data
    - The differences between fine-tuning, instruction tuning, and RLHF
    Talk by: Reza Shabani
    Here’s more to explore:
    LLM Compact Guide: dbricks.co/43WuQyb
    Big Book of MLOps: dbricks.co/3r0Pqiz
    Connect with us:
    Website: databricks.com
    Twitter: @databricks
    LinkedIn: @databricks
    Instagram: @databricksinc
    Facebook: @databricksinc
  • Science & Technology

Comments • 7

  • @BlackThorne 10 months ago +14

    00:35 🧠 Business Use Case: Training Large Language Models (LLMs)
    01:45 💡 Reasons for Training LLMs: Customization, Cost Efficiency
    02:40 🔀 Training Process: Data Pipelines, Model Training, Inference
    05:16 📊 Data Processing: Filtering, Anonymization, Pre-processing
    08:23 🔤 Tokenizer & Vocabulary Training: Custom Vocabulary, Benefits, Challenges (see the tokenizer sketch after this list)
    13:09 🎯 Model Evaluation: HumanEval Framework, Code Metrics vs. NLP Metrics
    18:35 ⚙ Model Training & Specs: Model Size, Training Objective, Attention Mechanisms
    20:55 📈 Model Training Challenges: Data Determinism, Loss Curve Spikes
    23:41 🔄 Generation vs. Evaluation: Separating the Process
    24:08 🚀 Deployment: Building Inference Stack, Managed Services
    24:52 🖥 Model serving involves GPU and model-size considerations, pre/post-processing, and server/client-side logic.
    25:49 🧠 Evaluating your model is crucial; define success criteria early to guide the training process.
    26:02 🔄 Rapid iteration is valuable for testing model behavior and improving user experience.
    26:29 ⏳ Ensure compatibility between training and inference stacks to avoid sub-optimal results.
    26:57 🔄 Customization drives the desire to train LLMs with one's data; various approaches exist.
    27:51 📚 Retrieval-based augmentation involves fetching relevant context to guide model responses (see the retrieval sketch after this list).
    28:08 🤖 Contextual prompting improves model's domain-specific knowledge, even if not originally trained.
    28:21 💡 Embeddings and semantic similarity prioritize context selection for retrieval.
    30:01 🎯 Fine-tuning methods vary in complexity; consider instruction tuning and training from scratch (see the fine-tuning sketch after this list).
    31:25 🔄 Models struggle with varying data formats, short-form content, and changing facts.
    32:35 🌍 Custom domain data presents challenges; careful selection and use of embeddings is key.
    33:37 🌶 Fine-tuning is complex; unsupervised fine-tuning for new domain knowledge has limitations.
    35:18 🚫 Agents might become redundant as models absorb useful functionalities.
    36:00 🔄 Balancing training data mix is challenging; no established formula, lots of variables.
    37:11 💾 Data iteration tools are crucial as data, not GPUs, becomes the bottleneck for model advancement.
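
    To make the tokenizer step at 08:23 concrete, here is a minimal sketch of training a custom BPE vocabulary with the Hugging Face tokenizers library. The corpus file name and vocabulary size are illustrative assumptions, not settings from the talk.

        from tokenizers import Tokenizer
        from tokenizers.models import BPE
        from tokenizers.pre_tokenizers import Whitespace
        from tokenizers.trainers import BpeTrainer

        # Assumed corpus file and vocab size; the talk does not specify Replit's settings.
        tokenizer = Tokenizer(BPE(unk_token="[UNK]"))
        tokenizer.pre_tokenizer = Whitespace()
        trainer = BpeTrainer(vocab_size=32768, special_tokens=["[UNK]", "[PAD]"])
        tokenizer.train(files=["corpus.txt"], trainer=trainer)
        tokenizer.save("tokenizer.json")

    A vocabulary trained on your own domain tends to produce shorter token sequences for that domain (the benefit mentioned in the talk); the challenge is that you lose drop-in compatibility with existing pretrained checkpoints.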
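
    For the retrieval-based augmentation discussed at 27:51–28:21, here is a minimal sketch of ranking context by embedding similarity and building a prompt. The embed function is a toy hashed bag-of-words stand-in for a real embedding model, and the helper names (top_k_context, build_prompt) are hypothetical.

        import numpy as np

        def embed(text: str, dim: int = 256) -> np.ndarray:
            # Toy hashed bag-of-words embedding, normalized so the dot
            # product below equals cosine similarity.
            vec = np.zeros(dim)
            for token in text.lower().split():
                vec[hash(token) % dim] += 1.0
            norm = np.linalg.norm(vec)
            return vec / norm if norm else vec

        def top_k_context(query: str, documents: list[str], k: int = 3) -> list[str]:
            # Rank documents by similarity to the query and keep the top k.
            q = embed(query)
            return sorted(documents, key=lambda d: -float(np.dot(q, embed(d))))[:k]

        def build_prompt(query: str, documents: list[str]) -> str:
            # Prepend the most relevant context so the model can ground its answer.
            context = "\n\n".join(top_k_context(query, documents))
            return f"Context:\n{context}\n\nQuestion: {query}\nAnswer:"

    In practice you would swap the toy embedding for a learned sentence-embedding model and store vectors in an index, but the ranking-then-prompting flow is the same.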
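
    And for the fine-tuning methods at 30:01, here is a minimal causal-LM fine-tuning sketch using the Hugging Face Trainer. The base model ("gpt2") and the data file "internal_docs.txt" are placeholders, not details from the talk.

        from datasets import load_dataset
        from transformers import (AutoModelForCausalLM, AutoTokenizer,
                                  DataCollatorForLanguageModeling, Trainer,
                                  TrainingArguments)

        model_name = "gpt2"  # placeholder base model
        tokenizer = AutoTokenizer.from_pretrained(model_name)
        tokenizer.pad_token = tokenizer.eos_token
        model = AutoModelForCausalLM.from_pretrained(model_name)

        # "internal_docs.txt" is a hypothetical file of proprietary text.
        dataset = load_dataset("text", data_files={"train": "internal_docs.txt"})["train"]
        tokenized = dataset.map(
            lambda batch: tokenizer(batch["text"], truncation=True, max_length=512),
            batched=True, remove_columns=["text"])

        trainer = Trainer(
            model=model,
            args=TrainingArguments(output_dir="out", num_train_epochs=1,
                                   per_device_train_batch_size=2),
            train_dataset=tokenized,
            data_collator=DataCollatorForLanguageModeling(tokenizer, mlm=False),
        )
        trainer.train()

    Instruction tuning uses the same loop over (instruction, response) pairs, while RLHF adds a separately trained reward model on top, which is why the talk orders them by complexity.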

  • @syednaveed1391 9 months ago +6

    Super useful. I am a physician; I tried fine-tuning on cancer documents and it didn't work, then I found your video. Thanks!

  • @dilipjha08 several months ago

    Thanks for sharing this knowledge with technology users. It was very detailed about DLT as well as streaming tables, and the comparison between them and the demo of the topic were excellent.

  • @CalebFenton 9 months ago +1

    Thanks for the info and especially the hot takes.

    • @AnandShah-ds 7 months ago +2

      That was the best part. It should have been the opener.

  • @mohsenghafari7652 3 months ago +1

    Hi, please help me: how can I create a custom model from many PDFs in the Persian language? Thank you.

  • @goodstuff5666 several months ago

    Very nice tutorial! Could you guys share the slides? Thanks.