Building RAG-based LLM Applications for Production // Philipp Moritz & Yifei Feng // LLMs III Talk
- Published on Nov 15, 2024
// Abstract
In this talk, we will cover how to develop and deploy RAG-based LLM applications for production. We will cover how the major workloads (data loading and preprocessing, embedding, serving) can be scaled on a cluster, how different configurations can be evaluated and how the application can be deployed. We will also give an introduction to Anyscale Endpoints which offers a cost-effective solution for serving popular open-source models.
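The abstract names the three major RAG workloads: data loading and preprocessing, embedding, and serving. A minimal, self-contained sketch of how those stages fit together (the toy hash-based embedding stands in for a real embedding model, and all function names here are illustrative assumptions, not APIs from the talk):

```python
import hashlib
import math

def embed(text: str, dim: int = 16) -> list[float]:
    # Toy deterministic "embedding": hash character trigrams into a fixed-size
    # unit vector. A real pipeline would call an embedding model here.
    vec = [0.0] * dim
    for i in range(len(text) - 2):
        gram = text[i:i + 3]
        h = int(hashlib.md5(gram.encode()).hexdigest(), 16)
        vec[h % dim] += 1.0
    norm = math.sqrt(sum(v * v for v in vec)) or 1.0
    return [v / norm for v in vec]

def chunk(doc: str, size: int = 80) -> list[str]:
    # Data loading / preprocessing stage: split each document into chunks.
    return [doc[i:i + size] for i in range(0, len(doc), size)]

def build_index(docs: list[str]) -> list[tuple[str, list[float]]]:
    # Embedding stage: embed every chunk and store (chunk, vector) pairs.
    return [(c, embed(c)) for d in docs for c in chunk(d)]

def retrieve(index: list[tuple[str, list[float]]], query: str, k: int = 2) -> list[str]:
    # Serving stage: embed the query and return the top-k chunks by
    # cosine similarity (vectors are already unit-normalized).
    q = embed(query)
    scored = sorted(index, key=lambda p: -sum(a * b for a, b in zip(q, p[1])))
    return [c for c, _ in scored[:k]]

docs = [
    "Ray is an open-source system for scaling AI and Python workloads.",
    "Embeddings map text chunks to vectors for similarity search.",
]
index = build_index(docs)
top = retrieve(index, "What is Ray?", k=1)
print(top)
```

In production, each of these stages would run as a distributed workload on a cluster (e.g. with Ray, as the talk discusses), and the retrieved chunks would be passed to an LLM as context for generation.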
// Bio
Philipp Moritz
Philipp Moritz is one of the creators of Ray, an open-source system for scaling AI. He is also co-founder and CTO of @anyscale, the company behind Ray. He is passionate about machine learning, artificial intelligence, and computing in general and strives to create the best open-source tools for developers to build and scale their AI applications.
Yifei Feng
Yifei leads the Infrastructure and SRE teams at @anyscale. Her teams focus on building a seamless, cost-effective, and scalable infrastructure for large-scale machine learning workloads. Before Anyscale, she spent a few years at Google working on the open-source machine learning library TensorFlow.
// Sign up for our Newsletter to never miss an event:
mlops.communit...
// Watch all the conference videos here:
home.mlops.com...
// Check out the MLOps Community podcast: open.spotify.c...
// Read our blog:
mlops.community/blog
// Join an in-person local meetup near you:
mlops.communit...
// MLOps Swag/Merch:
mlops-communit...
// Follow us on Twitter:
/ mlopscommunity
// Follow us on LinkedIn:
/ mlopscommunity
If the LLM evaluator is not trained on the Ray documentation, how can it be used to evaluate whether responses to Ray-related questions are correct?
One more assumption is that you're using GPT-4 as the LLM evaluator. If GPT-4 is asked to evaluate GPT-4, isn't there an inherent bias at play?
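A common way to address both questions above is to give the judge a trusted reference answer (or the retrieved context) in its prompt, so it grades agreement with the reference rather than relying on knowledge it was trained on; this also reduces, though does not eliminate, self-evaluation bias. A hypothetical sketch of constructing such a prompt (the rubric wording, function name, and 1-5 scale are illustrative assumptions, not the speakers' method):

```python
def build_judge_prompt(question: str, reference: str, candidate: str) -> str:
    # The judge never has to "know" Ray: it only compares the candidate
    # answer against a trusted reference drawn from the Ray documentation.
    return (
        "You are grading an answer about the Ray documentation.\n"
        f"Question: {question}\n"
        f"Reference answer (ground truth): {reference}\n"
        f"Candidate answer: {candidate}\n"
        "Score the candidate from 1 to 5 for factual agreement with the "
        "reference, and respond with only the number."
    )

prompt = build_judge_prompt(
    "How do I start Ray locally?",
    "Call ray.init() to start Ray on a single machine.",
    "You can run ray.init() in Python to start Ray locally.",
)
print(prompt)
```

Another common mitigation for the self-evaluation concern is to use a different model family as the judge than the one that generated the responses.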