Towards Robust GenAI: Techniques for Evaluating Enterprise LLM Applications

  • Published May 15, 2024
  • Speaker: Dhruv Singh, Co-Founder & CTO, HoneyHive AI
    As LLMs become more capable, evaluating their performance and safety has become harder. Traditional human evaluation is slow, expensive, and prone to bias, and this bottleneck hinders enterprise AI adoption.
    This talk will outline the pitfalls of current evaluation methods, then introduce emerging automated evaluation solutions. The approach combines real-time "micro evaluators" that continuously monitor models with strategic human feedback loops. Together, they provide constant insight into a model's strengths, weaknesses, and blind spots. By the end, you'll have strategies for confidently using language models in your apps and products.
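
To make the idea concrete, here is a minimal sketch of what a real-time "micro evaluator" pipeline with a human feedback loop might look like. All names here (MicroEvaluator checks, ReviewQueue, run_micro_evaluators) are hypothetical illustrations of the pattern the talk describes, not HoneyHive's actual API.

```python
# A minimal sketch: small, fast checks ("micro evaluators") score each model
# output, and low-scoring outputs are routed to a queue for human review.
from dataclasses import dataclass, field
from typing import Callable


@dataclass
class EvalResult:
    name: str     # which micro evaluator produced this score
    score: float  # 0.0 (fail) .. 1.0 (pass)


def score_nonempty(output: str) -> float:
    """Trivial check: did the model produce any content at all?"""
    return 1.0 if output.strip() else 0.0


def score_no_refusal(output: str) -> float:
    """Heuristic check for canned refusal phrases."""
    refusal_markers = ("i cannot", "i'm sorry", "as an ai")
    return 0.0 if any(m in output.lower() for m in refusal_markers) else 1.0


@dataclass
class ReviewQueue:
    """Stand-in for the human feedback loop: collect flagged outputs."""
    items: list = field(default_factory=list)

    def flag(self, output: str, results: list) -> None:
        self.items.append((output, results))


def run_micro_evaluators(
    output: str,
    evaluators: dict[str, Callable[[str], float]],
    queue: ReviewQueue,
    threshold: float = 0.5,
) -> list:
    """Score one model output in real time; route failures to human review."""
    results = [EvalResult(name, fn(output)) for name, fn in evaluators.items()]
    if any(r.score < threshold for r in results):
        queue.flag(output, results)  # strategic human feedback loop
    return results


if __name__ == "__main__":
    queue = ReviewQueue()
    evaluators = {"nonempty": score_nonempty, "no_refusal": score_no_refusal}
    for out in ["Paris is the capital of France.", "I'm sorry, I can't help."]:
        print(out, "->", run_micro_evaluators(out, evaluators, queue))
    print(f"{len(queue.items)} output(s) flagged for human review")
```

The design choice this sketch illustrates is the split of responsibilities: cheap automated checks run on every output to catch obvious failures immediately, while scarce human attention is spent only on the outputs those checks flag.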
