Mitigating LLM Hallucinations with a Metrics-First Evaluation Framework

  • Published on Jul 24, 2024
  • Join this workshop, where we showcase powerful metrics for evaluating the quality of inputs and outputs, with a focus on both RAG and fine-tuning use cases. In the context of LLMs, "hallucination" refers to a phenomenon where the model generates text that is incorrect, nonsensical, or not real. Since LLMs are not databases or search engines, they do not cite the sources their responses are based on; these models generate text by extrapolating from the prompt you provide.
    What attendees can expect to take away from the workshop:
    - A deep dive into research-backed metrics for evaluating the quality of inputs (data quality, RAG context quality, etc.) and outputs (hallucinations) while building LLM-powered applications.
    - An evaluation and experimentation framework for prompt engineering with RAG, as well as for fine-tuning with your own data.
    - A demo-led practical guide to building guardrails and mitigating hallucinations while building LLM-powered applications.
    To access the slides, please click here:
    docs.google.com/presentation/...
    To read the academic paper, please click here:
    www.rungalileo.io/blog/chainpoll
    To see these concepts in action, take a look at the Hallucination Index here: www.rungalileo.io/hallucinati...
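    For a rough picture of how a ChainPoll-style hallucination metric works, here is a minimal sketch in Python. The idea, following the paper linked above, is to poll an LLM judge several times with a chain-of-thought prompt and average the binary verdicts. The prompt wording, the gpt-4o-mini judge model, and the chainpoll_score helper below are illustrative assumptions, not Galileo's exact implementation.

    # A minimal ChainPoll-style hallucination score (illustrative sketch).
    # Assumes the openai v1 SDK and OPENAI_API_KEY set in the environment.
    from openai import OpenAI

    client = OpenAI()

    JUDGE_PROMPT = """\
    Does the completion below contain hallucinations, i.e. claims that are
    not supported by the context? Think step by step, then answer on the
    final line with exactly "yes" or "no".

    Context: {context}
    Completion: {completion}
    """

    def chainpoll_score(context: str, completion: str, n_polls: int = 5) -> float:
        """Return the fraction of judge runs that flag a hallucination (0.0-1.0)."""
        votes = 0
        for _ in range(n_polls):
            response = client.chat.completions.create(
                model="gpt-4o-mini",  # assumed judge model; any capable chat model works
                temperature=1.0,      # sampling diversity is what makes polling useful
                messages=[{"role": "user",
                           "content": JUDGE_PROMPT.format(context=context,
                                                          completion=completion)}],
            )
            # The prompt asks for a bare "yes"/"no" on the final line.
            verdict = response.choices[0].message.content.strip().splitlines()[-1]
            votes += verdict.lower().startswith("yes")
        return votes / n_polls

    Sampling the judge at a nonzero temperature is what makes repeated polling informative: disagreement across runs signals that the completion's grounding is uncertain, and the averaged vote doubles as a confidence score.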
    This event is inspired by DeepLearning.AI’s GenAI short courses, created in collaboration with AI companies across the globe. Our courses help you learn new skills, tools, and concepts efficiently within 1 hour.
    www.deeplearning.ai/short-cou...
    About Galileo
    At Galileo we are building the first algorithm-powered LLMOps Platform for the enterprise. Galileo provides ML teams with an intelligent ML data bench to collaboratively improve data quality across their model workflows, from pre-training to post-production. Galileo currently powers ML teams across the Fortune 500 as well as startups across multiple industries.
    Speakers:
    Vikram Chatterji, Co-founder and CEO at Galileo
    / vikram-chatterji
    Atindriyo Sanyal, Co-founder and CTO at Galileo
    / atinsanyal
  • Entertainment

Comments • 20

  • @HonestGraduate
    @HonestGraduate 9 months ago +1

    Thank you for the presentation and demo!

  • @ajeethkumar6296
    @ajeethkumar6296 several months ago

    Thanks for the clear-cut explanation

  • @user-wz5rd6vg2r
    @user-wz5rd6vg2r 9 months ago +15

    The real contribution seems to be the prompt they used to generate the CoT and the metric value... Could you share the code used for the metric and the prompt for ChatGPT?

  • @KokkeOP
    @KokkeOP 9 months ago +2

    The paper and the slides are both in the description, guys. :) Read.

  • @purvislewies3118
    @purvislewies3118 9 months ago +1

    Blessed love...givethanks...Cape Town

  • @user-wz5rd6vg2r
    @user-wz5rd6vg2r 9 months ago +4

    Nice talk! Could you please share the notebook?

  • @danteblink
    @danteblink 9 months ago +1

    Do you think human intervention in the evaluation process is going to last? It seems like a process that LLMs could handle by themselves in the near future.

  • @JuliusOpusprofundum
    @JuliusOpusprofundum 9 months ago +1

  • @senderlapin
    @senderlapin 9 months ago +2

    I'm from Russia. Thank you for the webinar.

  • @zaursamedov8906
    @zaursamedov8906 9 months ago +3

    Guys, would you be able to drop the notebook, please?

    • @hcrespo3
      @hcrespo3 9 months ago +4

      I'm also interested, thanks

  • @komalmistry7284
    @komalmistry7284 9 months ago

    Could someone share the link to the paper that was mentioned here? "ChainPoll", I believe.

    • @Deeplearningai
      @Deeplearningai 9 months ago

      It is in the video description!

  • @davidvilla2402
    @davidvilla2402 9 months ago +1

    I don't know how, but I searched the n word and it came up