How to set up RAG - Retrieval Augmented Generation (demo)
ฝัง
- เผยแพร่เมื่อ 17 พ.ค. 2024
- When I posted my video on Retrieval Augmented Generation (RAG), I got a lot of requests to show exactly how it worked at a technical level.
RAG helps you create a ChatGPT-like experience that uses your own internal content. Instead of only accessing information from the internet, an LLM can access your PDFs or databases or any other internal content by using a RAG architecture.
I know this is a little more technical than my previous videos, so I’d love to hear from you if you thought this was interesting and if I should do more! Drop a comment so I know.
#RAG #AI #VectorDB #LLM #codetocare #artificialintelligence #machinelearning #ml #chatgpt #genai
Also, check out our VectorDB here: www.intersystems.com/news/iri...
Check out my LinkedIn: / donwoodlock
Timecodes:
00:00 - Intro
00:35 - Demo Layout in Stages
00:59 - Stage 1: Using a Simple Pass Through Prompt
01:52 - Stage 1: Desktop Demo
04:15 - Stage 2: Adding Instructions to your prompt
05:11 - Stage 2: Desktop Demo
07:18 - Stage 3: Setting up Retrieval Augmented Generation
09:05 - Stage 3: Desktop Demo
19:10 - Wrap up
---
ABOUT INTERSYSTEMS
Established in 1978, InterSystems Corporation is the leading provider of data technology for extremely critical data in healthcare, finance, and logistics. It’s cloud-first data platforms solve interoperability, speed, and scalability problems for large organizations around the globe.
InterSystems Corporation is ranked by Gartner, KLAS, Forrester and other industry analysts as the global leader in Data Access and Interoperability. InterSystems is the global market leader in Healthcare and Financial Services.
Website: www.intersystems.com/
TH-cam: / @intersystemscorp
LinkedIn: / intersystems
Twitter: / intersystems - วิทยาศาสตร์และเทคโนโลยี
Amazing video, save me ton of days to study. Thank you very much Mr. Don
Excellent, made so simple.😃
helpful! thanks.
Awesome Don, please do more of these videos explaining more about AI/ML
next topic: one on Lang chain or llama model 3 which was released recently.
Yes - I'll take a look.
@@dwoodlock Thank you dan you are amazing
Great video! Could you please do one video on RAG with knowledge graph? Thanks!
You really are a teacher, made it consumable to RAG newbies, this is how videos should be lol. Thank you very much
It's so good to see a good demo, please put more videos, I would specifically looking for RDMS database to vector database to store embeddings so my user can use AI
Best explanation I have seen to date of how RAG works.
The way you conveyed that information was so structured and informative.
Your explanation made me grasp the concept quickly.
Thank You!!!
I loved this explanation of RAG. I’ll share this with my team.
Thanks, after watching your video I got clear understanding how we can implement RAG.
Appreciate you taking the time to get into the weeds
This was fantastic! Thank you!
Greate Explanation
I really fall in love with RAG, I am a Master student and I want to do some research on RAG for my upcoming Thesis Project. Can I have some suggestion from you ? articles or publications ?
Thanks for this video
Great video! where can I get the code used in the video?
Simple and clear! Thanks!
Great explanation! Is there a way i can get the code you show in the jupyter notebook please? Would like to play around with it using my dataset. That would be very helpful!
Great video! How did you get the content of the website? I am working on a similar project, and most of my effort seems to be scraping pages.
love it, thank you! I appreciate these videos.
Very thankful Don. It would be awesome to get some timestamps throughout the video, but this is incredible as is.
Ah. Good idea.
Done!
@@dwoodlock Awesome
Hi Don, thank you for the great video. It would be great if you could share the notebook codes 😊
Always wanted to understand how everyone all of a sudden made their own bots. Now I understand how simple it is.
How did you turn the website into a csv file? web-scraper? if so, which one?
Awesome Sir !!
Many thanks, Don. Your explanation was simple, clear and very helpful.
To ensure that the top 5 answers are more relevant to the prompt questions, how can we verify and validate the model's output ? How can we test the bias in the data being prepared using the CSV, which is necessary to obtain the top answers?
My other question is, how can we integrate these models in real-time? i.e. how to integrate these models into different HIS and CIS touchpoints (or any healthcare system, really), how to run and train these models, what additional computation (hardware requirements and increase in total cost of ownership) might be needed, and how to confirm User Acceptance Tests and seek customer sign-off before moving to production.
In terms of validation, you will need to do some of this by hand - meaning a human reading the answers and verifying that they are accurate, that they represent the right tone, and that your model isn't responding to things that are off topic and/or are dangerous. These 3 could be separate validation threads.
@@dwoodlock Thanks Don and very helpful.
So what is df?
Distance Formula I think
What is concatenate?