How to set up RAG - Retrieval Augmented Generation (demo)

แชร์
ฝัง
  • เผยแพร่เมื่อ 21 พ.ย. 2024

ความคิดเห็น • 108

  • @davidgarces3584
    @davidgarces3584 2 หลายเดือนก่อน +13

    I have been doing dozens of trainings and courses in the past months and I have to say this was the absolute best deep dive into RAG!!! I loved how u showed what this looks like in code. Most explanations are completely theoretical.
    This was amazing! please continue to post videos on (gen)AI!!!!
    I would personally would love to understand how a system (CRM, etc.) is integrated in this architecture. Anybody else interested in this? Please like!

  • @belugaaa6649
    @belugaaa6649 7 หลายเดือนก่อน +26

    Awesome Don, please do more of these videos explaining more about AI/ML
    next topic: one on Lang chain or llama model 3 which was released recently.

    • @dwoodlock
      @dwoodlock  6 หลายเดือนก่อน +9

      Yes - I'll take a look.

    • @belugaaa6649
      @belugaaa6649 6 หลายเดือนก่อน +2

      @@dwoodlock Thank you dan you are amazing

  • @funmitoadeyemi4647
    @funmitoadeyemi4647 7 หลายเดือนก่อน +9

    You really are a teacher, made it consumable to RAG newbies, this is how videos should be lol. Thank you very much

  • @frack4oil16
    @frack4oil16 5 หลายเดือนก่อน +6

    I’m a cybersecurity professional but I don’t code. More like an infrastructure guy. But now I’ve become very interested in AI security. Coding was always daunting for me. This guy made it look easy as pie. Thank you!

  • @odysseuspolynoos9335
    @odysseuspolynoos9335 หลายเดือนก่อน +1

    You are just great, as you know your subject and you have the will to elucidate us. Many thanks and all the best!

  • @HarmonyWang-u9j
    @HarmonyWang-u9j หลายเดือนก่อน +1

    This is so clear! I am a beginner who was able to learn the whole process with your video, thank you so much!

  • @arvindms9
    @arvindms9 2 หลายเดือนก่อน +1

    This was one of the best videos on RAG. Very nicely done!

  • @govindarajram8553
    @govindarajram8553 6 หลายเดือนก่อน +6

    It's so good to see a good demo, please put more videos, I would specifically looking for RDMS database to vector database to store embeddings so my user can use AI

    • @dwoodlock
      @dwoodlock  5 หลายเดือนก่อน

      Try InterSystems IRIS.

  • @nadellaella6416
    @nadellaella6416 7 หลายเดือนก่อน +2

    The way you conveyed that information was so structured and informative.
    Your explanation made me grasp the concept quickly.
    Thank You!!!

  • @nayanchoudhary4353
    @nayanchoudhary4353 4 หลายเดือนก่อน

    I had been looking for the full set steps for a year now. This is easiest and very good video content! Thank you for teaching such important use of LLMs!

  • @akashacharya2813
    @akashacharya2813 5 หลายเดือนก่อน

    I am full stack developer trying to get into machine learning /ai industry workspace. Your videos are motivation for people like us.

  • @OstaNazir
    @OstaNazir 7 หลายเดือนก่อน +3

    I loved this explanation of RAG. I’ll share this with my team.

  • @starter_culture
    @starter_culture 6 หลายเดือนก่อน +2

    Best explanation I have seen to date of how RAG works.

  • @swethaanil5472
    @swethaanil5472 4 หลายเดือนก่อน

    I tried the approach with a product catalog dataset and was able to get similar results. You are a wonderful teacher . Thank you so much for this video and for the education. Looking forward to the next demo.

    • @sandyna1k
      @sandyna1k 3 หลายเดือนก่อน

      How did you set up. I tried to do what he showed in the video but cannot get past initial errors. For example- OpenAI not found

    • @aditiajaykumar6381
      @aditiajaykumar6381 2 หลายเดือนก่อน

      what website you used and how did you download the website? any suggestions?

  • @dbose_ai
    @dbose_ai 6 หลายเดือนก่อน +1

    Unbelievable content. Couldn't have been any simpler than this. Thank you so much.

  • @fitnessismypassion
    @fitnessismypassion 4 หลายเดือนก่อน

    Amazing job Don! Subscribed as soon as I have seen your first video about RAG and this one is even greater. Thank you very much, please keep making videos. You have a talent for teaching to lay audience.

  • @nomorecramps
    @nomorecramps 5 หลายเดือนก่อน +1

    This is awesome! Loved the explanation and walkthrough. Definitely sharing this with all folks i come across who'd need help with this!

  • @sjmediaonline
    @sjmediaonline 6 หลายเดือนก่อน +1

    Great video! Could you please do one video on RAG with knowledge graph? Thanks!

  • @MohitGupta-cz9ok
    @MohitGupta-cz9ok 6 หลายเดือนก่อน

    Thanks, after watching your video I got clear understanding how we can implement RAG.

  • @jacksonhanson154
    @jacksonhanson154 7 หลายเดือนก่อน +1

    Very thankful Don. It would be awesome to get some timestamps throughout the video, but this is incredible as is.

    • @dwoodlock
      @dwoodlock  6 หลายเดือนก่อน

      Ah. Good idea.

    • @dwoodlock
      @dwoodlock  6 หลายเดือนก่อน +2

      Done!

    • @jacksonhanson154
      @jacksonhanson154 6 หลายเดือนก่อน

      @@dwoodlock Awesome

  • @MWileY-nj1yb
    @MWileY-nj1yb 5 หลายเดือนก่อน

    I appreciate you immensely! Awesome concise clear information. I really hope you continue to make available content like this.

  • @SantK1208
    @SantK1208 5 หลายเดือนก่อน

    You made my demo remarkable, I explained my audience easily ❤❤❤❤❤❤

  • @parzivalplayer9636
    @parzivalplayer9636 4 หลายเดือนก่อน

    Fantastic stuff here.I hope you continue to explain the complex topics.

  • @aanyaaryancoolgamers2013
    @aanyaaryancoolgamers2013 6 หลายเดือนก่อน +1

    Great video! where can I get the code used in the video?

  • @yogeshsuryamurthy5939
    @yogeshsuryamurthy5939 7 หลายเดือนก่อน

    Many thanks, Don. Your explanation was simple, clear and very helpful.
    To ensure that the top 5 answers are more relevant to the prompt questions, how can we verify and validate the model's output ? How can we test the bias in the data being prepared using the CSV, which is necessary to obtain the top answers?
    My other question is, how can we integrate these models in real-time? i.e. how to integrate these models into different HIS and CIS touchpoints (or any healthcare system, really), how to run and train these models, what additional computation (hardware requirements and increase in total cost of ownership) might be needed, and how to confirm User Acceptance Tests and seek customer sign-off before moving to production.

    • @dwoodlock
      @dwoodlock  6 หลายเดือนก่อน

      In terms of validation, you will need to do some of this by hand - meaning a human reading the answers and verifying that they are accurate, that they represent the right tone, and that your model isn't responding to things that are off topic and/or are dangerous. These 3 could be separate validation threads.

    • @yogeshsuryamurthy5939
      @yogeshsuryamurthy5939 6 หลายเดือนก่อน

      @@dwoodlock Thanks Don and very helpful.

  • @fazilkagdi3290
    @fazilkagdi3290 7 หลายเดือนก่อน +3

    Always wanted to understand how everyone all of a sudden made their own bots. Now I understand how simple it is.

  • @arivolis
    @arivolis 5 หลายเดือนก่อน

    Great demo and explanation . Thank you for sharing the knowledge !

  • @damnguyenduyan
    @damnguyenduyan 6 หลายเดือนก่อน

    Amazing video, save me ton of days to study. Thank you very much Mr. Don

  • @boseashish
    @boseashish 2 หลายเดือนก่อน +1

    beautiful detaining. thankyou for a kind video

  • @bhaskarmazumdar9478
    @bhaskarmazumdar9478 7 หลายเดือนก่อน +1

    Appreciate you taking the time to get into the weeds

  • @JI9131
    @JI9131 10 วันที่ผ่านมา

    Thanks for the video! It’s great to see how the RAG really works. I understand that RAG is quite good at organizing answers with text. How about replacing the CSV files with large Excel sheets and needing answers from different columns in the same row? Will the RAG perform well in that case? I find that while the response is often correct, sometimes it jumps to different rows and makes up answers. What is your take on using Excel sheets as documents for RAG?

  • @hemantgaikwad3352
    @hemantgaikwad3352 4 หลายเดือนก่อน

    Good content. Thank you. Looking forward to next video.

  • @BKNaveen
    @BKNaveen 6 หลายเดือนก่อน +2

    Great explanation! Is there a way i can get the code you show in the jupyter notebook please? Would like to play around with it using my dataset. That would be very helpful!

    • @porter.anderson11
      @porter.anderson11 5 หลายเดือนก่อน

      Were you able to find the code shown in Jupyter or something similar?

  • @GiantNeuron
    @GiantNeuron 3 หลายเดือนก่อน +1

    Very useful! Amazing explanation!

  • @ajaykumarsugumaran3740
    @ajaykumarsugumaran3740 3 หลายเดือนก่อน

    Excellent Don, the video explained RAG as neatly as possible!
    1. Is it possible to share the code and the web source and web data csv file so we can follow along for practice?
    2. Then openAI model used here for embedding, did you have to pay for using that?
    3. Could you cover a separate video on current AI challenges related to AI ethics and ways to tackle them? Eg: AI used to create spam/ fake profiles, generate non consensual adult content images etc.

    • @sandyna1k
      @sandyna1k 3 หลายเดือนก่อน

      I have the same questions. Did you find resolution

    • @ajaykumarsugumaran3740
      @ajaykumarsugumaran3740 3 หลายเดือนก่อน

      @@sandyna1k 1. You can try web scraping or sites like httracker.
      2. Looks like it's a paid version of LLM.
      3. Haven't found an answer yet.

  • @HB-dj6eq
    @HB-dj6eq 4 หลายเดือนก่อน

    Excellent presentation, well done. The code really helped.

  • @Themojii
    @Themojii 6 หลายเดือนก่อน +1

    Hi Don, thank you for the great video. It would be great if you could share the notebook codes 😊

  • @prakashshekade
    @prakashshekade 5 หลายเดือนก่อน

    Amazing Don. I wonder why is there a need to use Langchain. Can you please throw some light?

  • @secondsleep
    @secondsleep 5 หลายเดือนก่อน

    Great video, easy to follow, highly informative. Thanks!

  • @penpen9379
    @penpen9379 2 หลายเดือนก่อน +1

    Amazing video! Thanks Don.

  • @ajaykumarsugumaran3740
    @ajaykumarsugumaran3740 3 หลายเดือนก่อน

    Great video. Could you explain when would we use RAGs? Is it supposed to be used almost always with an LLM? Or RAGs have special use cases?

  • @CalvHobbes
    @CalvHobbes 5 หลายเดือนก่อน

    Thank you, this was one of the most useful hands on tutorials and really cleared the confusion. I wonder if you plan to share the Jupyter notebook you used and if there is an upcoming (or existing) video that address the step you took to gather the input data (i.e. scraping the website to create the csv). It would be so useful to know about techniques for data retrieval and storage from bigger sites/ other data sources, and what would be different if you didn't have the liberty to only pick the top 4 closest embeddings. E.g. what if it were a retail web site and the user asked "what dresses do you have in blue?". Thank you once again.

    • @dwoodlock
      @dwoodlock  5 หลายเดือนก่อน

      I've had a few questions on the 'scraping the website' step. I just basically wrote a python script, using a few libraries, that navigated through the website from the top, and stored the documents.

    • @CalvHobbes
      @CalvHobbes 5 หลายเดือนก่อน

      Thank you

  • @souravmazumder58
    @souravmazumder58 4 หลายเดือนก่อน

    Thanks for the video. Is there a way I can understand how the costing works when Im using RAG?

  • @sanjaysitaramsoni
    @sanjaysitaramsoni 6 หลายเดือนก่อน

    I really fall in love with RAG, I am a Master student and I want to do some research on RAG for my upcoming Thesis Project. Can I have some suggestion from you ? articles or publications ?
    Thanks for this video

  • @divyabhasin1379
    @divyabhasin1379 6 หลายเดือนก่อน

    Great video very well explained. Do you also have a video on how to download a website into a CSV?

    • @dwoodlock
      @dwoodlock  5 หลายเดือนก่อน

      Not yet. I may do one - lots of you have asked.

  • @sdaiwepm
    @sdaiwepm หลายเดือนก่อน

    What a wonderful example! Before I run through this from my own Jupyter Notebook - do you recall roughly how much it cost in OpenAI credits?

  • @LGEightZero
    @LGEightZero 3 หลายเดือนก่อน

    Thanks for this. How would this differ if it were not web based public data, but actually patient data for example? How is access to permissions based sensitive data handled in this case?

  • @The_Reductionist
    @The_Reductionist 6 หลายเดือนก่อน +4

    How did you turn the website into a csv file? web-scraper? if so, which one?

    • @dwoodlock
      @dwoodlock  5 หลายเดือนก่อน

      I wrote it myself using the beautiful soup library in python. It wasn't too complicated.

  • @jdcage
    @jdcage 6 หลายเดือนก่อน

    Great video! How did you get the content of the website? I am working on a similar project, and most of my effort seems to be scraping pages.

  • @vladd.9264
    @vladd.9264 4 หลายเดือนก่อน

    Hi Don,
    I am learning that now and also doing something with assistant based on given material, like healthcare website in your case. So watched this RAG video to learn the concept.
    The question is that really needed to do the embedding and calculate the vector distances as opposed to just uploading all content (website text) to the storage and let the LLM to use it through a prompt. Means in this case the engine should basically do the same distance search, maybe cutting the material to smaller pieces than uploaded (?) and finally stick to the least distant for the answer. Do you think your more manual approach can generate better answers?

  • @juanluisesteban7394
    @juanluisesteban7394 2 หลายเดือนก่อน +1

    Really good. Thanks!

  • @raghureddy1746
    @raghureddy1746 2 หลายเดือนก่อน +1

    Thank you Thank you!!!!

  • @chetanpdeshpande3439
    @chetanpdeshpande3439 7 หลายเดือนก่อน +2

    This was fantastic! Thank you!

  • @nabinbhusalofficial
    @nabinbhusalofficial 5 หลายเดือนก่อน

    Very Helpful, Can you Please make a video on RAG for low resource language like Nepali?

  • @bullsbearsfight
    @bullsbearsfight 5 หลายเดือนก่อน

    awesome explanation

  • @mohammadnour8114
    @mohammadnour8114 5 หลายเดือนก่อน +1

    can you share the notebook and the dataset?

  • @peterezkka
    @peterezkka 7 หลายเดือนก่อน

    love it, thank you! I appreciate these videos.

  • @baladin06
    @baladin06 หลายเดือนก่อน

    Thanks!

  • @appa9450
    @appa9450 5 หลายเดือนก่อน

    My question is, say you want to make a chatbot that gives responses to questions about workflow and how to do certain things within a company. Would it be possible to connect it to a database that holds lots of documentation about a company?

    • @dwoodlock
      @dwoodlock  5 หลายเดือนก่อน

      Yes certainly. With the approach I outlined, you can connect to documents within a database. You would just need to store the embeddings into the database itself, or a separate database that supports that.

  • @sibims653
    @sibims653 7 หลายเดือนก่อน +1

    Greate Explanation

  • @asheeshmathur
    @asheeshmathur 6 หลายเดือนก่อน

    Excellent, made so simple.😃

  • @ionuta.5359
    @ionuta.5359 6 หลายเดือนก่อน

    Simple and clear! Thanks!

  • @maa1dz1333q2eqER
    @maa1dz1333q2eqER 4 หลายเดือนก่อน

    Great Job, Thanks!!!!

  • @mzimmerman1988
    @mzimmerman1988 6 หลายเดือนก่อน +1

    helpful! thanks.

  • @fernandes1431
    @fernandes1431 5 หลายเดือนก่อน +1

    how do you get the webpage dataset?

    • @dwoodlock
      @dwoodlock  5 หลายเดือนก่อน +1

      I wrote a little python script that scraped the website by navigating it from the top like a tree-based search and stored the text of each page.

  • @gHosT_scovoxwZ
    @gHosT_scovoxwZ 3 หลายเดือนก่อน

    can you please share link to get the csv file, so that i can practice setting up this.

  • @gridplan
    @gridplan 5 หลายเดือนก่อน

    Suppose I wanted the answer to cite its source(s) -- in your example, the web page(s) that went into the final answer. How would that change this approach? Presumably, I wouldn't want to encode the web page's URL in the same vector as the text as that would skew the similarity calculation (dot product).

    • @dwoodlock
      @dwoodlock  5 หลายเดือนก่อน +1

      Yes - you could put a reference in each of the documents that you choose and ask the LLM for the references back for the sections that it used. Given their are cryptic (like you could replace the URLs with unique IDs), they wouldn't really skew the embeddings.

    • @gridplan
      @gridplan 5 หลายเดือนก่อน

      @@dwoodlock That makes sense, especially if I use IDs rather than URLs like you're suggesting. Thank you!

  • @cj5214
    @cj5214 2 หลายเดือนก่อน

    this is my first Vedio on AI, just trying this sample code, it keep asking API key ? how can i manage it?

  • @Tessi42
    @Tessi42 6 หลายเดือนก่อน

    Wow, Thank You

  • @pptmtz
    @pptmtz 5 หลายเดือนก่อน

    Thanks, this is so cool

  • @mdtanbeerjubaer584
    @mdtanbeerjubaer584 3 หลายเดือนก่อน

    Can you please share the csv file so that i can practice. I am a new researcher will learn these for my thesis.

  • @050zaher
    @050zaher 4 หลายเดือนก่อน

    You are great

  • @Sanu-pl8is
    @Sanu-pl8is 6 หลายเดือนก่อน

    How can we use a csv file with multiple fields as the prompt may require to evaluate all fields and answer the best one.

    • @dwoodlock
      @dwoodlock  5 หลายเดือนก่อน

      Most people will turn the multiple fields into a sentence or a paragraph. Then you have a document.

  • @pjali1
    @pjali1 6 หลายเดือนก่อน

    Awesome Sir !!

  • @jehnzcode
    @jehnzcode 5 หลายเดือนก่อน

    👏 new subscriber here

  • @rahuln3977
    @rahuln3977 5 หลายเดือนก่อน

    hi sir can i get this code to replicate on my jupyter notebook

  • @sohans3550
    @sohans3550 5 หลายเดือนก่อน

    is it possible to share CSV file Don?

  • @matiasgarces2465
    @matiasgarces2465 5 หลายเดือนก่อน

    For dummies please!!! step by step

  • @paultoensing3126
    @paultoensing3126 6 หลายเดือนก่อน

    So what is df?

    • @agusriyadi6802
      @agusriyadi6802 6 หลายเดือนก่อน

      Distance Formula I think

    • @dwoodlock
      @dwoodlock  5 หลายเดือนก่อน

      The primary data structure used in data science is called a Data Frame. It's just like a SQL table. The rows are typically instances and the columns are typically attributes/fields/properties of each instance. Hence the variable df.

  • @shiyabrezel
    @shiyabrezel 4 หลายเดือนก่อน

    That’s

  • @paultoensing3126
    @paultoensing3126 6 หลายเดือนก่อน

    What is concatenate?