“I want to give ChatGPT 10x more docs” - RAG Explained

  • Published on 4 Jan 2025

Comments • 80

  • @bryandotmee
    @bryandotmee 5 months ago +17

    This is sooo important; the PDF size limits have been a blocker for analysis of docs that are over a few pages.

  • @02167631
    @02167631 5 months ago +8

    You are such a gifted teacher. The way you explain things in a way that makes such a foreign concept easy to understand is magical.

    • @zaneperrydotcom
      @zaneperrydotcom 4 months ago +1

      I agree; well done. Don't change a thing. This is the level of helpful explanation we are looking for. Well done!

  • @RaxLakhani
    @RaxLakhani 4 months ago

    You really did a good job in helping me understand the various terms and functions around LLM queries. Thanks!

  • @headoverheels88
    @headoverheels88 10 days ago

    Perfectly explained what I needed for my company. I knew what I needed wasn't particularly complicated, but I just didn't have the tools; the auto-chunking is what convinced me to pursue this option, as this was the missing link. Thank you.

  • @Zhaku69
    @Zhaku69 5 months ago +12

    This video likely just solved an issue I was trying to fix for my company. Love these more teacher-vibe type videos, thanks so much!!

    • @RalucaMMihai
      @RalucaMMihai 4 months ago +1

      In his community he does lots of lectures...

    • @RalucaMMihai
      @RalucaMMihai 4 months ago

      In his community he does lots of lectures. I recommend you check it out

  • @slackerpope
    @slackerpope 4 months ago +2

    I'm a student and writer. This video is exactly what I needed to get started with RAG. The last example was critical. Showing us how to hook up the knowledge base was huge. That step was not intuitive. I'll be referring to this video several times in the next month.
    Also, yes please, more education videos. THANK YOU!

  • @malcolmvanhilten125
    @malcolmvanhilten125 5 months ago +3

    Excellent, I think VectorShift should link to this vid from their website. A great tutorial for getting to know the product and especially its capabilities. More of these please, Igor.

  • @JonasMunnich
    @JonasMunnich 3 months ago

    You are so good at explaining more advanced concepts in simple terms - Really useful! Pls make more of these kinds of videos!

  • @danthompson8309
    @danthompson8309 4 months ago +1

    Thank you for the video, very helpful and clarifying. Yes, please, more of these technical primer and proof-of-concept videos. Thank you.

  • @tar-yy3ub
    @tar-yy3ub 5 months ago +7

    Honestly don't care that this was sponsored, this was useful content

    • @aiadvantage
      @aiadvantage 5 months ago

      That means a lot. I only do sponsored vids that I would want to make anyway :)

    • @kirlyukvv
      @kirlyukvv 4 months ago

      @aiadvantage Respect

  • @nicolaikalb
    @nicolaikalb 5 months ago +1

    Great video! Exactly what I was looking for. Keep on going with your wonderful style. Greetings from Cyprus.

  • @EriCraftCreations
    @EriCraftCreations 5 months ago +7

    I am so grateful that you explained what RAG is in an easy way to understand. ❤

    • @aiadvantage
      @aiadvantage 5 months ago +5

      This topic has been on my list for YouTube for a while. Glad you enjoyed the explainer.

    • @pandipatipavan3804
      @pandipatipavan3804 5 months ago +2

      Hi @aiadvantage, brother, can you please make a complete video on how to earn money using free AI tools, as soon as possible? 🙏 I am from India and I really need money since I lost my job due to the recession, brother. I think it would be very helpful if you could show me a way by making a complete video on how to use AI to make money easily in Indian rupees, and which websites pay real money quickly. I will be eagerly waiting for your kind response. Thank you so much, brother 🙏

  • @Hall
    @Hall 4 months ago

    ❤ These educational videos are very valuable. You are a good teacher. The first half especially upgraded my AI approach. Thank you!

  • @kamelirzouni
    @kamelirzouni 4 months ago +1

    Thank you!

  • @ekot0419
    @ekot0419 5 months ago +2

    Wow. I can't agree more with what you said in this video, 8 minutes into it. Thank you. Yeah, I burned myself out trying to understand what agents are about.

  • @hope42
    @hope42 5 months ago +3

    Back when me and Al Gore invented the internet, I remember hearing about my first fun memory of automation, like this story:
    Around the early 1980s, a group of programmers at Carnegie Mellon University connected their Coke vending machine to the ARPANET, the forerunner of the internet. This allowed them to check the status of the machine remotely, particularly to see if there were cold sodas available or if it needed to be refilled. This is often cited as one of the earliest examples of an internet-connected appliance, a precursor to what we now call the Internet of Things (IoT).

  • @SomogyiCosmin
    @SomogyiCosmin 4 months ago

    Wow. Thank you, Igor, for sharing this knowledge. I think these kinds of videos are very useful because of the real, live examples.

  • @dannyquiroz5777
    @dannyquiroz5777 5 months ago +2

    Nice video. I wasn’t aware of this tool; it could be useful for many use cases.

  • @kamelirzouni
    @kamelirzouni 4 months ago

    Thank you very much for this impressive work. This is one of my favorite videos you have produced.😍

    • @aiadvantage
      @aiadvantage 4 months ago +1

      Alright noted! Thanks for the kind and generous comment

  • @ttvd94
    @ttvd94 a month ago

    one of the best explanations I've seen on YT

  • @curtcooper5465
    @curtcooper5465 5 months ago +3

    Very much thankful for this man.

  • @ChrisBarkerZA
    @ChrisBarkerZA a month ago

    SUCH a good demo, so clear and crisp and useful. Well done. I can see how RAG is great for specific questions, as per the chainsaw demo, but is it the approach I should use for a grant proposal writing system, where I want to load 10 past grant proposals and the system needs to structure and write new proposals given some objective? I would want the system to generate very long-form output and to make use of the past proposals for some (but not all) of the info.

  • @maniacatthecontrols
    @maniacatthecontrols 4 months ago

    Yes, I prefer educational tutorials like this. Thank you.

  • @ivanjosegomes9599
    @ivanjosegomes9599 5 months ago

    Great stuff, great video format, great teacher!

  • @fcmelo1
    @fcmelo1 5 months ago +1

    It was exactly what I was expecting. Thank you.

  • @CM-zl2jw
    @CM-zl2jw 5 months ago

    Brilliant video- the Zombies have arrived and we need to be prepared!! 😂😂. Thanks for the chuckle!! much appreciated! Thank you.
    I like the “completionist Igor”. Others have said it… you are a gifted and highly talented sharer of knowledge. Love learning how to 🐟 🙏

  • @Mr_Chiro_
    @Mr_Chiro_ 5 months ago

    Great explanation of everything. Even with my advanced knowledge this was a very good explanation.

    • @aiadvantage
      @aiadvantage 5 months ago +2

      Love to hear that! The definitions in this space are getting fuzzy at best so more discussions on topics like this really are necessary (at all levels). I feel like no two people could agree on the definition of "AI" or "Agents" let alone "AGI".

  • @sandiegojoey1
    @sandiegojoey1 5 months ago +1

    I haven't finished the video yet, so sorry if you covered this. One of the things I want to be able to do is to have an LLM read a document and when I ask questions about the document, I need the LLM to cite the source of its answers by highlighting the specific passages that support its answer. Is this possible yet?

    • @CM-zl2jw
      @CM-zl2jw 5 months ago +1

      Should be easy if you give it a prompt… just say what you said here to the LLM.
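To make the "just give it a prompt" suggestion concrete, here is a minimal sketch of how retrieved passages could be numbered and wrapped in instructions that ask the model to cite them. The function name `build_grounded_prompt`, the wording of the instructions, and the sample passages are all assumptions made for illustration; nothing here is VectorShift's or ChatGPT's actual behavior, and a prompt like this only encourages citations, it does not guarantee them.

```python
def build_grounded_prompt(question, passages):
    """Build a prompt that asks for answers backed by numbered passages.

    Hypothetical helper: the instruction wording is an assumption,
    not an API of any particular product.
    """
    # Number each passage so the model has something concrete to cite.
    numbered = "\n".join(f"[{i}] {p}" for i, p in enumerate(passages, start=1))
    return (
        "Answer the question using only the numbered passages below.\n"
        "After each claim, cite the passage number in square brackets.\n"
        "If the passages do not contain the answer, reply: I don't know.\n\n"
        f"Passages:\n{numbered}\n\n"
        f"Question: {question}\n"
    )


# Made-up passages standing in for chunks retrieved from a document.
passages = [
    "Always wear gloves and goggles when operating the chainsaw.",
    "Sharpen the chain after every five hours of use.",
]
prompt = build_grounded_prompt("How often should I sharpen the chain?", passages)
print(prompt)
```

The resulting string would be sent to whichever LLM you use. Because each passage carries a number, an answer such as "after every five hours of use [2]" can be mapped back to the source text and highlighted by your own code.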

  • @writetopardeep
    @writetopardeep 5 months ago

    Brilliant video man

  • @slackerpope
    @slackerpope 4 months ago

    Does using RAG force the LLM to only reference your context data to formulate its reply? I'd like to eliminate hallucinations so I can be confident that the output is actually relevant. Example: I'd like to feed a pdf of a play into RAG and know that the LLM is only referring to my document while formulating answers and not hallucinating material that is similar. Any information on how to keep the LLM from hallucinating while I'm trying to chat with my documents would be greatly appreciated.

    • @rapidAIWebDev
      @rapidAIWebDev 4 months ago

      RAG-based implementations hallucinate because LLMs often treat certain words as being identical to each other even when they are referring to different things. For example, GPT-3.5 Turbo surprisingly treats calcium and magnesium as if they are the same thing. (Demo: th-cam.com/video/K4Wg6QzPfyI/w-d-xo.htmlsi=kqlrJCfbYcT-j67x&t=412 Time: 06:52) GPT-4 treats Afonso and Alfonso as if they are the same, even though they are distinctly different names. (Demo: th-cam.com/video/K4Wg6QzPfyI/w-d-xo.htmlsi=9gizAFSk7OTEZabe&t=554 Time: 09:14) If you want to use RAG, your implementation must address this issue or you will still have hallucinations. In fact, this is the most important issue that your code needs to address. You could use something like RAGFix to resolve this and eliminate hallucinations.

  • @rapidAIWebDev
    @rapidAIWebDev 4 months ago

    Even with RAG, ChatGPT hallucinates up to 23% of the time. The reason is that LLMs often treat certain words as being identical to each other even when they are referring to different things. For example, GPT-3.5 Turbo surprisingly treats calcium and magnesium as if they are the same thing. (Demo: th-cam.com/video/K4Wg6QzPfyI/w-d-xo.htmlsi=kqlrJCfbYcT-j67x&t=412 Time: 06:52) GPT-4 treats Afonso and Alfonso as if they are the same, even though they are distinctly different names. (Demo: th-cam.com/video/K4Wg6QzPfyI/w-d-xo.htmlsi=9gizAFSk7OTEZabe&t=554 Time: 09:14) If you want to use RAG, your implementation must address this issue or you will still have hallucinations. In fact, this is the most important issue that your code needs to address.

    • @kirlyukvv
      @kirlyukvv 4 months ago

      And how to solve it?

    • @rapidAIWebDev
      @rapidAIWebDev 4 months ago

      @kirlyukvv At 14:04, the video uses PubMed IDs to teach how Noun-Phrase routes work. The video then applies this to a real-world RAG problem to show how to alter the context to remove conflicting noun-phrase routes. That's the methodology for solving it. And if you want it fully automated, you can use a service like RAGFix that takes care of it for you.

  • @LuisMaaan
    @LuisMaaan 4 months ago

    Is it possible to add some kind of database to VectorShift for storing the histories of users' unique chat sessions? The goal is to build a customer support bot that can store past conversations and pull them in as context too.

  • @SupahNin10dohp
    @SupahNin10dohp 5 months ago

    Do you know whether this increases the file size or context length limit too? I have PDFs that even Google Gemini, with its 1 million token limit, won't read, or when it does, it gives a short answer.

  • @697sona
    @697sona 4 months ago

    Nice demonstration. I have a question about chunks: how would I know beforehand the number of chunks required to answer a query? You set the chunk count to 2 before you gave the query about chainsaws.

    • @aiadvantage
      @aiadvantage 4 months ago

      It all depends on the purpose of the chatbot. If the goal is to answer specific queries, then 2 chunks is what you want. If there are many documents and the answers should pull together a lot of varied info, then you will need more. It all depends on the goal and the knowledge base (and how it's structured).

    • @697sona
      @697sona 4 months ago

      @aiadvantage I understand, thanks. But what if there was another mention of chainsaws that you were unaware of and you chose 2 chunks? In that case, which 2 chunks will be used and which will be left out?
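As a rough illustration of the "which 2 chunks" question: retrieval systems typically score every chunk against the query and keep only the highest-scoring ones, so additional chainsaw mentions that score lower are simply dropped. The sketch below assumes a toy word-count "embedding" and cosine similarity; real tools use learned embedding models, and how VectorShift ranks chunks internally is not stated in this thread. The chunk texts and function names are invented for the example.

```python
import math
from collections import Counter


def embed(text):
    # Toy stand-in for a real embedding model: lowercase word counts.
    return Counter(text.lower().split())


def cosine(a, b):
    # Cosine similarity between two sparse word-count vectors.
    dot = sum(a[w] * b[w] for w in a if w in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def top_k_chunks(query, chunks, k=2):
    # Score every chunk against the query, then keep the k best.
    q = embed(query)
    scored = [(cosine(q, embed(c)), c) for c in chunks]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return scored[:k]


# Hypothetical knowledge-base chunks; three of the four mention chainsaws.
chunks = [
    "chainsaw safety wear gloves and goggles",
    "chainsaw maintenance sharpen the chain regularly",
    "the warranty covers the chainsaw motor for two years",
    "zombie sightings should be reported to the authorities",
]

for score, chunk in top_k_chunks("how do I sharpen my chainsaw chain", chunks, k=2):
    print(f"{score:.2f}  {chunk}")
```

With k=2, the maintenance chunk ranks first because it shares the most words with the query, the safety chunk comes second, and the warranty chunk is left out even though it also mentions a chainsaw. Raising k trades a longer prompt for a lower chance of missing a relevant chunk.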

  • @africanqueen007
    @africanqueen007 4 months ago

    What happens to the documents uploaded? Does VectorShift preserve confidentiality?

  • @EddyRoAguilar
    @EddyRoAguilar 4 months ago

    Such great content! Please do more like it, but as you said, be conservative with sponsorships so it stays credible. 💯

  • @Bmutch966
    @Bmutch966 5 months ago +3

    Please do more educational videos :)

  • @GiveMeTheMic22
    @GiveMeTheMic22 4 months ago

    Amazing thumbnail, is this Midjourney?

  • @Bmutch966
    @Bmutch966 5 months ago

    Awesome video! Thanks!

  • @waqqashanafi
    @waqqashanafi 4 months ago

    Doesn't Google Cloud suite already do this for free? (with their AI Chatbot builder tool)

  • @Mr_Chiro_
    @Mr_Chiro_ 5 months ago

    I would also promote Flowise for building RAG backends

  • @thequantumsword
    @thequantumsword 5 months ago +1

    Thanks so much for this.
    Could anyone help me understand why investing in Vectorshift is superior to just building a custom GPT in ChatGPT Plus?
    I'm asking in the context of writing an entire academic textbook.

    • @aiadvantage
      @aiadvantage 5 months ago +3

      GPTs always require user interaction. You can't just let a GPT run in the background as opposed to the Vectorshift pipeline which you can deploy and it just runs 24/7 and waits for the trigger to happen. Also GPTs can't be added to a website as a standalone chatbot and you don't get to branch out the interactions like you can here. Hope that makes sense

    • @alexanderahamilton6123
      @alexanderahamilton6123 5 months ago +1

      @aiadvantage Also a writer here. So my understanding, then, is that if I only need to use this as a tool to retrieve and utilize information for myself for my own project or projects, I can just upload files to a ChatGPT interface as I have done previously, and there's no additional advantage to my building a knowledge base. Or is there...? Thanks much, great work.

  • @plaverbach
    @plaverbach 5 months ago

    Just tried to find out if I should cancel my ChatGPT subscription and get Copilot Pro. I could not find anything up to date on YouTube!
    IDK if this is an opportunity for you.

    • @aiadvantage
      @aiadvantage 5 months ago +2

      Copilot is worse in every way except that it's safe to use for corporations, so I would stick with ChatGPT.

    • @plaverbach
      @plaverbach 5 months ago

      And integration with MS Office. My friends have been telling me it is crazy!!

  • @RedTick2
    @RedTick2 4 months ago

    Great video! Very cool tool, however the price ramp seems very high. The free level of course is a great place to start, but very quickly I can see needing a bit more to practice and try out flows, but I get that that should not be free but 20 bucks a month is a lot considering at this stage of "play" this is not yet commercially viable and may never be. In other words, I would very much like the ability to have useful conversations with documents, or in my case obsidian files/DB's and I am willing to pay, but I can't justify 20 bucks a month. That's on top of my other subscriptions to make all this work. Is this just me or does this feel pricy for a hobby application? I understand it's reasonable for commercial applications.

  • @yassinbadermedia
    @yassinbadermedia 5 months ago

    Thanks, bro

  • @jamessukka
    @jamessukka 5 months ago +3

    Amazing tutorial, Igor. 👏👏👏 I have long wondered whether we can create a knowledge base from PDFs that contain images as well. If that's possible, please direct me to where I can learn about it.

  • @collabcomm9007
    @collabcomm9007 5 months ago

    Dude, how do you make those thumbnails? Please tell me.

    • @aiadvantage
      @aiadvantage 5 months ago +1

      We have a very detailed step by step guide in our community 😁 Might have to make it public soon because so many people keep asking haha

  • @learnwhispering
    @learnwhispering 5 months ago

    I'd rather learn to have wild apex dinosaurs 🦎🦎 crawl on me (and I have)... but learning this means one can survive the AI event horizon that will obliterate the middle class. 💕💕 Thank you for the vids.

    • @CM-zl2jw
      @CM-zl2jw 5 months ago

      🤔 interesting comment.

  • @kellymaxwell8468
    @kellymaxwell8468 5 months ago

    So will this help with games? How will this help with games?
    We need an AI agent that can reason, code, program, script, and map, so it can break games down, create art assets, and do long-term planning. It needs better reasoning so it can actually make a game rather than just write it out, or be able to put those ideas into REALITY. And maybe being able to remember and search the entire conversation, which is needed for role-playing and making games.

  • @angloland4539
    @angloland4539 5 months ago

    ♥️

  • @UchechukwuOnuoha-cw4cl
    @UchechukwuOnuoha-cw4cl 5 months ago

    I think 🤔 that I am the only Nigerian (African) in this channel ❤

  • @tedbozarth8300
    @tedbozarth8300 4 months ago

    The AI industry should align on one key priority / to stop using the word agentic 😂

  • @markjohnson1823
    @markjohnson1823 a month ago

    Don’t use VectorShift. It’s a waste of time. You can’t talk with your knowledge data. The team over at VectorShift does not understand how to build a RAG platform. You’re better off building your own.

  • @hope42
    @hope42 5 months ago

    I just asked Gemini Pro 1.5 Experimental (2 million token context window): "Are you a vector DB?" It said kinda, then diagrammed it for me with Mermaid code:
    graph LR
    A[Your Input] --> B(Tokenization);
    B --> C(Embedding Creation);
    C --> D{Similarity Search};
    D -- Closest Embeddings --> E(Retrieve Information);
    E --> F{Response Generation};
    F --> G[My Output]