Chatbots with RAG: LangChain Full Walkthrough

แชร์
ฝัง
  • เผยแพร่เมื่อ 15 ม.ค. 2025

ความคิดเห็น • 107

  • @niatro
    @niatro ปีที่แล้ว +20

    🎯 Key Takeaways for quick navigation:
    00:00 🤖 This video demonstrates how to build a chatbot using retrieval augmented generation (RAG) with OpenAI's GPT-3.5 model to answer questions about recent events or internal documentation.
    02:58 💡 Language models like GPT-3.5 can sometimes provide inaccurate or hallucinated information because they rely solely on their training data and don't have access to external knowledge.
    08:16 📚 Chatbots can use external knowledge or "Source knowledge" to improve responses. This can be done by inserting context or documents into the prompt.
    16:09 🔍 Retrieval Augmented Generation (RAG) combines parametric knowledge from the model's training with source knowledge from external documents, allowing the model to access and update information.
    19:33 🧠 Pinecone is used as a vector database to store embeddings of external documents for RAG, allowing for efficient retrieval and updating of knowledge.
    23:44 🚀 Embedding documents for retrieval in RAG is typically done in batches to avoid overloading the model and network with too many embeddings at once.Is there anything specific you would like to know more about or any questions you have regarding the key takeaways from the video transcript?
    24:25 📊 Preparing external documents for retrieval with Pinecone should consider batch size limitations to avoid problems.
    26:00 📄 Initializing a Vector database and embedding text chunks for retrieval with Pinecone's Lineage.
    27:07 🤖 Using source knowledge from external documents to augment prompts and improve chatbot responses.
    28:43 🧠 Augmenting queries with source knowledge for chatbot responses, demonstrating improved answer quality.
    31:14 🛡️ Retrieval Augmented Generation (RAG) can significantly enhance the retrieval performance of chatbots by utilizing external knowledge.
    32:23 ⏳ Implementing RAG with a simple approach but noting that it may not be suitable for all types of queries.
    33:47 📈 Benefits of using RAG include improved retrieval performance, citation of information sources, and faster response times.
    34:43 💰 Considerations when using RAG include token usage, cost, and potential performance degradation when feeding too much information into the chatbot.
    Made with HARPA AI

  • @SmitModh-d4z
    @SmitModh-d4z ปีที่แล้ว +3

    very informative video, got understanding of RAG, and LangChain as well, successfully created my very first RAG based chatbot, Learned a lot 👍

  • @Cdaprod
    @Cdaprod ปีที่แล้ว +2

    Nice glad you ran over this I’ve been stuck on part of my RAG but I’m sure your going to clear that up! 🎉thanks

  • @nadranaj
    @nadranaj ปีที่แล้ว +2

    Excellent tutorial with clear comparison before and after RAG. The Python code walkthrough is useful.

  • @talharauf3111
    @talharauf3111 ปีที่แล้ว +1

    Amazing James!
    Really Helpful Material

  • @andersonmatheuslemke9345
    @andersonmatheuslemke9345 10 หลายเดือนก่อน

    Nice video! What is this editor?

  • @austinmontoya314
    @austinmontoya314 ปีที่แล้ว

    I love your statue in the background

  • @francesco.miliani
    @francesco.miliani ปีที่แล้ว

    Great video! Congrats 👏 Excellent speaking capability, keep it up!

  • @kevinb3300
    @kevinb3300 ปีที่แล้ว +4

    Nice video. Any tips for how to handle the "Hi, how are you?" case where you don't want it to do the retrieval part? I'm thinking there must be a way to limit retrieval results based on a similarity score threshold or something. Thanks!

  • @iddoshemtovv
    @iddoshemtovv ปีที่แล้ว

    thanks a lot buddy, keep on the hard work you help me so much!

  • @Tanaka-Buchou
    @Tanaka-Buchou 11 หลายเดือนก่อน +2

    Oh my! Tate is now a developer in the matrix.

    • @jamesbriggs
      @jamesbriggs  11 หลายเดือนก่อน +1

      the matrix got me in the end

    • @Tanaka-Buchou
      @Tanaka-Buchou 11 หลายเดือนก่อน

      @@jamesbriggs 🤣🤣 Nevertheless, Thanks for the efforts you put in to create awesome contents.

  • @Taranggpt6
    @Taranggpt6 ปีที่แล้ว

    Hey, Wonderful Explanation,
    Could you please help me understand one thing,
    at 30:35, Jupyter Cell No - 36, The response from augmented prompt (about speciality about Llama 2) isn't been appended to the messages. How come in the next run when trying "Safety measures" prompt, model already has some information about the Llama 2 when we haven't actually given the model the chat history??

  • @raminzandvakili7418
    @raminzandvakili7418 ปีที่แล้ว +3

    Thank you for your videos. They are all awsome. Could you please also talk about how can we extract more granual meta-data from our documents (like page number for example), and how we can force an agent to add the meta-data to the final answer it provides?

  • @adriaanb7371
    @adriaanb7371 ปีที่แล้ว +1

    Is there a working QA model that knows the Langchain documentation? The official ones don't seem to work

  • @tejagunupudi5318
    @tejagunupudi5318 ปีที่แล้ว

    This is nice but do you also have any video with weviate?

  • @abudhabi9850
    @abudhabi9850 ปีที่แล้ว

    Hey man, love ur vids. Great quality

  • @andriusem
    @andriusem ปีที่แล้ว +1

    Great walkthrough as always. My auestion would be is it beter to use LLM chain like conversational chain or chatbot is better for RAG? also RAG Agents, althoug they run on huge amaount of tokens.

  • @DreamsAPI
    @DreamsAPI ปีที่แล้ว +7

    A minute after being uploaded, I am watching

  • @xspydazx
    @xspydazx 10 หลายเดือนก่อน

    ok generally a good talk.... Similar to the q&A with Context...
    We need to understand that the LLM < Longterm Base memory and the LocalDB is also An Augmentation to Updated Longterm Memory , We could say its a form of Short Term Memory as it is Updatable and Appendable ... the chat memory is the Working Memory (disposable unless stored into Short Term Memory DB) .... In a well designed model the short term memory can be used to FINE TUNE the LLM ie Updating the base knowledge in some form of interval to keep the base memory upto date, at this point new methodologies which use the " Question/Instruct - Context - Answer/Response) structure... the new base model can be redistributed ... for users ... Subsets of the short term DB can be then deleted and Specialised Domain knowledge HELD and NOT UPDATED Into the base Model (only desired fine tuning sets to perform specific instructs (such as diagnosis)...or ROLE PLAYING.... only a small subset needs to be fine tuned to allow for the new ability or skill.... where the actual knowledge can still be held locally in domain based knowledge vectorDB....
    One thing people have seem to have forgotten about vector DB is that they are still databases hence can be structured ! and even relational ... hence converting existing databases to vector databases with structured based querys available for use by the search model (new instruct tuning required - for each query added)... ie the model could be used to search a specific patient history and use as context.... COnnection to the real world scenario of diagnosis already existing in the domain ... enabling for rich search and querying and diagnosis as well as possible discussive sessions with the "doctor or nurse" utilizing the system etc... SO the use case i described could occur...
    We are getting close to "Machine that have rationality as humans rationalize" (think as humans).... with function calling and embedded (function calling) we get to the "machine which act like humans" with a large action model... ie: functions have been prestored in the training dataset, with executing pathways for completions... for specific actions... ie calculate A+B it should have access to a calculate function ! to be executed if possible ...or returned as the response to calculate ... given a response the llm system its connected to can execute in pythonnet.... possibitys endless ..
    Tutorial ReView:
    Problems: today no actual code was displayed for the full models .... As well as each point you discuss should end with enforcement : IE an image of FULL STOP! .... hence not easy to follow your train of thought or end point ! this you need to improve as the content is fine but the presentation is not quite right! ...(its Full stops ! usually done with voice intonation and pauses bro! perhaps the editing has removed these important aspects.... as well as often discussing things which could have also been displayed on the screen to enforce the idea.... switching back to the original diagram being discussed! ) hence sometimes it seems though your randomly speaking and drifting off.... i would hate to have you as a teacher when you get older !!!! it would be unbearable ! (knowing you have the knowledge i need but cant get it from you!) .....
    Tutorial Request:
    1. How to run multiple models at the same time;
    Reason : when creating a multi agent system ... we have specific model which are specialists and other which are generalists.. ie the generalists can be prompted to act as a specialist but cannot outperform the specialist on its specialization... hence requiring multiple models ...
    2. How to merge models : (LLAMA-Based GGUF) ?
    Some models have vastly improved and the keeping of multiple models often is not viable so merged models can provide a way to combine knowledge bases and skills.
    3. How to finetune a model to have a larger input context ???
    As you notice there are multiple prompts being combined into a super large prompt some of my favourite models are small context windows how can they be improved ? by adding ROPE? or other methods.... so how to really fine tune a model to have new skills... creating the multimode model .or mixture of experts such as mistral and code llama and white rabbit! and phi ?... ? as you notice the super charged models which are proposed for open source are being bought by cloud service providers ! hence creating such model is also need to know before they are no longer available or even finetuned down to unusable models ... ie refusing to answer questions deemed Dangerous by the POWERS AT BE!
    Or AGAINST FAIR COMPETITION ... ie AI creating Fully working apps !(always not quite right !) ... or not even able to create a transformer ? or GPT model from scratch !... BLOCKED!
    4. How to build Relational databases in Pinecone or a vector DB...
    4a. how to craft queries which will return relational results ..
    4b how to convert / import existing database table into pinecone or a vector DB..
    4c. how to create stored querys which match various question structure which may be asked by the augmented chat system , ie the domain results will be returned from the vector db, and presented as context to the query ie langchain has a table query system ? .
    Thanks !

  • @Rimidalvk
    @Rimidalvk ปีที่แล้ว +4

    🎯 Key Takeaways for quick navigation:
    00:00 🤖 The video will cover building a chatbot using retrieval augmented generation, starting from scratch.
    00:26 🧠 The chatbot will use OpenAI's GPT-3.5 model and the LangChain library to answer questions about recent events or internal documentation.
    02:43 🤯 Language models like GPT-3.5 can hallucinate answers when they lack knowledge about specific topics not covered in their training data.
    07:09 🧩 Chat logs with models like GPT-3.5 can be extended by appending AI-generated responses, creating dynamic conversations.
    09:29 💡 Language models like GPT-3.5 rely solely on knowledge acquired during training and have no access to external information.
    11:20 🔍 Retrieval Augmented Generation (RAG) adds external knowledge or context to a language model to improve its responses.
    14:03 🤔 Language models, like GPT-3.5, may include safeguards to handle questions they can't answer, indicating their limitations.
    16:09 📚 Adding source knowledge to the model's input through the prompt can improve its responses to specific queries.
    19:33 🚀 Retrieval Augmented Generation (RAG) combines language models with external knowledge sources to enhance responses.
    24:25 🔄 When embedding data for knowledge bases, it's essential to batch the process to avoid errors and to consider API limitations.
    24:37 📊 Building a fully-fledged Vector database for knowledge retrieval is a crucial step in integrating external knowledge into a language model (LM).
    28:30 🧠 Augmenting queries with relevant context from a Vector database enhances the LM's ability to provide informative answers.
    30:46 🤖 Language models benefit from augmented prompts with context, allowing them to better answer questions based on the provided information.
    33:19 💡 Implementing Retrieval-Augmented Generation (RAG) can significantly improve retrieval performance and answer a wide range of questions accurately.
    34:00 💬 When implementing RAG, it's essential to consider token usage, costs, and the balance between external knowledge and LM performance.
    Made with HARPA AI

  • @rowboboat8307
    @rowboboat8307 ปีที่แล้ว +2

    This is great. Are there any references for chunking the source docs and marking the chunks with metadata that you would recommend?

    • @jamesbriggs
      @jamesbriggs  ปีที่แล้ว +2

      yeah for sure, check out this video th-cam.com/video/eqOfr4AGLk8/w-d-xo.html :)

    • @TheCloudShepherd
      @TheCloudShepherd ปีที่แล้ว

      @@jamesbriggs

  • @DavidJones-cw1ip
    @DavidJones-cw1ip ปีที่แล้ว

    Would really love to see you focus more on opensource

  • @elenakusevska6266
    @elenakusevska6266 ปีที่แล้ว

    I think the initial answers from the AI about string theory were correct. I'm not an expert either, but it seems like they were. Also string theory is largely present in media, popular culture... so it would make sense for the AI to be well informed about it :)

  • @stevenlevy8213
    @stevenlevy8213 ปีที่แล้ว +1

    @jamesbriggs In production - do you tend not to use the Conversation Retrieval Chain? In the example you posted- how would you add memory into it?Great videos as always

  • @aravindudupa957
    @aravindudupa957 ปีที่แล้ว +1

    Is there any technique that can be used to verify each sentence of the response with the given context? Except feeding it to the llm as that could have hallucinations and consumes openai tokens

  • @GOPALJIGUPTA-lt8hs
    @GOPALJIGUPTA-lt8hs 11 หลายเดือนก่อน

    This is an excellent in depth tutorial. I just have one question : can we do all these things using Azure OpenAI? How do we integrate that?

  • @micbab-vg2mu
    @micbab-vg2mu ปีที่แล้ว +1

    In my corporation, I am thinking of creating a similar chatbot to answer medical questions related to our drugs. We have a lot of products, so I wonder if there are limitations on the context that I can put into the vector database - maybe using some metadata to give the LLMs hints about which documents to search would help.

    • @jamesbriggs
      @jamesbriggs  ปีที่แล้ว +1

      You can add metadata filters when using RAG with AI agents, or you could maybe try hybrid search for better keyword scoring in your searched

    • @micbab-vg2mu
      @micbab-vg2mu ปีที่แล้ว

      I will start small and test to find the best solution. My bosses are very sceptic towards AI - I must build some case studies to show ROI. Thank you for help.@@jamesbriggs

    • @jamesbriggs
      @jamesbriggs  ปีที่แล้ว

      best of luck!

    • @mcdaddiest
      @mcdaddiest ปีที่แล้ว +1

      Great application. I used the OG LLaMa to do this for my Mom actually. She went on an experimental late stage FDA candidate and had some issues when she started treating some side effects. The Drug was new enough that it took a little unpacking to understand more about the chemical make up, but once we got the right info, it immediately caught an interaction with another drug she'd been prescribed to address side effects of the experimental drug. This was overlooked by an entire team of oncologists who were aware of the chemical make up of this drug & still gave her something that caused an interaction. I LOVE this use case! Best of luck!

    • @micbab-vg2mu
      @micbab-vg2mu ปีที่แล้ว

      We are planning to develop a Q&A chatbot for physicians in Canada. I see some challenges related to data preparation for retrieval, as well as monitoring, maintaining, and updating the chatbot performance. The building aspect is the easier part.@@mcdaddiest

  • @dipayansarkar8870
    @dipayansarkar8870 ปีที่แล้ว

    Very helpful video. Is this doable with Llama-2 instead of using ChatOpenAI() function from OpenAI? Basically I am trying to do the same with Llama instead of using OpenAI's GPT model

  • @xspydazx
    @xspydazx 8 หลายเดือนก่อน

    Question : once loading a vector store , how can we output a dataset from the store to be used as a fine tuning object ?

  • @hellkaiser45500
    @hellkaiser45500 ปีที่แล้ว

    Hi, your video was great . I have a question. What is the advantages to use pinecone over Faiss to store your vectors?

  • @stephanembatchou5300
    @stephanembatchou5300 ปีที่แล้ว +1

    What happens if the user later in the conversation asks a task that like tell me more. The rag will fails because it will retrieve "tell me more" content from the indexes, therefore add noisy informations to the chat memory.
    Chatbot with rag also need some reasoning, so the model decides whether or not to augment the context

    • @jamesbriggs
      @jamesbriggs  ปีที่แล้ว

      yeah you need agents, I made an earlier video on these th-cam.com/video/H6bCqqw9xyI/w-d-xo.html - hope that helps!

  • @xitzprofessor
    @xitzprofessor 3 หลายเดือนก่อน

    What IDE are you using here?

  • @KyouKo-x7g
    @KyouKo-x7g ปีที่แล้ว

    I have a problem, How to make the chatbot don't say "according to your ....." everytime ???

  • @mahmoudqahawish8919
    @mahmoudqahawish8919 ปีที่แล้ว +1

    would this work with other LLMs from huggingface ? and other embeddings like Chroma ?
    if so , what LLM would you recommend ?

  • @dhwanibro
    @dhwanibro 7 วันที่ผ่านมา

    Do i have to get openAI’s subscription ? Because i keep seeing rate limit error

  • @ginomsz
    @ginomsz ปีที่แล้ว

    Great video James!

    • @maximoguerrero
      @maximoguerrero ปีที่แล้ว

      How would you do rag and add coverstational history as well?

  • @da-bb2up
    @da-bb2up ปีที่แล้ว

    Thx for the video :) can you update your vector database by a few lines ( if you want to add data to your knowledge base) automatically by running a python script or something like that?

  • @QuickAIstories
    @QuickAIstories ปีที่แล้ว

    What if I have structured data that I want to feed to the RAG. Is that possible?

  • @noorameera26
    @noorameera26 6 หลายเดือนก่อน

    Question. For real-application chatbots, how does one store conversational messages say if you have hundreds of users using the chatbot?

  • @utkarshashinde9167
    @utkarshashinde9167 11 หลายเดือนก่อน +1

    is serverless architecture paid ? i am getting forbidden error

  • @palychbasist
    @palychbasist 3 วันที่ผ่านมา

    Thank you for your video. I watched dozens of videos about building RAG but they are always the same. It's always about Q&A approach and nothing more. What about building an assistant, maintaining dialogues, saving context and other stuff. For example it could be dialogue "USER: What is llama2? ASSISTANT: answers. USER: Where can I use it?" how can I perform search in my vector database in that case? I can't just use user's message for search. I need to form proper search query at first. Maybe you have video about it on your channel?

  • @polly28-9
    @polly28-9 7 หลายเดือนก่อน

    Thanks for the video! Well Done! I want to know how to make the chatbot to return a list of results. Not only one result, but a list of relevant answer to the input question. I do not know what to change: search index, search parameters, metric_type or what? Can you help me? Thanks!

  • @therealsamho
    @therealsamho ปีที่แล้ว

    would it be possible to use some form of RAG pipeline to help improve an LLM's abliity to classify things that are in a niche domain? eg let's say we have 50 unnusual domains (that weren't in the LLM's training data, or at least, little exposure). We create a vector store where we have 50 labels with a few thousand examples of how each label is expressed. Would that work?

  • @gbisaga
    @gbisaga ปีที่แล้ว

    When trying to import pinecone I get errors with the message:
    If you cannot immediately regenerate your protos, some other possible workarounds are:
    1. Downgrade the protobuf package to 3.20.x or lower.
    2. Set PROTOCOL_BUFFERS_PYTHON_IMPLEMENTATION=python (but this will use pure-Python parsing and will be much slower).
    I've tried everything I can think of. No idea how to do this.

  • @emmanueladebiyi2109
    @emmanueladebiyi2109 ปีที่แล้ว

    Very well described!

  • @oscaralberto6835
    @oscaralberto6835 ปีที่แล้ว

    Hello, I don't get it. What's the difference between the RAG, ConversationalRetrievalChain and RetrievalQA?

  • @vigneshls8590
    @vigneshls8590 ปีที่แล้ว

    The major problem I face is, once the embedded data is loaded into the vector db, how do I delete it?.
    May be I want to delete all data that I fetched from a website and keep everything that I fetched from a SharePoint. How can the embedded data be isolated and deleted ?

  • @chuanjiang6931
    @chuanjiang6931 ปีที่แล้ว

    Is ChatGLM2-6B supported in this colab notebook?

  • @hmind9836
    @hmind9836 ปีที่แล้ว +1

    Can you please make a video with an opensource free vector database? Like chromadb, for example. I love your videos ^^

  • @caiyu538
    @caiyu538 ปีที่แล้ว

    Great lecture

  • @ayeshajan4891
    @ayeshajan4891 6 หลายเดือนก่อน

    Hi, Im facing the following error i'd appreciate it if you could guide me on how i could fix this : RROR: pip's dependency resolver does not currently take into account all the packages that are installed. This behaviour is the source of the following dependency conflicts.
    langchain-groq 0.1.6 requires langchain-core=0.2.2, but you have langchain-core 0.1.23 which is incompatible.
    langchain-text-splitters 0.2.2 requires langchain-core=0.2.10, but you have langchain-core 0.1.23 which is incompatible.

  • @rmehdi5871
    @rmehdi5871 ปีที่แล้ว

    What Pinecone has advantages over Chroma or Weaviate? Would everything work in your codebase, say, for Azure OpenAI and Chroma? For data security concerns, most enterprises prefer Azure OpenAI and more secured database choices for embedding. Would be interested to know your expert opinion.

  • @ajithbaburaj2547
    @ajithbaburaj2547 ปีที่แล้ว

    excellent tutorial

  • @SattwikPanda-b4m
    @SattwikPanda-b4m ปีที่แล้ว

    I was wondering if is there any way to get the match score in the Langchain version of the RAG output? Retrieving results from Pinecone gives us the core metrics as well, but Langchain documentation does not mention anything. Any insight will be useful.

    • @rabomeister
      @rabomeister ปีที่แล้ว

      RLHF. not directly related, but check it, i think you will be interested.

  • @AshfaqueAzad-z1j
    @AshfaqueAzad-z1j ปีที่แล้ว

    SystemMessage, AIMessage, would these work for LLAMA2 too? Since it's syntax requires [INST] and [/INST]

  • @gjsxnobody7534
    @gjsxnobody7534 ปีที่แล้ว

    Wait wait. but.. you didn't tell us what to do with this new model now.. so, it's just sitting in Terminal, But, how do we add it to our websites? How to embedd into a chat that is sitting on a wordpress site? Secondly, what if it's being used, and then it answers incorrectly, how to reach it things, or list the items it got wrong to retrain?

  • @Aman-qv3jw
    @Aman-qv3jw 5 หลายเดือนก่อน

    Openai doesnot provide free api keys anymore so what alternative can i use?

  • @swetharangaraj4521
    @swetharangaraj4521 ปีที่แล้ว

    what is the process if i what to query chat from cloud mangoDB using llm and RAG

  • @alexanderacevedo962
    @alexanderacevedo962 ปีที่แล้ว

    ¿Se puede usar para el rastreo recursivo como base de conocimiento de una dirección URL?

  • @SmitModh-d4z
    @SmitModh-d4z ปีที่แล้ว

    updated version datasets==2.10.1 to datasets==2.15.0
    to remove the warning
    embeddings = OpenAIEmbeddings(chunk_size=1)
    vectorstore = Pinecone(
    index, embeddings, text_field
    )

  • @vijayvaradarajan4015
    @vijayvaradarajan4015 10 หลายเดือนก่อน

    You are talking about RAG and in your query you ask "What is special about Llama2". A better example would have been "RAG points to company specific data - say HR corpus". Query then should be company specific "Give me list of company holidays". That Llama2 related query is misleading.

  • @theneumann7
    @theneumann7 10 หลายเดือนก่อน

    👌🏼

  • @JohnD-u4c
    @JohnD-u4c 11 หลายเดือนก่อน

    Didn't know Jack Leach was into Machine Learning

    • @jamesbriggs
      @jamesbriggs  11 หลายเดือนก่อน

      I’m not going to play cricket all day every day

  • @caiyu538
    @caiyu538 ปีที่แล้ว

    Great

  • @ControlTheGuh
    @ControlTheGuh ปีที่แล้ว

    The problem is that all the vector databases suck in terms of retrieval

  • @OminousGrymm
    @OminousGrymm ปีที่แล้ว

    with the brain metaphor and the idea of being able to manage someone else's brain and plug in the knowledge needed, you came *this close* to using the Matrix ' I know Kung Fu! ' scene. Which seems pretty much the right metaphor for RAG... th-cam.com/video/6vMO3XmNXe4/w-d-xo.html (15 second snip)

    • @jamesbriggs
      @jamesbriggs  ปีที่แล้ว +1

      nice haha - matrix kung fu scene will be making an appearance the next time I talk about RAG

  • @toannn6674
    @toannn6674 ปีที่แล้ว

    Open AI GPT 3-5 must pay. so can you give solution free?

    • @Lampshadx
      @Lampshadx ปีที่แล้ว

      You could host your own open source LLM (llama2-7b-chat for example) if your hardware supports hosting it.

  • @rabomeister
    @rabomeister ปีที่แล้ว

    Your videos are great. but it seems like you are repeating your videos in somehow. As I believe that you are a respectable engineer and/or scientist with teaching abilities, I would like to see you are diving deeper into more different subjects. I say that because I am really uncomfortable about the current topic, where all of the people are trying to teach "Creating a Chatbot" for discord coders. I could implement a LLama model just by using your videos, despite that you and I do not share anything in common except Langchain in our codes. However, I spent hours and hours, finally did that.
    Therefore, I do not think that there is a need for another similar video. To emphasize, I am saying this because I see you as a valuable teacher, and you are not a full-time youtuber. Therefore, if you are spending your free time, I'd like to see different subjects and areas so that I can dive into them with a trustable guide.

    • @jamesbriggs
      @jamesbriggs  ปีที่แล้ว

      yes for sure, the LLM + RAG videos do well so I do talk about them often - but I try and talk about other things too, like deploying open LLMs, guardrails, agents etc - and I will do more in the space of LangChain, hybrid vectors, etc very soon
      But I'd love to hear what you'd be interested in seeing me talk about?

    • @rabomeister
      @rabomeister ปีที่แล้ว

      @@jamesbriggs I already suggested GPU-side of LLMs as I remember. Also, I guess the prompts are wrong templates in your videos, so it would be great if you can take a look at it, because I found the prompt template but could not figure out how to use it in code :)

    • @jacob----
      @jacob---- ปีที่แล้ว

      Could you please make a video about how to use the map reduce and refine Langchain features and show a use case where you grab chunks of data and then use map reduce on them? @@jamesbriggs

  • @byt3w4rri0r
    @byt3w4rri0r ปีที่แล้ว

    5min here 😂

  • @TheOleMissippian
    @TheOleMissippian 11 หลายเดือนก่อน

    .

  • @raiyanrazi1144
    @raiyanrazi1144 ปีที่แล้ว +1

    Speak Fast please. I slept in the middle of video.

    • @jamesbriggs
      @jamesbriggs  ปีที่แล้ว +2

      TH-cam has x1.25 or x1.5 playback speed in bottom toolbar

  • @drivebuss8079
    @drivebuss8079 9 หลายเดือนก่อน

    long wastful explination, goo directly to the point please.

  • @klammer75
    @klammer75 ปีที่แล้ว +1

    Would really be awesome to see the same thing implemented without the use of langchain and only relying on pinecone and the openai function calling approach. I’m really trying to get a more native architecture built and don’t want to be constrained to the langchain library if possible. Amazing video as always though James!🥳🦾😎

    • @jamesbriggs
      @jamesbriggs  ปีที่แล้ว +1

      Oh this is a cool idea actually, will put something together

    • @mcdaddiest
      @mcdaddiest ปีที่แล้ว

      @jamesbriggs I've been using an implementation that does this for the last 3-4 months (personal project). Feel free to reach out for a collab or even just a mind share.

  • @TheCloudShepherd
    @TheCloudShepherd ปีที่แล้ว