Michael AI
Michael AI
  • 17
  • 18 868
Enhancing RAG Architecture with Long-Term Memory
Enhancing RAG Architecture with Long-Term Memory: Simulating a Virtual AI Assistant
Source code: github.com/msuliot/rag-ltm-demo.git
In the rapidly advancing field of artificial intelligence, integrating long-term memory into Retrieval-Augmented Generation (RAG) architectures marks a significant leap forward. This enhancement enables virtual AI assistants to provide more intelligent and context-aware responses by retaining and utilizing knowledge over extended periods.
With long-term memory, AI systems can simulate human-like interactions, remembering past conversations and user preferences, leading to more personalized and efficient experiences. This capability is particularly beneficial for applications such as customer support, personal assistants, and educational tools, where continuity and a deep understanding of user history are essential.
The enhanced RAG architecture employs advanced memory management techniques, allowing the AI to dynamically store, retrieve, and update information. This enables the AI to recall previous interactions, learn from new data, and adapt its responses, resulting in a more intuitive and responsive virtual assistant that continuously improves.
Practically, this innovation empowers AI systems to handle complex queries, provide detailed explanations, and engage in meaningful ongoing dialogues. The long-term memory component ensures the system maintains context between sessions, making it invaluable for tasks requiring sustained attention and deep contextual understanding.
Enhancing RAG architecture with long-term memory moves us closer to creating truly intelligent virtual AI assistants capable of understanding, learning, and evolving with their users.
We are going to explore a fascinating capability of AI: long-term memory for your virtual assistant. Imagine an AI that remembers our past conversations not to push ads or intrude on privacy, but to enhance our interactions in a meaningful way. In this video, we'll delve deep into how we can integrate long-term memory into the RAG architecture, transforming how AI understands and responds to us.
To begin, we start with a simulated login process to establish a profile ID. This ID is pivotal as it allows us to store conversations securely in our systems. Once logged in, we prompt for questions from the user. These questions are processed to generate embeddings, which are then sent to both our vector database and long-term memory repository.
Here's where the magic happens: If our long-term memory contains relevant information related to the question asked, it enriches the response. This integration of past interactions ensures that each answer is not only accurate but also personalized to the user's history and preferences.
Our system combines various elements: profile information, both short-term and long-term memories, and data from Pinecone, our vector database. This comprehensive approach enables ChatGPT, our AI engine, to deliver nuanced and contextually relevant answers.
Once ChatGPT provides an answer, we store it in short-term memory for immediate recall and display. But that's not all-after concluding the conversation, we save a summarized version of the entire interaction into long-term memory. This ensures that future interactions benefit from past exchanges, creating a more seamless and informed user experience.
This video builds upon concepts explored in previous installments, particularly focusing on the integration of local and web data extraction within the RAG architecture. If you've followed along or have set up your vector database, you're well-prepared to explore this next step in AI development.
Throughout the demonstration, we utilize Pinecone and Mongo databases extensively. Pinecone serves as our robust vector database, housing the indexed data crucial for quick and accurate responses. Meanwhile, Mongo stores and manages our long-term memory profiles, ensuring that each user's interactions are securely archived and accessible.
In terms of implementation, our system is designed as a proof of concept. While it showcases the potential of integrating long-term memory into AI assistants, further refinements and optimizations would be needed for production environments. The main script orchestrates the login process, profile retrieval, and conversation flow, offering a clear path for developers to adapt and expand upon.
In conclusion, what we're witnessing is not just a technological advancement but a glimpse into the future of AI-driven interactions. This prototype lays the groundwork for virtual AI assistants capable of handling complex queries across multiple platforms-text, chat, and voice-transforming how businesses and individuals engage with information.
มุมมอง: 244

วีดีโอ

Enhancing ChatGPT with Long-Term Memory
มุมมอง 100หลายเดือนก่อน
github.com/msuliot/long-term-memory.git I'm super excited to show you how to add long-term memory to our chat system. This means that every time you return, it will remember all the previous conversations you've had, making it better at answering your questions. Trust me, this is going to be awesome, and you're going to love it! To give you a quick overview, I define short-term memory as the cu...
Building a RAG Architecture: Local Files and Web Data Extraction - Pinecone, Mongo and ChatGPT
มุมมอง 2.4K2 หลายเดือนก่อน
Welcome to our comprehensive guide on RAG (Retrieval-Augmented Generation) architecture! In this video, we'll take you step-by-step through the entire process, from extracting data from your local computer, file servers, and company websites to embedding this information into a Pinecone vector database and storing original contents in MongoDB. We'll also build an application that leverages this...
Amazon Bedrock - Getting Started with AWS
มุมมอง 1915 หลายเดือนก่อน
Michael-AI: michael-ai.com Github: github.com/msuliot/bedrock.git In this video, we take a deep dive into Amazon Bedrock, a remarkable service provided by AWS that grants access to AI models. We kick off this exciting journey with the configuration of AWS, ensuring we have the right groups, roles, and permissions in place. We then navigate through Bedrock, unlocking access to multiple AI models...
Retrieval Augmented Generation (RAG) with Confluence, Pinecone, and ChatGPT
มุมมอง 1.4K5 หลายเดือนก่อน
Michael AI GitHub github.com/msuliot/rag.git Embarking on a journey through the realms of Retrieval Augmented Generation (RAG), I found myself delving into an intricate process that seamlessly integrates databases with the power of artificial intelligence. Today, I’m excited to share my exploration into how we can utilize Confluence, a collaborative platform, in tandem with Pinecone, a vector d...
Jupyter - ChatGPT Fine-Tuning
มุมมอง 16110 หลายเดือนก่อน
github.com/msuliot/jupyter_fine_tuning In this video, I provides a companion guide to a Jupyter Notebook. I start by directing viewers to GitHub to download the necessary files. Then, I instruct viewers to navigate to Jupyter.org and click on "try" to access JupyterLab. I demonstrate how to upload the downloaded files, including a data JSON lines file and the Jupyter Notebook itself. Alternativ...
Developing a Custom Chatbot with ChatGPT Fine-Tuning & React (8 minutes)
มุมมอง 1.4K11 หลายเดือนก่อน
michael-ai.com github.com/msuliot/ai-api-demo github.com/msuliot/ai-react-demo Your Chatbot - Your Data In today's video, I dive deep into building a tangible application harnessing OpenAI's fine-tuning, tailored specifically for your chatbot and your unique data. Think of it as adding a personal touch to the already brilliant ChatGPT 3.5. If you missed the first part, we delved into creating a...
A Game Changer for Businesses: A Step-by-Step Guide to Fine-Tuning ChatGPT
มุมมอง 27511 หลายเดือนก่อน
michael-ai.com github.com/msuliot/open_ai_fine_tuning In the video, the presenter emphasizes the significance of the recent OpenAI update on August 22nd, concerning the fine-tuning of ChatGPT, which is a game-changer for the business community. This update enables businesses to customize ChatGPT using their data and business knowledge, resulting in a comprehensive model tailored to a company's ...
Hugging Face - Text to Image - Getting started in 4 mins
มุมมอง 2.4Kปีที่แล้ว
michael-ai.com github.com/msuliot/huggingface_text_to_image.git Introduction to HuggingFace: - HuggingFace is at the forefront of the AI and natural language processing revolution. - It offers a vast variety of pre-trained models and tools like transformers, tokenizers, and datasets. - The platform is designed to make AI easily accessible, allowing individuals to avoid starting from scratch. - ...
Hugging Face - Text to Speech - Getting started in 5 minutes
มุมมอง 6Kปีที่แล้ว
michael-ai.com github.com/msuliot/huggingface_text_to_speech.git In this video, the focus is primarily on coding and leveraging the HuggingFace platform. HuggingFace is recognized as a groundbreaking force in the world of natural language processing (NLP) and artificial intelligence (AI). It offers a comprehensive suite of pre-trained models and essential tools such as transformers, tokenizers,...
Hugging Face - Question & Answering - Getting started in 7 minutes
มุมมอง 268ปีที่แล้ว
michael-ai.com github.com/msuliot/huggingface_question_answering.git In this video, we dive deep into the HuggingFace platform, an innovative hub that's propelling advances in natural language processing and AI. If you're a novice or an expert, here's what you'll gain from this guide: 1. Introduction to HuggingFace: - HuggingFace is a trailblazer in AI, offering a vast array of pre-trained mode...
Hugging Face - Object Detection Model - Getting started in 7 minutes
มุมมอง 1.5Kปีที่แล้ว
michael-ai.com github.com/msuliot/huggingface_object_detection.git *Introduction*: This video introduces the viewer to HuggingFace, a leading company in the AI sector, specifically in natural language processing. HuggingFace offers various pre-trained models, tools like transformers, tokenizers, and datasets to make AI accessible and straightforward. The platform promotes collaboration, transpa...
Hugging Face - Summarization Model - Getting started in 6 minutes
มุมมอง 372ปีที่แล้ว
michael-ai.com github.com/msuliot/huggingface_summarization.git In this video, AI enthusiasts are introduced to HuggingFace, a pioneering AI company transforming the landscape of natural language processing. HuggingFace offers a comprehensive range of pre-trained AI models, datasets, and AI tools such as transformers and tokenizers. By providing these resources, it opens the door to AI technolo...
Mastering AI Interviews: Your Guide to Success in 7 minutes
มุมมอง 96ปีที่แล้ว
michael-ai.com If you are new to ChatGPT please watch my Getting Started video th-cam.com/video/99S6BLFZntc/w-d-xo.html AI is increasingly being used by employers for interviews and resume screenings. Applicant Tracking Systems (ATS) have been around since the 1990s, but with the addition of AI, the experience is taken to a new level. AI can analyze text-based, audio, and video interactions dur...
ChatGPT - Understanding Prompts in 9 minutes
มุมมอง 482ปีที่แล้ว
michael-ai.com In this 9 min insightful video, the presenter offers an introduction to the powerful AI tool, ChatGPT, an exemplary AI product. He sets the stage by mentioning that ChatGPT's AI-based knowledge is current only up until September 2021, which means the AI won't know about events or information from a later date. Next, he delves into a detailed AI demonstration of the short-term mem...
Getting started with AI and ChatGPT in 12 minutes.
มุมมอง 146ปีที่แล้ว
Getting started with AI and ChatGPT in 12 minutes.
Code Lab: Simple Chatbot using Llama Index
มุมมอง 1.6Kปีที่แล้ว
Code Lab: Simple Chatbot using Llama Index

ความคิดเห็น

  • @monostechhelp1143
    @monostechhelp1143 4 วันที่ผ่านมา

    Thankyou so much for this, i am going to make a local rag for my company's confluence wiki. This was helpful

    • @Michael-AI
      @Michael-AI 3 วันที่ผ่านมา

      I’m glad it was helpful. I hope your proof of concept is very beneficial to your company.

  • @TM-cb2te
    @TM-cb2te 10 วันที่ผ่านมา

    how would you go about getting a picture of say a specific dog reading a book? like if you uploaded a picture of the dog you wanted a picture of reading a book, what library would you use?

  • @arel3708
    @arel3708 15 วันที่ผ่านมา

    your code is clean and I got it to work within a few hours. I never believed it would be that easy to create a chatbot based on a given wiki.

    • @Michael-AI
      @Michael-AI 15 วันที่ผ่านมา

      Glad it was helpful

  • @duyhung89
    @duyhung89 28 วันที่ผ่านมา

    Hello Michael, I love this video quite much and watch it twice, but with no code like me, it should be quite difficult to build it! Could you make another video for kind of no code like me? I thing many many people will love that! Because It's super cool! Thanks in advance.

  • @autohmae
    @autohmae หลายเดือนก่อน

    Maybe not the biggest innovation in ML, but a nice addition I hadn't seen people mention yet.. I'm only half way deep into ML. But probably a very good user facing feature.

    • @Michael-AI
      @Michael-AI หลายเดือนก่อน

      Yeah, no argument. I think it’s one of those items that go on the innovation stack that can greatly increase repetitive tasks and personalizing customer service.

  • @SanjaySingh-gj2kq
    @SanjaySingh-gj2kq หลายเดือนก่อน

    Good one, Mike.

    • @Michael-AI
      @Michael-AI หลายเดือนก่อน

      Thanks

  • @alitomix
    @alitomix หลายเดือนก่อน

    I tried this but by using ChromaDB instead of Pinecode to have it locally, and llama3 instead chatgpt... can you please help me? I wanna try a full local RAG without API KEYs or external services.... my problem is trying to retrieve the index from the vectorstore

    • @Michael-AI
      @Michael-AI หลายเดือนก่อน

      Hey, I get your point about running everything locally. I'm currently finishing up another video, but once that's done, I'll make one that covers running everything locally with vector databases and Llama.

    • @alitomix
      @alitomix หลายเดือนก่อน

      @@Michael-AI That would be so helpful, thank you!

  • @mdsuzaul2477
    @mdsuzaul2477 หลายเดือนก่อน

    Boost your youTube channel’s visibillty and Grow your audience with my expert seo services!+++ Optimize titles, tags, and descriptions for maximum reach. Get discovered, gain subscribers, and watch your views soar! Start today and transform your channel!+++

  • @SanjaySingh-gj2kq
    @SanjaySingh-gj2kq หลายเดือนก่อน

    Hi Mike, amazing stuff covering very important aspects of an end-to-end RAG application. It was a great experience going through each git project and executing them with pinecone, MongoDB, and OpenAI. With minor changes, they all worked fine. Thanks for the video and codebase. Subscribed!

    • @Michael-AI
      @Michael-AI หลายเดือนก่อน

      Well, I’m glad it was helpful and got you started.

  • @user-lj1cr9yu1c
    @user-lj1cr9yu1c 2 หลายเดือนก่อน

    good stuff! your delivery of the content was smooth...kind of like blades of grass on a flowing stream.

    • @Michael-AI
      @Michael-AI 2 หลายเดือนก่อน

      I appreciate that!

  • @ukmevy
    @ukmevy 2 หลายเดือนก่อน

    Thanks a lot for the hands-on demonstration and the clear explanation! :)

    • @Michael-AI
      @Michael-AI 2 หลายเดือนก่อน

      Very glad to hear that it was helpful

  • @dishti
    @dishti 2 หลายเดือนก่อน

    I have fine tuned this detr model on a custom dataset but so far for only 50 epochs. The highest confidence score i've gotten so far was a 0.7 during inference test on google colab. However, when using the inference api for my fine tuned model that i uploaded on huggingface, for the same image with confidence score of 0.7, the api output was [] Can it be possible that the inference api only shows detections of scores above 0.9? If yes, is there any way i can lower its confidence threshold?

  • @tonyhill5966
    @tonyhill5966 2 หลายเดือนก่อน

    Nice work!!!

    • @Michael-AI
      @Michael-AI 2 หลายเดือนก่อน

      Hope it was helpful

  • @jonathandonda2700
    @jonathandonda2700 2 หลายเดือนก่อน

    Very good content, thanks.

    • @Michael-AI
      @Michael-AI 2 หลายเดือนก่อน

      Thanks

  • @nro337
    @nro337 2 หลายเดือนก่อน

    very cool!

    • @Michael-AI
      @Michael-AI 2 หลายเดือนก่อน

      Thanks! I’m glad you liked it.

  • @rokunuzjahanrudro7571
    @rokunuzjahanrudro7571 2 หลายเดือนก่อน

    This is awesome sir

    • @Michael-AI
      @Michael-AI 2 หลายเดือนก่อน

      Thanks

  • @VTC10English
    @VTC10English 2 หลายเดือนก่อน

    thank you, great content!

    • @Michael-AI
      @Michael-AI 2 หลายเดือนก่อน

      Thanks, glad you liked it.

  • @WesSimpson-mx1fn
    @WesSimpson-mx1fn 2 หลายเดือนก่อน

    this is so badass.

    • @Michael-AI
      @Michael-AI 2 หลายเดือนก่อน

      thanks for thinking it was badass 👍

  • @israeabdelbar8994
    @israeabdelbar8994 2 หลายเดือนก่อน

    Thank you a bunch for this amazing video and explanation!

    • @Michael-AI
      @Michael-AI 2 หลายเดือนก่อน

      Glad it was helpful!

  • @rhuanbarros
    @rhuanbarros 3 หลายเดือนก่อน

    Thanks for the video! Also, as feedback, I found the music during the video a bit annoying because it interferes with my understanding of your speech. I think the music is cool at the beginning, but it distracts me as the video progresses.

    • @Michael-AI
      @Michael-AI 3 หลายเดือนก่อน

      Thanks for the tip! Those are one of my early videos, later videos I learned… thanks 👍

  • @thebillowydiaz9976
    @thebillowydiaz9976 4 หลายเดือนก่อน

    Hello, can you help me with a practice, where can I contact you, it's simple.

  • @tiissick
    @tiissick 4 หลายเดือนก่อน

    This is great, thanks so much for the content and the Git repository!

    • @Michael-AI
      @Michael-AI 4 หลายเดือนก่อน

      I’m glad I got you started

  • @vctorroferz
    @vctorroferz 4 หลายเดือนก่อน

    Amazing Michael ! question, is this working with confluence on premise? thanks again for such nice content!

    • @Michael-AI
      @Michael-AI 4 หลายเดือนก่อน

      Yes, the demo was on confluence on premise. I haven’t hit the cloud service, I will in the future, i’m sure they’ll be a couple of changes.

  • @rasan2kun
    @rasan2kun 5 หลายเดือนก่อน

    Hello micheal, any ideas on how to work on TTS for foreign languages that don't have models already?

  • @paulvazo3264
    @paulvazo3264 5 หลายเดือนก่อน

    Thank you so much, excellent video!!

    • @Michael-AI
      @Michael-AI 5 หลายเดือนก่อน

      Glad it was helpful!

  • @tootemakan
    @tootemakan 5 หลายเดือนก่อน

    I am getting an error which basically says that it can not fetch confluence

    • @Michael-AI
      @Michael-AI 5 หลายเดือนก่อน

      Cloud or local version of Confluence

  • @user-lj1cr9yu1c
    @user-lj1cr9yu1c 5 หลายเดือนก่อน

    good_stuff, Michael!

    • @Michael-AI
      @Michael-AI 5 หลายเดือนก่อน

      Thank you Thank_you

  • @user-lj1cr9yu1c
    @user-lj1cr9yu1c 5 หลายเดือนก่อน

    Good stuff Michael. I learned something new today.

    • @Michael-AI
      @Michael-AI 5 หลายเดือนก่อน

      Glad to hear it!

  • @pavan5257
    @pavan5257 5 หลายเดือนก่อน

    Awesome Mike, Cool Stuff!

    • @Michael-AI
      @Michael-AI 5 หลายเดือนก่อน

      Glad you enjoyed it

  • @debabarnachatterjee3803
    @debabarnachatterjee3803 6 หลายเดือนก่อน

    thank for sharing. This really helped me.Please upload more videos for confluence loader

    • @Michael-AI
      @Michael-AI 6 หลายเดือนก่อน

      I actually do have one coming up on a RAG architecture. Calls confluence and pulls in all the information then stores embeddings in a pinecone vector database and send it off to open AI for a NLP response. Should enjoy it.

  • @stuartrichards54
    @stuartrichards54 7 หลายเดือนก่อน

    this was super helpful. thanks Michael!

    • @Michael-AI
      @Michael-AI 6 หลายเดือนก่อน

      Glad it was helpful!

  • @kreptogaming5201
    @kreptogaming5201 8 หลายเดือนก่อน

    Hi Michael love your content on fine tuning, is there an application you have made in order to make the creation of the jsonL data easier, did you use ai to generate the example responses, did you handwrite the json?

    • @Michael-AI
      @Michael-AI 7 หลายเดือนก่อน

      Sorry for the late response :( in a productive setting the information you get to fine-tune your model should come from a data source of some type. So in theory, you'd have to create a basic template that loops through your records creating the JSON lines file. I'll think about putting a video together to show how that's done. Thanks for the suggestion.

  • @MarlonSalewski
    @MarlonSalewski 8 หลายเดือนก่อน

    Could you show how to train a Dataset? I’ve got one, but there's no finished Model available.

    • @Michael-AI
      @Michael-AI 7 หลายเดือนก่อน

      Sorry for the late response... I will create a video on this. This has been requested multiple times. Thank you again for watching the video.

  • @BoBozlo
    @BoBozlo 8 หลายเดือนก่อน

    Hi, thanks for the great tutorial! Do you know how the systems messages from the create_model/data.jsonl file and the system message from the _api.py file (line 18) are related? Do they have to be the same message, or can they be different? Do these messages have to be logically related?

    • @Michael-AI
      @Michael-AI 8 หลายเดือนก่อน

      yes, the data.jsonl file is used to only train the model, basically the fine tuning part of it. Here is a detail explanation of the fine-tuning process that may help. th-cam.com/video/Q1VfJwLk3hg/w-d-xo.html Line 18 is just the prompt when you ask a question, prompt engineering, how do you want to tell ChatGPT to act with all your questions you were going to ask. This prompt should change based on how you tuned the model, and what you're looking to get out of questions. Hopefully that helps

  • @hoduchoa1727
    @hoduchoa1727 8 หลายเดือนก่อน

    It's response too slow, can you tell me what's the reason? thank you verrymuch

    • @Michael-AI
      @Michael-AI 8 หลายเดือนก่อน

      I'll check it out this weekend. I know this video was created back in early July and there has been a lot of advancements.

  • @learncommerce.mp4
    @learncommerce.mp4 9 หลายเดือนก่อน

    Hi Michael, Is there a way to get in touch with you? I need help developing my AI application. Thankyou in advance!

    • @Michael-AI
      @Michael-AI 8 หลายเดือนก่อน

      You can always reach me on LinkedIn www.linkedin.com/in/suliot also note, because of conflict of interest I cannot help in actual coding, but I have no problem giving advice.

  • @user-yi4sz7fh6z
    @user-yi4sz7fh6z 9 หลายเดือนก่อน

    helped a lot thankss

    • @Michael-AI
      @Michael-AI 9 หลายเดือนก่อน

      Glad it helped

  • @antdx316
    @antdx316 11 หลายเดือนก่อน

    I've done it but it just shows [] ??

    • @Michael-AI
      @Michael-AI 11 หลายเดือนก่อน

      Usually, if you get an empty array, it did not detect any object it was trained on. Validate with one of the images that was included on the GitHub. For more image try free-images.com/

    • @antdx316
      @antdx316 11 หลายเดือนก่อน

      @@Michael-AI I used the better one, I think its resnet101 and it worked instead of 50

  • @emilbalis5010
    @emilbalis5010 11 หลายเดือนก่อน

    Hello Michal. Thank you so much for the content and willingness to share it. Great video!

    • @Michael-AI
      @Michael-AI 11 หลายเดือนก่อน

      I'm glad you found it beneficial. I hope it propels your progress in AI advancement.

  • @chimbosm
    @chimbosm 11 หลายเดือนก่อน

    Great stuff, worth the watch.

    • @Michael-AI
      @Michael-AI 11 หลายเดือนก่อน

      Thanks

  • @krishnan828
    @krishnan828 ปีที่แล้ว

    Please reduce background music!

    • @Michael-AI
      @Michael-AI ปีที่แล้ว

      Will do in the future. Thanks

  • @carlosandresgomezdaza9255
    @carlosandresgomezdaza9255 ปีที่แล้ว

    hi michael, does this work in another languages?

    • @Michael-AI
      @Michael-AI ปีที่แล้ว

      Yes, to validate I tested a PDF file that was in Spanish. The cool part is that the system will respond in the language you asked the question in👍

    • @SuvorajBiswas
      @SuvorajBiswas 11 หลายเดือนก่อน

      @@Michael-AI Internally it is using Embeddings and similarity search to find the relevant contents so language is not a problem in this kind of scenario

  • @derzeitzeuge9233
    @derzeitzeuge9233 ปีที่แล้ว

    Just saw a view of your videos and I have to say thank you for sharing these information with us.

    • @Michael-AI
      @Michael-AI ปีที่แล้ว

      Glad you like them!

  • @jimjones26
    @jimjones26 ปีที่แล้ว

    Interesting, I have been playing around with AI for a few months now. Looking forward to your videos.

    • @Michael-AI
      @Michael-AI ปีที่แล้ว

      I hope your AI journey is exciting and rewarding.