RAG + Langchain Python Project: Easy AI/Chat For Your Docs

  • Published May 18, 2024
  • Learn how to build a "retrieval augmented generation" (RAG) app with Langchain and OpenAI in Python.
    You can use this to create chat-bots for your documents, books or files. You can also use it to build rich, interactive AI applications that use your data as a source.
    👉 Links
    🔗 Code: github.com/pixegami/langchain...
    📄 (Sample Data) AWS Docs: github.com/awsdocs/aws-lambda...
    📄 (Sample Data) Alice in Wonderland: www.gutenberg.org/ebooks/11
    📚 Chapters
    00:00 What is RAG?
    01:36 Preparing the Data
    05:05 Creating Chroma Database
    06:36 What are Vector Embeddings?
    09:38 Querying for Relevant Data
    12:47 Crafting a Great Response
    16:18 Wrapping Up
    #pixegami #python

Comments • 272

  • @colegoddin9034
    @colegoddin9034 4 หลายเดือนก่อน +35

    Easily one of the best explained walk-throughs of LangChain RAG I’ve watched. Keep up the great content!

    • @pixegami
      @pixegami  3 หลายเดือนก่อน

      Thanks! Glad you enjoyed it :)

  • @wtcbd01
    @wtcbd01 หลายเดือนก่อน

    Thanks so much for this. Your teaching style is incredible and the subject is well explained.

  • @gustavojuantorena
    @gustavojuantorena 6 หลายเดือนก่อน +6

    Your channel is one of the best on YouTube. Thank you. Now I'll go watch the video.

  • @elijahparis3719
    @elijahparis3719 5 หลายเดือนก่อน +17

    I never comment on videos, but this was such an in-depth and easy to understand walkthrough! Keep it up!

    • @pixegami
      @pixegami  5 หลายเดือนก่อน

      Thank you :) I appreciate you commenting, and I'm glad you enjoyed it. Please go build something cool!

  • @gustavstressemann7817
    @gustavstressemann7817 3 หลายเดือนก่อน +2

    Straight to the point. Awesome!

    • @pixegami
      @pixegami  3 หลายเดือนก่อน

      Thanks, I appreciate it!

  • @MattSimmonsSysAdmin
    @MattSimmonsSysAdmin 4 หลายเดือนก่อน +3

    Absolutely epic video. I was able to follow along with no problems by watching the video and following the code. Really tremendous job, thank you so much! Definitely subscribing!

    • @pixegami
      @pixegami  4 หลายเดือนก่อน

      Thank you for your comment! I'm really glad to hear it was easy to follow - well done! Hope you build some cool stuff with it :)

  • @theneumann7
    @theneumann7 หลายเดือนก่อน

    Perfectly explained👌🏼

  • @lalalala99661
    @lalalala99661 13 วันที่ผ่านมา +1

    Clean, structured, easy-to-follow tutorial. Thank you for that.

    • @pixegami
      @pixegami  6 วันที่ผ่านมา

      Thank you! Glad you enjoyed it!

  • @kwongster
    @kwongster 2 หลายเดือนก่อน +1

    Awesome walkthrough, thanks for making this 🎉

    • @pixegami
      @pixegami  2 หลายเดือนก่อน

      Thank you! Glad you liked it.

  • @geoffhirst5338
    @geoffhirst5338 หลายเดือนก่อน

    Great walkthrough; now all that's needed is a revision to cope with the changes to the LangChain namespaces.

    • @niklasvilnersson24
      @niklasvilnersson24 28 วันที่ผ่านมา

      What changes have been made? I can't get this to work :-(

  • @jim93m
    @jim93m 3 หลายเดือนก่อน +2

    Thank you, that was a great walk through very easy to understand with a great pace. Please make a video on LangGraph as well.

    • @pixegami
      @pixegami  3 หลายเดือนก่อน

      Thank you! Glad you enjoyed it. Thanks for the LangGraph suggestion. I hadn't noticed that feature before; tech seems to move fast in 2024 :)

  • @insan2080
    @insan2080 9 วันที่ผ่านมา +1

    This is what I was looking for! Thanks for the simplest explanation. There are some adjustments needed to the codebase because of library updates, but it doesn't matter. Keep it up!

    • @pixegami
      @pixegami  6 วันที่ผ่านมา

      You're welcome, glad it helped! I try to keep the code accurate, but these libraries update/change really fast. I think I'll need to lock/freeze package versions in future videos so the code doesn't drift.

  • @voulieav
    @voulieav 3 หลายเดือนก่อน +1

    Epic.
    Thank you for sharing this.

    • @pixegami
      @pixegami  3 หลายเดือนก่อน

      Thank you!

  • @MrValVet
    @MrValVet 6 หลายเดือนก่อน +3

    Thank you for this. Looking forward to tutorials on using Assistants API.

    • @pixegami
      @pixegami  5 หลายเดือนก่อน +1

      You're welcome! And great idea for a new video :)

  • @StringOfMusic
    @StringOfMusic 9 วันที่ผ่านมา +1

    Fantastic, clear, concise and to the point. thanks so much for your efforts to share your knowledge with others.

    • @pixegami
      @pixegami  6 วันที่ผ่านมา

      Thank you, I'm glad you enjoyed it!

  • @michaeldimattia9015
    @michaeldimattia9015 5 หลายเดือนก่อน +2

    Great video! This was my first exposure to ChromaDB (worked flawlessly on a fairly large corpus of material). Looking forward to experimenting with other language models as well. This is a great stepping stone towards knowledge based expansions for LLMs. Nice work!

    • @pixegami
      @pixegami  5 หลายเดือนก่อน

      Really glad to hear you got it to work :) Thanks for sharing your experience with it as well - that's the whole reason I make these videos!

  • @RZOLTANM
    @RZOLTANM หลายเดือนก่อน +1

    Really good. Thank very much sir. Articulated perfectly!

    • @pixegami
      @pixegami  หลายเดือนก่อน

      Thank you! Glad you enjoyed it :)

  • @narendaPS
    @narendaPS หลายเดือนก่อน +1

    This is the best tutorial I have ever seen on this topic, thank you so much. Keep up the good work. Immediately subscribed.

    • @pixegami
      @pixegami  หลายเดือนก่อน

      Glad you enjoyed it. Thanks for subscribing!

  • @ahmedamamou7221
    @ahmedamamou7221 หลายเดือนก่อน +1

    Thanks a lot for this tutorial! Very well explained.

    • @pixegami
      @pixegami  หลายเดือนก่อน

      Glad it was helpful!

  • @jasonlucas3772
    @jasonlucas3772 หลายเดือนก่อน +1

    This was excellent: easy to follow, has code, and very useful! Thank you.

    • @pixegami
      @pixegami  หลายเดือนก่อน

      Thank you, I really appreciate it!

  • @chrisogonas
    @chrisogonas หลายเดือนก่อน +1

    Well illustrated! Thanks

    • @pixegami
      @pixegami  26 วันที่ผ่านมา

      Thank you!

  • @elidumper52
    @elidumper52 2 หลายเดือนก่อน +1

    Super helpful, thank you!

    • @pixegami
      @pixegami  หลายเดือนก่อน

      Glad it was helpful!

  • @lucasboscatti3584
    @lucasboscatti3584 4 หลายเดือนก่อน +1

    Huge class!!

  • @thatoshebe5505
    @thatoshebe5505 3 หลายเดือนก่อน +1

    Thank you for sharing, this was the info I was looking for

    • @pixegami
      @pixegami  3 หลายเดือนก่อน

      Glad it was helpful!

  • @aiden9990
    @aiden9990 3 หลายเดือนก่อน +1

    Perfect thank you!

    • @pixegami
      @pixegami  3 หลายเดือนก่อน

      Glad it helped!

  • @MartinRodriguez-sx2tf
    @MartinRodriguez-sx2tf 24 วันที่ผ่านมา +1

    Very good, and looking forward to the next one 🎉

    • @pixegami
      @pixegami  23 วันที่ผ่านมา

      Thank you!

  • @mao73a
    @mao73a 15 วันที่ผ่านมา +1

    This was so informative and well presented. Exactly what I was looking for. Thank you!

    • @pixegami
      @pixegami  6 วันที่ผ่านมา

      You're welcome, glad you liked it!

  • @williammariasoosai1153
    @williammariasoosai1153 3 หลายเดือนก่อน +1

    Very well done! Thanks

    • @pixegami
      @pixegami  3 หลายเดือนก่อน

      Glad you liked it!

  • @erikjohnson9112
    @erikjohnson9112 5 หลายเดือนก่อน +1

    I too am quite impressed with your videos (this is my 2nd one). I have now subscribed and I bet you'll be growing fast.

    • @pixegami
      @pixegami  5 หลายเดือนก่อน +1

      Thank you! 🤩

  • @shapovalentine
    @shapovalentine 3 หลายเดือนก่อน +1

    Useful, Nice, Thank You 🤩🤩🤩

    • @pixegami
      @pixegami  3 หลายเดือนก่อน

      Glad to hear it was useful!

  • @israeabdelbar8994
    @israeabdelbar8994 3 หลายเดือนก่อน +2

    Very helpful video! Keep going, you are the best!
    Thank you very much. I am looking forward to seeing a video about a virtual assistant performing actions by communicating with other applications via an API.

    • @pixegami
      @pixegami  3 หลายเดือนก่อน

      Glad you enjoyed it! Thanks for the suggestion :)

    • @israeabdelbar8994
      @israeabdelbar8994 2 หลายเดือนก่อน

      You're welcome
      @@pixegami

  • @chandaman95
    @chandaman95 2 หลายเดือนก่อน +1

    Amazing video, thank you.

    • @pixegami
      @pixegami  หลายเดือนก่อน

      Thank you!

  • @kewalkkarki6284
    @kewalkkarki6284 4 หลายเดือนก่อน +1

    This is Amazing 🙌

    • @pixegami
      @pixegami  4 หลายเดือนก่อน

      Thank you! Glad you liked :)

  • @bcippitelli
    @bcippitelli 5 หลายเดือนก่อน +1

    thanks dude!

  • @basicvisual7137
    @basicvisual7137 2 หลายเดือนก่อน +1

    Finally a good LangChain video to understand it better. Do you have a video in mind on porting the code to a local LLM using Ollama and local embeddings?

  • @rikhavthakkar2015
    @rikhavthakkar2015 2 หลายเดือนก่อน +2

    Simply explained and with an engaging tone.
    I would also look for a use case where the source of the vector data is a combination of files (PDF, DOCX, Excel, etc.) along with some database (an RDBMS or a file-based database).

    • @pixegami
      @pixegami  หลายเดือนก่อน

      Thanks! That's a good idea too. You can probably achieve that by detecting what type of file you are working with, and then using a different parser (document loader) for that type. Langchain should have custom document loaders for all the most common file types.
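      For example, a rough sketch of that idea (loader classes are from langchain-community; some need extra packages such as pypdf, docx2txt, or unstructured, and names may differ between versions):

```python
# Rough sketch: pick a LangChain document loader based on the file extension.
from pathlib import Path

from langchain_community.document_loaders import (
    CSVLoader,
    Docx2txtLoader,
    PyPDFLoader,
    TextLoader,
    UnstructuredExcelLoader,
)

LOADERS = {
    ".pdf": PyPDFLoader,
    ".docx": Docx2txtLoader,
    ".xlsx": UnstructuredExcelLoader,
    ".csv": CSVLoader,
    ".md": TextLoader,
    ".txt": TextLoader,
}

def load_any(path: str):
    """Return LangChain Documents for any supported file type."""
    loader_cls = LOADERS.get(Path(path).suffix.lower(), TextLoader)
    return loader_cls(path).load()
```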

  • @tinghaowang-ei7kv
    @tinghaowang-ei7kv 28 วันที่ผ่านมา +1

    Nice, how pretty that is.

  • @jianganghao1857
    @jianganghao1857 10 วันที่ผ่านมา +1

    Great tutorial, very clear

    • @pixegami
      @pixegami  6 วันที่ผ่านมา

      Glad it was helpful!

  • @pojomcbooty
    @pojomcbooty หลายเดือนก่อน +3

    VERY well explained. thank you so much for releasing this level of education on youtube!!

    • @pixegami
      @pixegami  หลายเดือนก่อน +1

      Glad you enjoyed it!

  • @serafeiml1041
    @serafeiml1041 28 วันที่ผ่านมา +1

    you got a new subscriber. nice work

    • @pixegami
      @pixegami  26 วันที่ผ่านมา

      Thank you! Welcome :)

  • @stevenla2314
    @stevenla2314 วันที่ผ่านมา

    Love your videos. I was able to follow along and build my own RAG. Can you expand more on this series and explain RAPTOR retrieval and how to implement it?

  • @litttlemooncream5049
    @litttlemooncream5049 2 หลายเดือนก่อน +1

    helpful if I wanna do analysis on properly-organized documents

    • @pixegami
      @pixegami  2 หลายเดือนก่อน

      Yup! I think it could be useful for searching through unorganised documents too.

  • @shikharsaxena9989
    @shikharsaxena9989 16 วันที่ผ่านมา +1

    best explanation of rag

    • @pixegami
      @pixegami  6 วันที่ผ่านมา

      Thank you!

  • @PoGGiE06
    @PoGGiE06 หลายเดือนก่อน +3

    Great explanation. Perhaps one criticism would be using OpenAI's embedding library: I would rather not be locked into their ecosystem, and I believe that free alternatives exist that are perfectly good! But I would have loved a quick overview there.

    • @pixegami
      @pixegami  หลายเดือนก่อน +3

      Thanks for the feedback. I generally use OpenAI because I thought it was the easiest API for people to get started with. But actually I've received similar feedback where people just want to use open source (or their own) LLM engines.
      Feedback received, thank you :) Luckily with something like LangChain, swapping out the LLM engine (e.g. the embedding functionality) is usually just a few lines of code.
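      For example, a minimal sketch of that swap (package and class names follow langchain-openai / langchain-community and may differ by version; the Ollama option assumes a local Ollama server):

```python
# Minimal sketch: only the embedding object changes; the Chroma code stays the same.
from langchain_community.vectorstores import Chroma
from langchain_openai import OpenAIEmbeddings  # as used in the video

embeddings = OpenAIEmbeddings()

# To go open source instead, swap just these two lines:
# from langchain_community.embeddings import OllamaEmbeddings
# embeddings = OllamaEmbeddings(model="nomic-embed-text")

db = Chroma.from_texts(
    ["Alice was beginning to get very tired..."],  # your chunks go here
    embeddings,
    persist_directory="chroma",
)
```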

    • @PoGGiE06
      @PoGGiE06 หลายเดือนก่อน

      @@pixegami It's a pleasure :).
      Yes, everyone seems to be using OpenAI by default, because everyone is using ChatGPT. But there are lots of good reasons why one might not wish to get tied to OpenAI, Anthropic, or any other cloud-based provider, besides the mounting costs if one is developing applications using LLMs: e.g. data privacy/integrity, simplicity, reproducibility (e.g. ChatGPT is always changing and that is out of your control), in addition to a general suspicion of non-open-source frameworks whose primary focus is often (usually?) on wealth extraction, not solution provision. There is not enough good material out there on how to create a basic RAG with vector storage using a local LLM, something that is very practical with smaller models, e.g. Mistral, dolphincoder, Mixtral 8x7B, at least for putting together an MVP.
      Re: avoiding openAI:
      I've managed to use embed_model = OllamaEmbeddings(model="nomic-embed-text").
      I still get occasional 'openAI' related errors, but gather that Ollama has support for mimicking openAI now, including a 'fake' openAI key, so am looking into that as a fix.
      ollama.com/blog/windows-preview
      I also gather that with llama-cpp, one can specify model temperature and other configuration options, whereas with Ollama, one is stuck with the configuration used in the modelfile when the Ollama-compatible model is made (if that is the correct terminology). So I may have to investigate that.
      I'm currently using llama-index because I am focused on RAG and don't need the flexibility of langchain.
      Good tutorial in the llama-index docs: docs.llamaindex.ai/en/stable/examples/usecases/10k_sub_question/
      I'm also a bit sceptical that langchain isn't another attempt to 'lock you in' to an ecosystem that can then be monetised e.g. minimaxir.com/2023/07/langchain-problem/. I am still learning, so don't have a real opinion yet. Very exciting stuff! Kind regards.

  • @pampaniyavijay007
    @pampaniyavijay007 13 วันที่ผ่านมา +1

    This is a very simple and useful video for me 🤟🤟🤟

    • @pixegami
      @pixegami  6 วันที่ผ่านมา

      Thank you! I'm glad to hear that.

  • @user-md4pp8nv7u
    @user-md4pp8nv7u หลายเดือนก่อน +1

    Very great!! Thank you

    • @pixegami
      @pixegami  หลายเดือนก่อน

      Glad you liked it!

  • @seankim6080
    @seankim6080 2 หลายเดือนก่อน

    Thanks so much! This is super helpful for better understanding RAG. The only thing is I'm still not sure how to run the program that I cloned from your GitHub repository via the Windows terminal. I'll try on my own, but if you could provide any guidance or sources (YouTube links, anything like that) it would be much appreciated.

  • @mohanraman
    @mohanraman หลายเดือนก่อน

    This is an awesome video. Thank you!!! I'm curious how to leverage these technologies with structured data, like business data that's stored in tables. I'd appreciate any videos about that.

  • @matthewlapinta7388
    @matthewlapinta7388 14 ชั่วโมงที่ผ่านมา

    This video was pure gold. Really grateful for the concise and excellent walkthrough. I have two additional questions in regards to the metadata and resulting chunk reference displayed. Can you return a screenshot of the chunk/document referenced now that models are multimodal? Also a document title or ability to download such document would also be a cool feature. Thanks so much in advance!

  • @user-iz7wi7rp6l
    @user-iz7wi7rp6l 5 หลายเดือนก่อน +1

    First, thank you very much. Could you also show how to apply memory of various kinds?

    • @pixegami
      @pixegami  4 หลายเดือนก่อน +1

      Thanks! I haven't looked at how to use the Langchain memory feature yet so I'll have to work on that first :)

    • @user-iz7wi7rp6l
      @user-iz7wi7rp6l 4 หลายเดือนก่อน +1

      @@pixegami OK, I have implemented memory and other features as well, and also got it working on Windows after some monster errors. Thanks once again for the clear working code (used in production).
      Hope to see more in the future.

  • @FrancisRodrigues
    @FrancisRodrigues หลายเดือนก่อน +1

    That's the best and most reliable content about LangChain I've ever seen, and it only took 16 minutes.

    • @pixegami
      @pixegami  หลายเดือนก่อน +1

      Glad you enjoyed it! I try to keep my content short and useful because I know everyone is busy these days :)

    • @Shwapx
      @Shwapx หลายเดือนก่อน

      @@pixegami Hey, great work. Can we have an updated version with the new LangChain imports? It's throwing all kinds of import errors because they've changed.

  • @quengelbeard
    @quengelbeard 2 หลายเดือนก่อน +2

    Hi, by far the best video on Langchain - Chroma! :D
    Quick question: how would you update the Chroma database if you want to feed it more documents (while avoiding duplicates)?

    • @pixegami
      @pixegami  2 หลายเดือนก่อน

      Glad you liked it! Thank you. If you want to add (modify) the ChromaDB data, you should be able to do that after you've loaded up the DB:
      docs.trychroma.com/usage-guide#adding-data-to-a-collection
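      For example, a hedged sketch of avoiding duplicates by giving each chunk a deterministic ID (the ID scheme below is just an illustrative convention, not something from the video):

```python
# Sketch: skip chunks whose deterministic ID is already in the collection.
import hashlib

from langchain_community.vectorstores import Chroma
from langchain_openai import OpenAIEmbeddings

db = Chroma(persist_directory="chroma", embedding_function=OpenAIEmbeddings())

def chunk_id(source: str, index: int) -> str:
    return hashlib.sha256(f"{source}:{index}".encode()).hexdigest()

new_chunks = ["a new chunk of text"]                      # from your splitter
ids = [chunk_id("data/books/new_file.md", i) for i in range(len(new_chunks))]

existing = set(db.get(ids=ids)["ids"])                    # IDs already stored
to_add = [(i, c) for i, c in zip(ids, new_chunks) if i not in existing]
if to_add:
    db.add_texts(texts=[c for _, c in to_add], ids=[i for i, _ in to_add])
```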

  • @SantiYounger
    @SantiYounger 2 หลายเดือนก่อน +2

    Thanks for the video, this looks great, but I tried to implement it and it seems like the LangChain packages needed are no longer available. Has anyone had any luck getting this to work?
    Thanks

  • @AdandKidda
    @AdandKidda 2 หลายเดือนก่อน

    Hi, thanks for sharing such great knowledge.
    I have a use case:
    1. Can we perform some action (call an API) as the response?
    2. How can we use Mistral and open-source embeddings for this purpose?

  • @theobelen-halimi2862
    @theobelen-halimi2862 3 หลายเดือนก่อน +2

    Very clear video and tutorial! Good job! Just one question: is it possible to use an open-source model rather than OpenAI?

    • @pixegami
      @pixegami  3 หลายเดือนก่อน +1

      Yes! Check out this video on how to use different models other than OpenAI: th-cam.com/video/HxOheqb6QmQ/w-d-xo.html
      And here is the official documentation on how to use/implement different LLMs (including your own open source one) python.langchain.com/docs/modules/model_io/llms/

  • @corbin0dallas
    @corbin0dallas 19 วันที่ผ่านมา +1

    Great tutorial, thanks! My only feedback is that any LLM already knows everything about Alice in Wonderland.

    • @SongforTin
      @SongforTin 18 วันที่ผ่านมา +1

      You can create custom apps for businesses using their own documents = huge business opportunity, if it really works.

    • @pixegami
      @pixegami  6 วันที่ผ่านมา

      Yeah that's a really good point. What I really needed was a data-source that was easy to understand, but would not appear in the base knowledge of any LLM (I've learnt that now for my future videos).

  • @user-fj4ic9sq8e
    @user-fj4ic9sq8e 2 หลายเดือนก่อน

    Hello,
    thank you so much for this video.
    I have a question about summarization queries over LLM documents. For example, if the vector database has thousands of documents with a date property, can I ask the model how many documents I received in the last week?

  • @frederikklein1806
    @frederikklein1806 3 หลายเดือนก่อน +1

    This is a really good video, thank you so much! Out of curiosity, why do you use iterm2 as a terminal and how did you set it up to look that cool? 😍

    • @pixegami
      @pixegami  3 หลายเดือนก่อน +1

      I use iTerm2 for videos because it looks and feels familiar for my viewers. When I work on my own, I use warp (my terminal set up and theme explained here: th-cam.com/video/ugwmH_xzkCA/w-d-xo.html)
      And if you're using Ubuntu, I have a terminal setup video for that too: th-cam.com/video/UvY5aFHNoEw/w-d-xo.html

  • @JJaitley
    @JJaitley 3 หลายเดือนก่อน +1

    @pixegami What are your suggestions on cleaning company docs before chunking? Some of the challenges faced are how to handle the index pages in multiple PDFs, as well as the headers and footers. You should definitely make a video about cleaning a PDF before chunking; it's much needed.

    • @pixegami
      @pixegami  3 หลายเดือนก่อน

      That's a tactical question that will vary from doc to doc. It's a great question and a great use-case for creative problem solving, though. Thanks for the suggestion and video idea.

  • @slipthetrap
    @slipthetrap 4 หลายเดือนก่อน +19

    As others have asked: "Could you show how to do it with an open source LLM?" Also, instead of Markdown (.md) can you show how to use PDFs ? Thanks.

    • @pixegami
      @pixegami  4 หลายเดือนก่อน +8

      Thanks :) It seems to be a popular topic so I've added to my list for my upcoming content.

    • @danishammar.official
      @danishammar.official 2 หลายเดือนก่อน

      If you make a video on the above request, kindly put the link in the description; it will be good for all users.

    • @raheesahmed56
      @raheesahmed56 2 หลายเดือนก่อน +2

      Instead of the .md extension you can simply use the .txt or .pdf extension; just replace the file extension.

    • @yl8908
      @yl8908 หลายเดือนก่อน

      Yes, please share how to work with PDFs directly instead of .md files. Thanks!

  • @jimg8296
    @jimg8296 หลายเดือนก่อน +1

    Thank you SO MUCH! Exactly what I was looking for. Your presentation was easy to understand and very complete. 5 STARS! Not to be greedy, but I'd love to see this running 100% locally.

    • @pixegami
      @pixegami  หลายเดือนก่อน +2

      Glad it was helpful! Running local LLM apps is something I get asked quite a lot about and so I do actually plan to do a video about it quite soon.

    • @jessicabull3918
      @jessicabull3918 หลายเดือนก่อน

      @@pixegami Yes please!

  • @nachoeigu
    @nachoeigu หลายเดือนก่อน +1

    You gained a new subscriber. Thank you, amazing content! Only one question: what about the cost associated with this software? How much does it consume per request?

    • @pixegami
      @pixegami  26 วันที่ผ่านมา

      Thank you, welcome! To calculate pricing, it's based on which AI model you use. In this video, we use OpenAI, so check the pricing here: openai.com/pricing
      1 Token ~= 1 Word. So to embed a document with 10,000 words (tokens) with "text-embedding-3-large" ($0.13 per 1M token), it's about $0.0013. Then apply the same calculation to the prompt/response for "gpt-4" or whichever model you use for the chat.
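      A quick sanity check of that arithmetic (the price is illustrative and changes over time, so always confirm on the pricing page):

```python
tokens = 10_000                    # ~10,000 words in the document
price_per_million = 0.13           # USD per 1M tokens, text-embedding-3-large
print(f"${tokens / 1_000_000 * price_per_million:.4f}")   # -> $0.0013
```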

  • @RajAIversion
    @RajAIversion 2 หลายเดือนก่อน

    Nailed it, and easily understandable. Can I make this into a chatbot?
    Anyone, please share your thoughts.

  • @sunnysk43
    @sunnysk43 5 หลายเดือนก่อน +3

    Amazing video - directly subscribed to your channel ;-) Can you also provide an example with using your own LLM instead of OpenAI?

    • @pixegami
      @pixegami  5 หลายเดือนก่อน +1

      Yup! Great question. I'll have to work on that, but in the meantime here's a page with all the LLM supported integrations: python.langchain.com/docs/integrations/llms/

  • @user-wi8ne4qb6u
    @user-wi8ne4qb6u 4 หลายเดือนก่อน +1

    Excellent coding! Working wonderfully! Much appreciated. One question, please: what's the difference if I change from .md to .pdf?

    • @pixegami
      @pixegami  4 หลายเดือนก่อน

      Thanks, glad you enjoyed it. It should still work fine :) You might just need to use a different "Document Loader" from Langchain: python.langchain.com/docs/modules/data_connection/document_loaders/pdf
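      For example, a hedged sketch of the PDF variant (only the loader changes; the chunk sizes are illustrative and the pypdf package is assumed to be installed):

```python
from langchain.text_splitter import RecursiveCharacterTextSplitter
from langchain_community.document_loaders import PyPDFLoader

documents = PyPDFLoader("data/books/alice_in_wonderland.pdf").load()
splitter = RecursiveCharacterTextSplitter(chunk_size=300, chunk_overlap=100)
chunks = splitter.split_documents(documents)
```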

  • @xspydazx
    @xspydazx 15 วันที่ผ่านมา

    Question: once a vector store is loaded, how can we output a dataset from the store to be used for fine-tuning?

  • @Chisanloius
    @Chisanloius 15 วันที่ผ่านมา +2

    Great level of knowledge and detail.
    Please, where is your OpenAI key stored?

    • @pixegami
      @pixegami  6 วันที่ผ่านมา

      Thank you! I normally just store the OpenAI key in the environment variable `OPENAI_API_KEY`. See here for storage and safety tips: help.openai.com/en/articles/5112595-best-practices-for-api-key-safety
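      A minimal sketch of reading it from the environment instead of hard-coding it (LangChain's OpenAI integrations pick up OPENAI_API_KEY automatically):

```python
import os

# e.g. `export OPENAI_API_KEY=sk-...` in your shell profile, or loaded from a .env file
api_key = os.environ.get("OPENAI_API_KEY")
if not api_key:
    raise RuntimeError("OPENAI_API_KEY is not set")
```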

  • @yangsong8812
    @yangsong8812 2 หลายเดือนก่อน +1

    Would love to hear your thoughts on how to use evaluation to keep LLM output in check. Can we set up a framework so that we have an evaluation process?

    • @pixegami
      @pixegami  2 หลายเดือนก่อน

      There's currently a lot of different research and tools on how to evaluate the output - I don't think anyone's figured out the standard yet. But stuff like this is what you'd probably want to look at: cloud.google.com/vertex-ai/generative-ai/docs/models/evaluate-models

  • @cindywu3265
    @cindywu3265 2 หลายเดือนก่อน +1

    Thanks for sharing the examples with the OpenAI embedding model. I'm trying to practice using HuggingFaceEmbeddings because it's free, but I wanted to check the evaluation metrics, like the apple and orange example you showed. Do you know if that exists by any chance?

    • @pixegami
      @pixegami  2 หลายเดือนก่อน

      Yup, you should be able to override the evaluator (or extend your own) to use whichever embedding system you want: python.langchain.com/docs/guides/evaluation/comparison/custom
      But at the end of the day, if you can already get the embedding, then evaluation is usually just a cosine similarity distance between the two, so it's not too complex if you need to calculate it yourself.
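      For example, a rough sketch of doing that comparison directly with HuggingFaceEmbeddings (the model name is illustrative; the sentence-transformers package is assumed to be installed):

```python
import math

from langchain_community.embeddings import HuggingFaceEmbeddings

emb = HuggingFaceEmbeddings(model_name="sentence-transformers/all-MiniLM-L6-v2")
a, b = emb.embed_query("apple"), emb.embed_query("orange")

dot = sum(x * y for x, y in zip(a, b))
norm_a = math.sqrt(sum(x * x for x in a))
norm_b = math.sqrt(sum(x * x for x in b))
print(f"cosine similarity: {dot / (norm_a * norm_b):.3f}")
```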

  • @vlad910
    @vlad910 4 หลายเดือนก่อน +1

    Thank you for this very instructive video. I am looking at embedding some research documents from sources such as PubMed or Google scholar. Is there a way for the embedding to use website data instead of locally stored text files?

    • @pixegami
      @pixegami  4 หลายเดือนก่อน +1

      Yes, you can basically load any type of text data if you use the appropriate document loader: python.langchain.com/docs/modules/data_connection/document_loaders/
      Text files are an easy example, but there's examples of Wikipedia loaders in there too (python.langchain.com/docs/integrations/document_loaders/). If you don't find what you are looking for, you can implement your own Document loader, and have it get data from anywhere you want.
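      A hedged sketch of loading website data instead of local files (the URL is illustrative; WebBaseLoader assumes beautifulsoup4 is installed and WikipediaLoader assumes the wikipedia package is installed):

```python
from langchain_community.document_loaders import WebBaseLoader, WikipediaLoader

web_docs = WebBaseLoader("https://example.com/article").load()
wiki_docs = WikipediaLoader(query="Retrieval-augmented generation", load_max_docs=2).load()
```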

    • @jessicabull3918
      @jessicabull3918 หลายเดือนก่อน

      @@pixegami Exactly the question and answer I was looking for, thanks

  • @RobbyRobinson1
    @RobbyRobinson1 5 หลายเดือนก่อน +1

    I was just thinking about this, great work.
    Hypothetically, what if your data sucks? What models can I use to create the documentation? (lol)

    • @pixegami
      @pixegami  5 หลายเดือนก่อน +1

      Haha that's a topic for another video. But yeah, if the data is not good, then I think that should be your first focus. This RAG technique builds on the assumption that your data is good-and it just adds value on top of that.

  • @uchiha_mishal
    @uchiha_mishal 5 วันที่ผ่านมา +1

    Nicely explained but I had to go through a ton of documentation for using this project with AzureOpenAI instead of OpenAI.

    • @pixegami
      @pixegami  5 วันที่ผ่านมา

      Thanks! I took a look at the Azure OpenAI documentation on LangChain and you're right, it doesn't exactly look straightforward: python.langchain.com/v0.1/docs/integrations/llms/azure_openai/

  • @FrancisRodrigues
    @FrancisRodrigues หลายเดือนก่อน +1

    Please, I'd like to see a recommendation model (products, images, etc.) based on our different sources; it could be scraping from webpages. Something to use in e-commerce.

    • @pixegami
      @pixegami  หลายเดือนก่อน

      Product recommendations are a good idea :) Thanks for the suggestion, I'll add it to my list.

  • @mlavinb
    @mlavinb 3 หลายเดือนก่อน +1

    Great content! Thanks for sharing.
    Can you suggest a Chat GUI to connect?

    • @pixegami
      @pixegami  3 หลายเดือนก่อน

      If you want a simple, Python based one, try Streamlit (streamlit.io/). I also have a video about it here: th-cam.com/video/D0D4Pa22iG0/w-d-xo.html

  • @hoangngbot
    @hoangngbot 25 วันที่ผ่านมา

    I want to hear your thoughts on what approach is likely the better one:
    1. Chop the document into multiple chunks and convert chunks to vectors
    2. Convert the whole document to a vector
    Thank you

    • @pixegami
      @pixegami  23 วันที่ผ่านมา

      I think it really depends on your use-case and the content. The best way to know is to have a way to evaluate (test) the results/quality.
      In my own use-cases, I find that a chunk length of around 3000 characters works quite well (you need enough context for the content to make sense). I also like to concatenate some context info into the chunk (like "this is page 5 about XYZ, part of ABC").
      But I haven't done enough research into this to really give a qualified answer. Good luck!

  • @hoangngbot
    @hoangngbot หลายเดือนก่อน +1

    Thank you for a great video. What if I already did word embedding and in the future I have some updates for the data?

    • @pixegami
      @pixegami  หลายเดือนก่อน

      Thanks! I'm working on a video to explain techniques like that. But in a nutshell, you'll need to attach an ID to each document you add to the DB (derived deterministically from your page meta-data) and use that to update entries that change (or get added): docs.trychroma.com/usage-guide#updating-data-in-a-collection
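      A rough sketch of what an update could look like once each chunk has a deterministic ID (the "path#chunk-index" scheme here is just an illustrative convention):

```python
from langchain_community.vectorstores import Chroma
from langchain_core.documents import Document
from langchain_openai import OpenAIEmbeddings

db = Chroma(persist_directory="chroma", embedding_function=OpenAIEmbeddings())

doc_id = "data/books/alice_in_wonderland.md#chunk-0"
updated = Document(
    page_content="Revised text for this chunk.",
    metadata={"source": "data/books/alice_in_wonderland.md"},
)
db.update_document(doc_id, updated)  # re-embeds and overwrites the stored entry
```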

  • @fengshi9462
    @fengshi9462 4 หลายเดือนก่อน +1

    Hi, your video is so good. I just want to know: if I want to automatically change my document in the production environment, keep the query service running, and always use the latest document as the source, how can I do this by changing the code? ❤

    • @pixegami
      @pixegami  4 หลายเดือนก่อน +1

      Ah, if you change the source document, you actually have to generate a new embedding and add it to the RAG database (the Chroma DB here). So you would have to figure out which piece of the document changed, then create a new entry for it in the database. I don't have a code example right now, but it's definitely possible.

  • @AjibadeYakub
    @AjibadeYakub 10 วันที่ผ่านมา +1

    This is great work, thank you!
    How can I use the result of a SQL query or a DataFrame, rather than text files?

    • @pixegami
      @pixegami  6 วันที่ผ่านมา

      Yup, looks like there is a Pandas Dataframe Document loader you can use with Langchain: python.langchain.com/v0.1/docs/integrations/document_loaders/pandas_dataframe/
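      For example, a minimal sketch going from a SQL query to Documents via pandas (the database file, table, and column names are illustrative):

```python
import sqlite3

import pandas as pd
from langchain_community.document_loaders import DataFrameLoader

conn = sqlite3.connect("example.db")
df = pd.read_sql("SELECT id, description FROM products", conn)
docs = DataFrameLoader(df, page_content_column="description").load()
```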

  • @ailenrgrimaldi6050
    @ailenrgrimaldi6050 หลายเดือนก่อน +1

    Thank you for this video, is NLTK something required to do this?

    • @pixegami
      @pixegami  หลายเดือนก่อน +1

      The NLTK library? I don't think I had to use it here in the project, a lot of the other libraries might give you all the functionality at a higher abstraction already.

  • @annialevko5771
    @annialevko5771 3 หลายเดือนก่อน +1

    Hey, nice video. I was just wondering, what's the difference between doing it like this and using chains? I noticed you didn't use any chain and directly called predict() with the prompt 🤔

    • @pixegami
      @pixegami  3 หลายเดือนก่อน

      With chains, I think you have a little bit more control (especially if you want to do things in a sequence). But since that wasn't the focus of this video, I just did it using `predict()`.
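      For comparison, a minimal sketch of the same call written as a small chain (prompt piped into the model), versus calling predict() directly as in the video:

```python
from langchain_core.prompts import ChatPromptTemplate
from langchain_openai import ChatOpenAI

prompt = ChatPromptTemplate.from_template(
    "Answer the question based only on this context:\n{context}\n\nQuestion: {question}"
)
chain = prompt | ChatOpenAI()
response = chain.invoke(
    {"context": "Alice follows the White Rabbit.", "question": "Who does Alice follow?"}
)
```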

  • @user-wm2pb3hi7p
    @user-wm2pb3hi7p หลายเดือนก่อน

    How can we make a RAG system that answers over both structured and unstructured data?
    For example, a user uploads a CSV and a text file and starts asking questions; the chatbot then has to answer from both sources.
    (Structured data should be stored in a different database and passed to a tool to process; unstructured data should be stored in the vector database.)
    How can we do this effectively?

  • @NahuelD101
    @NahuelD101 5 หลายเดือนก่อน +2

    Very nice video, what kind of theme do you use to make the vscode look like this? Thanks.

    • @pixegami
      @pixegami  5 หลายเดือนก่อน +1

      I use Monokai Pro :)

    • @pixegami
      @pixegami  5 หลายเดือนก่อน +2

      The VSCode theme is called Monokai Pro :)

  • @henrygagejr.-founderbuildg9199
    @henrygagejr.-founderbuildg9199 3 หลายเดือนก่อน +2

    Great instructions! I read through all the comments.
    How do you get paid? I value the work of others and want to explore an affiliate model that I tested a year ago.
    What is a good way to connect with you and explore possibilities of mutual interest.

    • @pixegami
      @pixegami  3 หลายเดือนก่อน +1

      Thanks for your kind words, but I'm actually doing these videos as a hobby and I already have a full time job so I'm not actually interested in exploring monetisation options right now.

  • @user-iz7wi7rp6l
    @user-iz7wi7rp6l 5 หลายเดือนก่อน

    Anyone facing a Tesseract error on Windows? It works well on Linux.

  • @naveeng2003
    @naveeng2003 3 หลายเดือนก่อน +2

    How did you rip the AWS documentation?

  • @lukashk.1770
    @lukashk.1770 26 วันที่ผ่านมา +1

    Do these tools work with code also? For example, when you have a big codebase, querying that codebase to ask how XYZ is implemented would be really useful. Or generating docs, etc.

    • @pixegami
      @pixegami  23 วันที่ผ่านมา

      I think the idea of a RAG app should definitely work with code.
      But you'll probably need to have an intermediate step to translate that code close to something you'd want to query for first (e.g. translate a function into a text description). Embed the descriptive element, but have it refer to the original code snippet.
      It sounds like a really interesting idea to explore for sure!

  • @gvtanuja4874
    @gvtanuja4874 4 หลายเดือนก่อน +2

    Great video.. but where have you added the OpenAI API keys?

    • @pixegami
      @pixegami  3 หลายเดือนก่อน +1

      You can add them to your environment variable :) I add mine into my .bashrc file.

  • @moriztrautmann8231
    @moriztrautmann8231 22 วันที่ผ่านมา +1

    Thank you very much for the video. It seems that adding the chunks to the Chroma database takes a really long time. If I just save the embeddings to a JSON file it takes a few seconds, but with Chroma it takes like 20 minutes... Is there something I am missing? I am doing this on a document only about one page long.

    • @pixegami
      @pixegami  6 วันที่ผ่านมา

      Hey, thanks for commenting!
      It does seem to me something is wrong - the way you're generating embeddings (as a JSON) and via Chroma seems to be doing different things (because they should normally take the same amount of time, if it's for the same amount of text).
      Have you tried using different embedding functions? Or is your ChromaDB saved onto a slower disk drive?

  • @MichaelChenAdventures
    @MichaelChenAdventures 2 หลายเดือนก่อน +1

    does the data have to be in a .md format? Also, how do you prep the data beforehand?

    • @pixegami
      @pixegami  2 หลายเดือนก่อน

      The data can be anything you want. Here's a list of all the Document loaders supported in Langchain (or you can even write your own): python.langchain.com/docs/modules/data_connection/document_loaders/
      The level of preparation is up to you, and it depends on your use case. For example, if you want to split your embeddings by chapters or headers (rather than some length of text), your data format will need a way to surface that.
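      For example, a hedged sketch of splitting on Markdown headers so each chunk keeps its chapter/section in the metadata (the header labels and sample text are illustrative):

```python
from langchain.text_splitter import MarkdownHeaderTextSplitter

splitter = MarkdownHeaderTextSplitter(
    headers_to_split_on=[("#", "chapter"), ("##", "section")]
)
chunks = splitter.split_text(
    "# Chapter 1\nDown the rabbit hole...\n## A dry tale\nMore text."
)
```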

  • @spicytuna08
    @spicytuna08 หลายเดือนก่อน

    would this work for a general question such as this: please summarize the book in 5 sentences?

  • @mohsenghafari7652
    @mohsenghafari7652 26 วันที่ผ่านมา +1

    Hi dear friend,
    thank you for your efforts.
    How can I use this tutorial with PDFs in another language (for example, Persian)?
    What would the right approach be?
    I have made many attempts and tested different models, but the results when asking questions about the PDFs are not good or accurate!
    Thank you for the explanation

    • @pixegami
      @pixegami  23 วันที่ผ่านมา

      Thank you for your comment. For good performance in other languages, you'll probably need to find an LLM model that is optimized for that language.
      For Persian, I see this result: huggingface.co/MaralGPT/Maral-7B-alpha-1

  • @marselse
    @marselse 16 วันที่ผ่านมา +1

    Amazing video! How would I make this into a webapp? Like a support chat bot?

    • @pixegami
      @pixegami  6 วันที่ผ่านมา +1

      Thank you! To make it into a web-app, you'd first need to turn this into an API and deploy it somewhere (my next video tutorial will cover that, so stay tuned!). Then you'll need to connect the API to a front-end (webpage or app) to for your users.
      There's a couple of low-code/no-code tools to help you achieve this, but I haven't looked into them in detail yet. Or you can code it all up yourself as well if you are technical.

  • @kashishvarshney2225
    @kashishvarshney2225 4 หลายเดือนก่อน +1

    Hi bro, I am creating a chatbot which takes data from a third-party API, which means there is less data, but it's dynamic data on every call. Should I use the RAG approach? If not, then please suggest a better approach.

    • @pixegami
      @pixegami  4 หลายเดือนก่อน

      Hmm, I think RAG probably isn't the right approach. But it depends... For example if you get +3000 characters of dynamic data each call, then it might still be helpful to generate a vector DB on the spot so you can use RAG to narrow down the answer. But it's going to make each call a lot slower.
      But if you have way less data than that (say

  • @gabrielmartinphilot980
    @gabrielmartinphilot980 2 หลายเดือนก่อน

    Hello man, first of all, great video! The best I've seen so far! I've looked at everything there is about this topic on the web, haha. Already subbed and liked the vid!
    When I tried to replicate it, the database part went well, but query_data had some errors!
    Can you please help this poor noob here =)
    The errors are in the comment below.

    • @gabrielmartinphilot980
      @gabrielmartinphilot980 2 หลายเดือนก่อน

      python query_data.py "Me diga as 5 etapas da Metodologia Crisp"
      /home/gabriel/linux_folder/studies/model_own_data_streamlit/venv/lib/python3.10/site-packages/langchain/embeddings/__init__.py:29: LangChainDeprecationWarning: Importing embeddings from langchain is deprecated. Importing from langchain will no longer be supported as of langchain==0.2.0. Please import from langchain-community instead:
      `from langchain_community.embeddings import OpenAIEmbeddings`.
      To install langchain-community run `pip install -U langchain-community`.
      warnings.warn(
      /home/gabriel/linux_folder/studies/model_own_data_streamlit/venv/lib/python3.10/site-packages/langchain_core/_api/deprecation.py:117: LangChainDeprecationWarning: The class `langchain_community.embeddings.openai.OpenAIEmbeddings` was deprecated in langchain-community 0.0.9 and will be removed in 0.2.0. An updated version of the class exists in the langchain-openai package and should be used instead. To use it run `pip install -U langchain-openai` and import as `from langchain_openai import OpenAIEmbeddings`.
      warn_deprecated(
      Traceback (most recent call last):
      File "/home/gabriel/linux_folder/studies/model_own_data_streamlit/src/dev/query_data.py", line 61, in
      main()
      File "/home/gabriel/linux_folder/studies/model_own_data_streamlit/src/dev/query_data.py", line 41, in main
      results = db.similarity_search_with_relevance_scores(query_text, k=3)
      File "/home/gabriel/linux_folder/studies/model_own_data_streamlit/venv/lib/python3.10/site-packages/langchain_core/vectorstores.py", line 324, in similarity_search_with_relevance_scores
      docs_and_similarities = self._similarity_search_with_relevance_scores(
      File "/home/gabriel/linux_folder/studies/model_own_data_streamlit/venv/lib/python3.10/site-packages/langchain_core/vectorstores.py", line 272, in _similarity_search_with_relevance_scores
      docs_and_scores = self.similarity_search_with_score(query, k, **kwargs)
      File "/home/gabriel/linux_folder/studies/model_own_data_streamlit/venv/lib/python3.10/site-packages/langchain_community/vectorstores/chroma.py", line 437, in similarity_search_with_score
      query_embedding = self._embedding_function.embed_query(query)
      File "/home/gabriel/linux_folder/studies/model_own_data_streamlit/venv/lib/python3.10/site-packages/langchain_community/embeddings/openai.py", line 697, in embed_query
      return self.embed_documents([text])[0]
      File "/home/gabriel/linux_folder/studies/model_own_data_streamlit/venv/lib/python3.10/site-packages/langchain_community/embeddings/openai.py", line 668, in embed_documents
      return self._get_len_safe_embeddings(texts, engine=engine)
      File "/home/gabriel/linux_folder/studies/model_own_data_streamlit/venv/lib/python3.10/site-packages/langchain_community/embeddings/openai.py", line 494, in _get_len_safe_embeddings
      response = embed_with_retry(
      File "/home/gabriel/linux_folder/studies/model_own_data_streamlit/venv/lib/python3.10/site-packages/langchain_community/embeddings/openai.py", line 116, in embed_with_retry
      return embeddings.client.create(**kwargs)
      File "/home/gabriel/linux_folder/studies/model_own_data_streamlit/venv/lib/python3.10/site-packages/openai/resources/embeddings.py", line 113, in create
      return self._post(
      File "/home/gabriel/linux_folder/studies/model_own_data_streamlit/venv/lib/python3.10/site-packages/openai/_base_client.py", line 1200, in post
      return cast(ResponseT, self.request(cast_to, opts, stream=stream, stream_cls=stream_cls))
      File "/home/gabriel/linux_folder/studies/model_own_data_streamlit/venv/lib/python3.10/site-packages/openai/_base_client.py", line 889, in request
      return self._request(
      File "/home/gabriel/linux_folder/studies/model_own_data_streamlit/venv/lib/python3.10/site-packages/openai/_base_client.py", line 980, in _request
      raise self._make_status_error_from_response(err.response) from None
      openai.AuthenticationError: Error code: 401 - {'error': {'message': 'Incorrect API key provided: sua_chave. You can find your API key at platform.openai.com/account/api-keys.', 'type': 'invalid_request_error', 'param': None, 'code': 'invalid_api_key'}}
      Where did you use your OpenAI API key?
      Because I use:
      from dotenv import load_dotenv
      load_dotenv()
      And it's working.

  • @noubgaemer1044
    @noubgaemer1044 4 หลายเดือนก่อน +1

    This is great, but can we use our own custom LLM as the server? I don't want to use OpenAI. Is there any way?

    • @pixegami
      @pixegami  4 หลายเดือนก่อน

      Yes, Langchain is LLM agnostic. So whilst OpenAI is just the easiest one to demo with (and also the default option), you should be able to swap it out to any other LLM by changing the LLM implementation. See python.langchain.com/docs/modules/model_io/llms/
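      A minimal sketch of that swap (ChatOllama is just one example of an alternative backend; class names follow langchain-openai / langchain-community and may differ by version):

```python
from langchain_openai import ChatOpenAI
# from langchain_community.chat_models import ChatOllama  # e.g. a local Llama model

model = ChatOpenAI()
# model = ChatOllama(model="llama2")  # only this line changes for a local model
response = model.invoke("Answer the question based only on the provided context: ...")
```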

  • @johnfakes1298
    @johnfakes1298 3 หลายเดือนก่อน +1

    What is the compiler you are using? I am using Jupyter Notebook, but yours looks better.

    • @pixegami
      @pixegami  3 หลายเดือนก่อน +1

      If you mean the development environment (editor or IDE), then I'm using VSCode with the Monokai Pro theme.

    • @johnfakes1298
      @johnfakes1298 3 หลายเดือนก่อน

      @@pixegami thank you

  • @khalilchi5726
    @khalilchi5726 3 หลายเดือนก่อน +1

    I've been trying to use Llama 2 instead of ChatOpenAI but I couldn't. Could you please provide some insights that could help with that?

    • @pixegami
      @pixegami  3 หลายเดือนก่อน +2

      This was quite a commonly requested thing, so I made another tutorial about how to switch to different LLMs here: th-cam.com/video/HxOheqb6QmQ/w-d-xo.html
      I think you just have to switch the LLM implementation that you use (but the prompt structure might be different too). In that video, you'll see code for how to use Llama2 via AWS Bedrock.

    • @khalilchi5726
      @khalilchi5726 3 หลายเดือนก่อน

      @@pixegami Thanks alot! I switched to Gemini Pro and it worked!

  • @canasdruid
    @canasdruid 18 วันที่ผ่านมา +1

    What is more advisable if I work with PDF documents, transforming them into text using a library like PyPDFLoader, or transforming them into another format that is easier to read?

    • @pixegami
      @pixegami  6 วันที่ผ่านมา

      I haven't done a deep dive on what's the most optimal way to use PDF data yet. I think it really depends on the data in the PDF, and what the chunk outputs look like. You probably need to do a bit of experimentation.
      If you have specific patterns with your PDFs (like lots of tables or columns) I'd probably try to pre-process them somehow first before feeding them into the document loader.

  • @NinVibe
    @NinVibe หลายเดือนก่อน +1

    what would be the steps to deploy this on the cloud and be ready for an internal team to use it?

    • @pixegami
      @pixegami  หลายเดือนก่อน +1

      Good question. I think I wanna cover this in one of my new videos this month. I can make a tutorial on how to deploy this and turn it into a production-grade API.

    • @NinoRisteski
      @NinoRisteski หลายเดือนก่อน

      ​@@pixegami great, waiting for it!

  • @user-cc3ev7de9v
    @user-cc3ev7de9v 2 หลายเดือนก่อน

    Which model are you using in this?