"I want Llama3 to perform 10x with my private knowledge" - Local Agentic RAG w/ llama3

AI Jason

มุมมอง 515 880

เพิ่มลงใน
- เพลย์ลิสต์ของฉัน
- ดูภายหลัง
แชร์

แชร์

ฝัง

ขนาดวิดีโอ:

แสดงแผงควบคุมโปรแกรมเล่น

เล่นอัตโนมัติ

เล่นใหม่

เผยแพร่เมื่อ 30 ธ.ค. 2024

ความคิดเห็น • 317

@Jim-ey3ry 8 หลายเดือนก่อน ⁺²⁴⁵
This is prob one of the best RAG video I've seen, so many learnings in 20 mins
@kenchang3456 8 หลายเดือนก่อน ⁺⁸⁷
Man, your videos keep getting better every time I look. You have a great mind and your presentation is excellent. Thank you very much, again, for sharing!
@magicismagic123 7 หลายเดือนก่อน ⁺⁴
he is much better than 99.9% wanna be over hyped ai gurus on youtubu, twitter and linkedin!
@free_thinker4958 6 หลายเดือนก่อน
@@magicismagic123 of course, jason is ahead of those fake ai gurus throwing bull***shit content only for views and buzz
@LauraMoreno-z3d 3 หลายเดือนก่อน
@@magicismagic123 hes lying to you, this solution isnt local
@CynicalWilson 8 หลายเดือนก่อน ⁺⁷⁶
Holy crap! This gave me such amazing background knowledge, love it! Now, what would be extra cool, would be if you could do a real "hands-on" type of workshop to go through it all by setting up the environment completely, including the actual training/RAG implementation of a set of various document types (PDF, excel, website etc..) to extend a locally running llama 3 instance 😊
@JanMasata-b4m วันที่ผ่านมา
Did you find something like that?
@shyamvai 8 หลายเดือนก่อน ⁺¹³
One of the most informative RAG videos I’ve seen. Can’t wait to see more from your channel.
@FightFlixTv 8 หลายเดือนก่อน ⁺¹²
This is the best RAG video on the internet, awesome job, no fluff, high complexity but easy to understand, nice work
@digitallifedigitalindia9023 8 หลายเดือนก่อน ⁺⁵
00:05 AI can revolutionize Knowledge Management
01:46 Llama3 can process precise knowledge with fast inference
05:27 Market strategy for AI startups
07:16 Convert PDF files to markdown format for enhanced accuracy and control
10:47 Finding the optimal chunk size through experiments
12:34 Hybrid search combines Vector search and keyword search for better results
16:12 Building a local agentic RAG with llama3
17:48 Running Llama3 model on local machine and using Visual Studio Code
20:53 Setting up key components for Llama3 performance
22:20 Creating a complex agentic RAG workflow for document retrieval and answering
@jamesrruff 5 หลายเดือนก่อน ⁺³
I appreciate your generosity in taking the time to break this down. Thank you.
@esiriel 3 หลายเดือนก่อน ⁺⁵
Interesting ! I enjoyed your full breakdown, under the hood - thank you. The Private part is very important but I didn't want to re-invent the wheel and deal with bugs... so I'm using Spheria AI for my life's knowledge base - full privacy and data ownership... Almost same as running on local, and their UX is actually great !
@MrLiteratur 8 หลายเดือนก่อน ⁺²
Thanks, Jason, incredible as always! Would you consider sharing the code from the walkthrough? 🙏
@AIJasonZ 7 หลายเดือนก่อน
Thanks mate, appreciate it! Code is in the description link!
@yashsrivastava677 7 หลายเดือนก่อน
@@AIJasonZ Link is not there
@loserc1854 6 หลายเดือนก่อน
@AIJasonZ Link is not there 😭😿
@scottmiller2591 8 หลายเดือนก่อน ⁺⁵¹
1) The link for the corrective RAG agent had an extra URL attached at the end which caused it to fail; manually tracing the link got me to the proper location
2) LlamaParse looks like a wonderful tool, since I have a lot of documents with equations, and I really need it to grab equations, if for no other reason than to return them. Unfortunately, LlamaParse requires an API key and seems to send PDFs off for processing, something that others have noted and there is an open issue from 2 weeks ago. As of 3 hours ago, it's still an open issue - clearly most companies don't want to send internal docs out of house. Hopefully this gets resolved soon.
3) Really liked your presentation - easy to follow every step with the provided materials.
@dennou2012 8 หลายเดือนก่อน ⁺¹
Hopefully we will have more better options for local use - shame it's not a local only pipeline yet
@yunxinglu4020 7 หลายเดือนก่อน ⁺²
yes - I have found this issue too. LlamaParse seems use OpenAI llm to process the pdf and it leads to the privacy concerns.
@SahlEbrahim 5 หลายเดือนก่อน
i cant find this particular code. the code in the link uses open ai.
@thekr0w561 3 หลายเดือนก่อน
@scottmiller2591 Hey, do you have any suggestions for good local PDF Parsers?? LLamaIndex is amazing but as you pointed out, it is only accessed through API unfortunately...
@scottmiller2591 3 หลายเดือนก่อน
@@thekr0w561 I haven't tried either of the following, but they are the most recent contenders for reading PDFs with LaTeX math and tables (both on GitHub):
1. opendatalab/MinerU
2. facebookresearch/nougat
@kaushalbhavsarrocks 7 หลายเดือนก่อน ⁺³
Really well done, sir! To the point and informative. I wish people just made videos like you do. Hats off!
@JonathanCombs2023 8 หลายเดือนก่อน
Thanks!
@AIJasonZ 8 หลายเดือนก่อน
Thanks Jonathan! ❤️
@heliolucio7691 7 หลายเดือนก่อน ⁺³
This is the best RAG video that I saw, I current work with it so I can say: this video is GOLD, enjoy everybody! And thank you so much, man.
@fredygerman_ 8 หลายเดือนก่อน ⁺⁴
You always amaze me by the amount of knowledge I get from your videos
@tkp2843 8 หลายเดือนก่อน ⁺⁴³
Firecrawl boosted our RAG accuracy at our company. fast + provided good markdown format.
Llama parse also super helpful too! Amazing video Jason! This is gold!
Edit: thanks for the likes :)
@rafaelmiller9147 8 หลายเดือนก่อน
The search api is just insane on firecrawl
@jasonfinance 8 หลายเดือนก่อน ⁺⁴
Didn't know about the Agentic RAG techniques, thanks for sharing!! That's definitely a trade off between speed & quality, but good to have the option
@GlueTubber 19 วันที่ผ่านมา ⁺²
it's like this: you have trained yourself to be a computer programmer, but you still go to stack overflow to answer questions. Your programmer training is the baked in knowledge, stack overflow is your RAG.
@realistindenial 2 หลายเดือนก่อน
Bravo, well done! This video is not for the faint of heart, but it's great for those who are good at grasping technical concepts quickly and leveraging the information contained into real world usage. Thank you!
@amitbuch 4 หลายเดือนก่อน
Very thorough introduction. Very valuable inputs on refining and ranking of retrieval pipeline design.
@brandonmchugh9821 2 หลายเดือนก่อน
Omg, this is exactly the info ive been looking for. Great video. Insanely insightful.
@Mystrykdlad หลายเดือนก่อน
This is one of the most detailed RAG video, I came across
Hope I can create atleast one such rag model using this information
Thanks a lot buddy 💥💥
@adityapanwar1220 7 หลายเดือนก่อน ⁺¹
Wow, This turns me back to think that the RAGs i implemented before was just a mini brother of this. Amazing work buddy.
@titusblair 8 หลายเดือนก่อน ⁺⁵
Yet again an amazing tutorial, thanks so much Jason!
@Deeneeshsu หลายเดือนก่อน
You are doing great man. Keep it up ❤ love from Europe
@Entropy67 7 หลายเดือนก่อน ⁺¹
Subscribed, dont have an AI company since I'm still a poor student... this video was very informative, the man speaks at two times speed just like my professor. I respect it 😁
@starmap 7 หลายเดือนก่อน ⁺²
Great content! Thanks for putting in the effort. Will use this.
@thenickcornelius 7 หลายเดือนก่อน ⁺¹⁸
Came to train my 3 Llamas... Now I'm a full stack developer.
@seba29 6 หลายเดือนก่อน
you won a new subscriber with this superb video! thanks a lot for sharing all the know how
@guidoponzio6894 7 หลายเดือนก่อน
Bro i been watching a lot of llm videos this past days, and by far this is the best i have seen. Thank you for your work
@NdxtremePro 6 หลายเดือนก่อน
Thanks for this video! You explained the over arching idea very well, and thanks for the idea.
@MerleRichardson-z1d 4 หลายเดือนก่อน
❤ your teaching style. Thanks you. I have subscribed and I will be following you.
@lamprime 7 หลายเดือนก่อน ⁺⁶
Is there a GitHub repo for the examples that you've demoed? Excellent video!
@rab0309 8 หลายเดือนก่อน ⁺¹²
great video keep making these please.. only "criticism" / advice if you can call if that is to keep things focused on local / open source solutions as much as possible.. love the use of Ollama here for example.. things that perhaps don't require API keys, subscriptions, external integrations / dependencies help people like me understand more of what's going on in a workflow like this! thanks again!
@MyWatermelonz 8 หลายเดือนก่อน ⁺⁷
I prefer finetuning to RAG first then RAG on top of the finetuned model. Just a simple QLORA is all you need. It really helps a ton.
@helix8847 8 หลายเดือนก่อน ⁺¹
How would you go about doing that, as in just do it backwards from the video?
@bruinx1679 7 หลายเดือนก่อน
Excellent video! I don't have much experience with RAG and this was sooo helpful!
@contractorwolf 7 หลายเดือนก่อน
Jason, I watch a lot of AI videos but I learn the most from yours. I am actually excited everytime i see you have put another one out. Keep up the great work!
@gaijinshacho 8 หลายเดือนก่อน ⁺¹⁰
Great timing! Why do you always read my mind JASON!!?! lol
@offeraviad 26 วันที่ผ่านมา
Love the details!! great video!
@Saintel หลายเดือนก่อน
Great job!
@seventhapex 8 หลายเดือนก่อน ⁺²
dude... great video! Thanks for the knowledge!
@beastmastern159 20 วันที่ผ่านมา
Hi jason from spain good video
@MyAmazingUsername 8 หลายเดือนก่อน ⁺¹
Really great tutorial, teaches a lot in very short time! Thanks!
@omvrio-d2v 5 หลายเดือนก่อน
MAAAAAAAAAAAN AMAZING VIDEO VERY INSIGHTFUL AND HELPFUL TO LEVEL-UP YOUR RAG IMPLEMENTATION. HIGHLY RECOMMEND.
@EveDe-ug3zv 8 หลายเดือนก่อน ⁺³
Great video Jason, I only missed routing as a technique to determine if your question should really go through the RAG. James Briggs has done a few good videos on “semantic routing”.
Is your example notebook available somewhere?
@christenjacquottet9799 7 หลายเดือนก่อน
I'm wondering the same thing. Don't see a link to a github repo
@Paulo-ut1li 6 หลายเดือนก่อน
That's the most useful RAG video on YT. Thank you!
@Omar-p9r3c 8 หลายเดือนก่อน
right when i needed it, thank you man!
also, just finished watching and i understood the theory behind it but kinda got lost during the code explanation, i might watching again and again
@tonygil8617 8 หลายเดือนก่อน ⁺³
Hi brilliant session , do you have a link for the notebook ?
@kartiknighania8588 8 หลายเดือนก่อน ⁺³
OG Jin Yang from Silicon Valley.. Amazing video 🎉
@arianetrek7049 7 หลายเดือนก่อน
The corrective RAG schema explains why AI often tries to bring results from the web even when you tell them not to in prompt. If it doesn't understand the source properly it will look elsewhere. This was insightful, thank you.
@alinada9496 7 หลายเดือนก่อน
You said it all !!!!! , Thanks for this illustrative video
@azathought_games 8 หลายเดือนก่อน ⁺¹
Such a bait and switch. Thumbnail promises fine tuning tutorial. Delivers best improve-your-RAG video on the internet. Excellent work.
@PIOT23 8 หลายเดือนก่อน ⁺¹
What a great video! Thanks for sharing your knowledge
@MrStevemur 7 หลายเดือนก่อน
Thanks! It's so fascinating how these programs 'think.' Even if I don't install one, concepts like chunking seem to translate to humans as well.
@MinoasPediadas หลายเดือนก่อน
Thank you for the explanations, very helpful. I would appreciate your thoughts on the following scenario:
Let's say you have a pdf file that contains some criteria/requirements. Assume also you have a second pdf file which describes a project and also describes how the project will address the specific criteria/requirements defined in the 1st file. I was wondering if there is an LLM-based workflow that could examine how well the project description addresses the criteria in the 1st file. I was thinking of a process like this:
1. Get some ready-made LLM model like Ollama or BERT
2. Train it again with your 1st file as an input
3. Load the second file to a vectorDB and create a chain
4. Query the second file
I'm not sure if this works though and I'm not sure how I can specify my queries to cross-examine SPECIFICALLY the second file against the SPECIFIC criteria of the 1st file. Would you have some kind of guidance for me?
@MosheRecanati 7 หลายเดือนก่อน ⁺²
Where I can find the notebook that you're presenting in this great video?
@PoGGiE06 7 หลายเดือนก่อน ⁺¹
Great video, thanks. New subscriber (and like) here. I had a couple of questions though: why use langchain? It seems unnecessary from what I have read. Would also love a demo ipynb/copy of code.
@beelzebub2808 7 หลายเดือนก่อน ⁺¹
This is extremely helpful! Awesome!
@vjunloc1 3 วันที่ผ่านมา
Yo Jin Yang, Great video man
@dataanalysiscourse785 8 หลายเดือนก่อน ⁺⁴
Awesome content!
@akash.vekariya 6 หลายเดือนก่อน
Really the best RAG video on youtube so far
@walidr 6 หลายเดือนก่อน ⁺⁴
Llama parse is used via their api. You send data to their servers. So it’s not truly local.
@adrowsypoet หลายเดือนก่อน
Doesn't it work offline tho? Couldn't you just keep it unconnected to the internet? Jw
@walidr หลายเดือนก่อน
@ no. Llamaparse is an online service, running its own proprietary model in their online server.
@edmundwong2671 21 วันที่ผ่านมา
You're the best programming teacher.
@sd5853 8 หลายเดือนก่อน ⁺¹
I don’t understand everything but I can feel the gold penetrating my ears
@DanielPeeters 2 หลายเดือนก่อน
So there's no way to do sort of intelligent/adaptive chunking size, based on how information-rich the particular text is? It could do overlapping emebedding too, e.g. if one paragraph of a textbook is explaining an overall process, that woudl be one vector overall, but then you have further sub-vectors for the individual steps
@thebardlydm 5 หลายเดือนก่อน
Do you have a video that goes a bit more into the Agentic query restructuring metadata filtering section or sample?
@free_thinker4958 8 หลายเดือนก่อน ⁺³
You're the man 💯👏
@FernandoOtt 8 หลายเดือนก่อน ⁺¹
Awesome content Jason. A Question. I need to create an AI psychologist and store college data, but this college data is a guide of what to speak, not the content itself.
In that case, what is the best approach, RAG or Fine-tuning?
@jonm6834 7 หลายเดือนก่อน
You got a sub. Finally, an AI channel that actually teaches.
@98hghghg98 8 หลายเดือนก่อน ⁺¹
great video jason! quick question, im wondering if a knowledge graph in place of vector database would be better since it mitigates the lost in the middle problem?
@justinwong2442 7 หลายเดือนก่อน
Well said, thank you for this video!
@nyceyes 2 หลายเดือนก่อน
19:10 Can you provide the source location for the Notebook shown?
@rachelchu5363 6 หลายเดือนก่อน
what is your opinion on using hypothetical answer before search? e.g. send query to llm to get general answer, which is used to enrich the query to get more context, then send enriched query to vector store to do a search
@CaptainOverpants 7 หลายเดือนก่อน ⁺⁶
where can i get the notebook code?
@dr.akshayprakash5735 6 หลายเดือนก่อน ⁺²
Has anyone built an AI chatbot for a client/ company? If so, I wanted to know if a tool that monitors your AI chatbot for incorrect or dangerous responses and alert the developer and log it when it happens would be useful? Me and my friends had built such a AI monitoring tool for a hackathon and wanted to know it would be helpful for others. Any advice would be helpful 🙂
@sharex21 8 หลายเดือนก่อน ⁺²
I'm a simple man. I see a new AI Jason video, I click.
@asetkn 8 หลายเดือนก่อน
Platform agnostic LLM space overview videos from Jason are the best on AI YT
@jorper98 8 หลายเดือนก่อน
Amazing info shared -. Thank you!
@chiluone 7 หลายเดือนก่อน
Can we have access to the jupyter notebook for closer inspection and deeper learning? :)
@javierhoyos9382 14 ชั่วโมงที่ผ่านมา
Have you tried these other ai agent builder ?
Could you make a comparison between them?
Vectorshift ai
N8n
triggerdev
crewai
langchain
Lanflow
Flowise
Dify ai
@nrusimha11 7 หลายเดือนก่อน
Thank you. Can you say a little about your hardware setup for this work? This information is missing from a lot of online sources.
@ujjwaltyagi9981 7 หลายเดือนก่อน
Man we need a complete course from you on RAG
@gdr189 7 หลายเดือนก่อน
Hi, what are the areas current LLMs excel at?
I am new to this world of AI, but not IT (familiar with infra). It is good that people are trying out things to see what it can do. But my naïve thoughts are that as a language tool, it just looks for patterns of words that appear close together, and knows enough of the formation of language that it produces text that is not only readable, but also relevant. But this surely must have limits, if it does not actually understand?
Would it be serving up answers from a well vetted and written sources such as internal KMS by using this RAG method? Our team was thinking about it use for education / learning - perhaps tied into custom flashcard and evaluation of human provided answers. Alongside the still very useful text summarisation, alternative wording suggestions.
@estebann 6 หลายเดือนก่อน
I wonder if the typo inside "template" ("premable" instead if "preamble") has any impact on results
@nitesh795 7 หลายเดือนก่อน
Great video Jason, but do you have a workflow video for a windows wsl setup?
@AJPHIL-bt4me 2 หลายเดือนก่อน
AI JSON do you still have a github link to this project, the one in project description was removed
@jackmermigas9465 7 หลายเดือนก่อน ⁺¹
wow nice work thanks!
@Max-hj6nq 8 หลายเดือนก่อน ⁺¹
Solid video Jason
@morffisTFT 8 หลายเดือนก่อน ⁺¹⁰⁰
Can you share the code in the video?
@basedmuslimbooks 8 หลายเดือนก่อน ⁺¹⁵
I was hoping that was the case since it's a "simple" workflow
@pollywops9242 8 หลายเดือนก่อน ⁺⁴
The code is personal you need to apply for a download link with meta and it will provide the code to copy / paste
@christenjacquottet9799 7 หลายเดือนก่อน
@@pollywops9242 apply where? I don’t see it
@joesmoo9254 7 หลายเดือนก่อน
@@pollywops9242😂
@williamcase426 7 หลายเดือนก่อน
Plz
@kaankorkmaz8180 6 หลายเดือนก่อน
Checking if the LLM hallucinated by using an LLM... how can we rely on this? Thanks for the great overview.
@mateuszzemke9194 7 หลายเดือนก่อน
great content! why wouldn't you use groq to speed up the agent response?
@jasonkergosien3159 6 หลายเดือนก่อน
Great video!
Maybe I missed it. Is there a link to the python notebook?
@mahmood392 7 หลายเดือนก่อน
Would you have plans to create a tutorial that connects what ur teaching here and running thing on something like AnythingLLM that allows document reading to create embeddings.
@szpiegzkrainydeszczowcow8476 8 หลายเดือนก่อน
You are relevant, Subscribing to your channel!
@nuluai 7 หลายเดือนก่อน
We been trying to build a middleware that connects with any inventory ERP to be able to have real time data information about inventory data for the chatbot
@biiiiiimm 8 หลายเดือนก่อน
What about preparing data, for exemple as question / response, the response would be used to generate embedding and the response would be the data retrieved ?
@nikhilmaddirala 7 หลายเดือนก่อน
Do you think this could be combined with the "group of agents" framework you described in a previous video?
@HollyTroll 5 หลายเดือนก่อน
thank you so much for the info
@BobLazar-zm7wb 6 หลายเดือนก่อน
Thanks Jason I just subbed can you make a video to setup my own LLM to write a novel? thanks
@rawleyc 4 หลายเดือนก่อน
I want to train an LLM using videos and pictures. How can I achieve this? How do I make sure it can interpret the videos and pics?
@ConsultingjoeOnline 8 หลายเดือนก่อน
Great video. Thanks! A lot of very good tips!

ต่อไป

เล่นอัตโนมัติ

"I want Llama3.1 to perform 10x with my private knowledge" - Self learning Local Llama3.1 405B