For the very first time in my mechanical engineering life, I think i learned something in detail about software engineering! Thank you!
🎯 Key points for quick navigation:
00:02 *📹 The second video in the RAG from Scratch series focuses on indexing, a crucial component of RAG pipelines.*
00:28 *🔍 The goal of indexing is to retrieve documents related to a given question using numerical representations of documents.*
00:53 *📊 Numerical representations of documents are used for easy comparison and search, with approaches including sparse vectors and machine learning-based embedding methods.*
01:08 *💡 Embedding methods compress documents into fixed-length vectors that capture their meaning, allowing for efficient search and retrieval.*
02:03 *📈 Documents are split into smaller chunks to accommodate embedding models' limited context windows, and each chunk is compressed into a vector representation.*
Made with HARPA AI
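The chunking step mentioned at 02:03 can be sketched in a few lines of plain Python. This is a minimal fixed-size character splitter for illustration only; real pipelines typically use token-aware splitters (e.g. LangChain's RecursiveCharacterTextSplitter), and the chunk/overlap sizes here are arbitrary.

```python
def split_into_chunks(text: str, chunk_size: int = 200, overlap: int = 50) -> list[str]:
    """Split text into overlapping fixed-size chunks so that each chunk
    fits within an embedding model's limited context window."""
    chunks = []
    step = chunk_size - overlap  # advance less than chunk_size to keep overlap
    for start in range(0, len(text), step):
        chunks.append(text[start:start + chunk_size])
        if start + chunk_size >= len(text):
            break  # this chunk already reaches the end of the text
    return chunks

doc = "word " * 100  # a 500-character stand-in document
chunks = split_into_chunks(doc)
```

Each chunk would then be passed to the embedding model and compressed into one fixed-length vector, as the summary describes.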
Amazing series! Thank you Lance!
Thanks. Nice video, short and clear. But why do you need to store the embeddings in a vector DB?
For efficient similarity searches.
When performing RAG, you need to encode your input data into embeddings because that is what the LLM understands; it is from these embeddings that the model decodes the output and gives you the result you asked for. These embeddings need to be stored somewhere, and they can be small or very large. Some vector DBs are free, open source, and in-memory, like FAISS and Chroma; others are paid and hosted, like Pinecone.
@horyekhunley thanks for the insight
@@bald_ai_dev the process is a little different.
The LLM doesn't touch the embeddings. The embeddings are used to convert the documents into a form that can be compared to the question (which is also converted to an embedding) more quickly and accurately. This is done by an embeddings model (in this example, an embeddings model from OpenAI, referred to as OpenAIEmbeddings() in the code, is used). These embeddings and their associated documents need to be stored somewhere (in this example, Chroma is used). This is the indexing phase.
After comparing the embedding of the question against the stored documents, a subset of the documents with high similarity (in embedding space) to the question is given to the LLM. This is the retrieval phase.
Finally, the LLM uses the returned documents and its own knowledge to reason and give an answer to the user. This is the generation phase.
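The indexing/retrieval split described in that comment can be sketched in plain Python. This is a toy illustration, not the video's actual code: a bag-of-words `Counter` stands in for a real embedding model such as OpenAIEmbeddings, and a plain list stands in for a vector store like Chroma or FAISS.

```python
import math
from collections import Counter

def embed(text: str) -> Counter:
    # Toy stand-in for an embedding model: bag-of-words counts.
    return Counter(text.lower().split())

def cosine(a: Counter, b: Counter) -> float:
    # Cosine similarity between two sparse word-count vectors.
    dot = sum(a[w] * b[w] for w in a)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

# Indexing phase: embed every document once, up front,
# and store each embedding next to its document.
docs = ["cats are small pets", "the stock market fell today", "dogs are loyal pets"]
index = [(embed(d), d) for d in docs]

# Retrieval phase: embed the question and return the k most similar documents.
def retrieve(question: str, k: int = 2) -> list[str]:
    q = embed(question)
    ranked = sorted(index, key=lambda pair: cosine(q, pair[0]), reverse=True)
    return [d for _, d in ranked[:k]]
```

The generation phase would then pass `retrieve(question)` together with the question to the LLM. The key point is that the indexing work happens once, before any question arrives; only the cheap similarity comparison happens per query.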
@@sepia_tone Still didn't quite get what the role of indexing is here. Relevant documents are retrieved based on the embeddings of the split documents and the embedding of the question. So what is indexing doing here?
Still didn't understand what indexing is.
hey Lance, would you consider 'RAG' frameworks to be fairly Machiavellian?
Langchain, Python, Jupyter notebooks... I'm tired of people getting more and more abstracted from real knowledge of the systems they use.
You guys are GONE.
Your systems are so high-level, but you have no idea what you are using or doing. Guinea pigs... everything you work on is accessible, like you're in a glass house.
Don't you feel any remorse for supporting/advocating for systems that have no evident connection to what they are doing?
Honestly, children shouldn't even be given access to a smartphone until they can understand what is in their hands.
Same goes for all the child-adults here who are glossy-eyed over their shiny new high-level tools. You know how to use them, but have no idea how they work, or why it matters. How can you verify your data is safe with all of these layers upon layers of products/frameworks/toys/services?
Getting so far lost in all this without advocating for LOCAL, OPEN SOURCE, PRIVATE, MINIMALIST anything is lowkey disgusting. I hope more people start malding, because otherwise we will live in a society where one in a million people think for themselves... and everyone else just blindly follows shills.
They do have support for local LLMs, and they also support FAISS. Coming from search ads and machine learning, I'm just here to learn what this fuss is all about. Will let you know if there is anything new here that search ads doesn't already do. :)