Advanced RAG tutorial with Llamaindex & OpenAI GPT: Sentence Window Retrieval vs Basic Chunking

  • Published Sep 5, 2024

Comments • 32

  • @mikestaub · 8 months ago · +3

    This is currently the best RAG tutorial on the internet.

  • @unclecode · 8 months ago · +2

    Fascinating! Your approach to teaching and presenting is poetic. It is well organized, well explained, and well illustrated. Indeed, kudos to you. If I could, I would subscribe to your channel twice!

  • @sitedev · 8 months ago · +1

    Great video with an awesome, easy-to-follow explanation of RAG. Reminds me of a recent Andrej Karpathy video.

  • @victorfeight9644 · 4 months ago

    Clear, effective explanations. Thank you!

  • @ginisksam · 6 months ago

    LlamaIndex has a new version, 0.10 - I will migrate your code and learn at the same time.
    Thanks for introducing Sentence Window Retrieval. The most basic straight-split-and-retrieve/chat approach doesn't produce very meaningful results on our docs.
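
    For context on the 0.10 migration mentioned above: in LlamaIndex 0.10 the core package moved under llama_index.core and most integrations became separate pip packages, so imports from older tutorials need updating. A minimal sketch of the new-style imports (only the OpenAI integrations are shown; other integrations live in their own packages):

        # pre-0.10 style, no longer valid after the package split:
        # from llama_index import VectorStoreIndex, SimpleDirectoryReader

        # 0.10+ style: core functionality lives under llama_index.core
        from llama_index.core import VectorStoreIndex, SimpleDirectoryReader

        # integrations are installed separately, e.g.:
        # pip install llama-index-llms-openai llama-index-embeddings-openai
        from llama_index.llms.openai import OpenAI
        from llama_index.embeddings.openai import OpenAIEmbedding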

  • @danielvalentine132 · 8 months ago

    Great job explaining the window, how the vector store and doc store relate, and where the window lives. I’ve been trying to understand this aspect of LlamaIndex, and you made it very clear!

    • @hubel-labs · 8 months ago

      Yeah, it took me a while to figure that out too. Glad it helped you!
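
    For anyone else wondering where the window lives: a minimal sketch of the sentence-window pattern described in the video, using LlamaIndex 0.10-style imports (an illustration, not necessarily the exact code from the video; the data directory and query are placeholders). Only the single sentence is embedded into the vector store, while its surrounding window is carried as node metadata in the doc store and swapped back in at query time.

        from llama_index.core import SimpleDirectoryReader, VectorStoreIndex
        from llama_index.core.node_parser import SentenceWindowNodeParser
        from llama_index.core.postprocessor import MetadataReplacementPostProcessor

        # split documents into single-sentence nodes; each node keeps its
        # surrounding window (here +/- 3 sentences) in metadata
        parser = SentenceWindowNodeParser.from_defaults(
            window_size=3,
            window_metadata_key="window",
            original_text_metadata_key="original_text",
        )
        documents = SimpleDirectoryReader("data").load_data()
        nodes = parser.get_nodes_from_documents(documents)

        # only the sentence text is embedded; the window rides along as metadata
        index = VectorStoreIndex(nodes)

        # at query time, replace each retrieved sentence with its full window
        query_engine = index.as_query_engine(
            similarity_top_k=2,
            node_postprocessors=[
                MetadataReplacementPostProcessor(target_metadata_key="window")
            ],
        )
        print(query_engine.query("When did the events in the documents take place?"))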

  • @nazihfattal974 · 8 months ago

    As always, well prepared, easy to follow video that delivers a lot of information and value. Thank you!

  • @MaliRasko · 8 months ago · +1

    Your explanations and delivery are on point. Thank you for the excellent content and relaxed narration style.

  • @arjoai · 8 months ago

    Wow, that’s a wonderful piece of advice from such a talented professional in the field. Thank you 😊

  • @sayanbhattacharyya95 · 7 months ago

    Really cool video! Is there an "ideal" or "recommended" value of window_size?

  • @JOHNSMITH-ve3rq · 8 months ago · +1

    Wow - darn useful!!

  • @sivi3883 · 7 months ago

    Awesome video, explained very clearly! Thanks a ton!
    If I may ask, what tool do you use for those visual flows? Love it!

  • @nunoalexandre6408 · 8 months ago

    Love it!!!!!!!!!!!!!!!!!

  • @mayurmoudhgalya3840 · 8 months ago

    Another great video!

  • @natal10 · 8 months ago

    Great video!! Love it

    • @hubel-labs · 8 months ago · +1

      Thank you!!

  • @goonymiami · 8 months ago

    Seeing your explanation at around 09:30, it seems like we can only use K windows to serve as the knowledge base to answer a prompt. What if the prompt asks for information that is contained in more than K windows? For example, if I have several documents, each containing a bio of a person, and the user asks to sort those 10 people by age... how can it figure that out? I guess we can use a big value for K, if the cosine similarity engine can take it... but I am guessing providing too much context to the LLM will cost a lot of money?

    • @hubel-labs · 8 months ago

      Yeah, it wouldn’t work very well in that scenario. I wonder if perhaps there is a strategy where the LLM can determine the right K based on what it is trying to do. Or, instead of setting K to a fixed number, return all chunks where the cosine similarity is above a given threshold.
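
    One way to experiment with that threshold idea in LlamaIndex is a similarity-cutoff postprocessor. A minimal sketch, where index is an existing VectorStoreIndex and the 0.75 cutoff is an arbitrary illustrative value:

        from llama_index.core.postprocessor import SimilarityPostprocessor

        # cast a wide net with a large top-k, then keep only the chunks whose
        # cosine similarity clears the threshold, instead of a fixed K
        query_engine = index.as_query_engine(
            similarity_top_k=20,
            node_postprocessors=[SimilarityPostprocessor(similarity_cutoff=0.75)],
        )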

  • @Work_Pavan-mu9ye · 8 months ago

    Excellent video. Liked the workflow you showed in the beginning. What software are you using to create this workflow?

    • @hubel-labs · 8 months ago

      Excalidraw

  • @peteredmonds1712 · 8 months ago

    Small correction: embeddings are not a 1536-digit number but a vector of size 1536.

    • @hubel-labs · 8 months ago · +1

      Yes, you are right …. I was thinking of hashes for some reason! I’ll add a correction!
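
    For reference, the corrected statement is easy to verify. A minimal sketch assuming the openai Python client (v1+) with OPENAI_API_KEY set in the environment:

        from openai import OpenAI

        client = OpenAI()  # reads OPENAI_API_KEY from the environment
        resp = client.embeddings.create(
            model="text-embedding-ada-002",
            input="Sentence window retrieval",
        )
        vector = resp.data[0].embedding
        print(len(vector))   # 1536 -- a vector of 1536 floats
        print(vector[:3])    # a few floats, not one 1536-digit number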

  • @peteredmonds1712 · 8 months ago

    I'm a bit confused about the use case for re-ranking. Doesn't that defeat the purpose of the top-k search, in that we include all chunks, significantly increasing the number of tokens we use? Is the idea to do re-ranking with a smaller, cheaper LLM before sending the resultant top-k chunks to a more robust LLM?

    • @hubel-labs · 8 months ago · +3

      Yeah, it took me a while to understand it as well. It does end up using more tokens, but I guess that can be, as you said, mitigated with a lower-spec LLM. The idea is that embeddings are not a perfect representation of the text - they are a good way to sort through millions of text chunks efficiently, but the exact ranking within the top 20, for instance, may not be as good as using the text directly. So if you were to do reranking, you might use the embeddings to return j results (where j is somewhat larger than k) and then rerank those j results with an LLM into a final top k that you then pass to your user-facing LLM.
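
    A minimal sketch of the retrieve-j-then-rerank-to-k idea in LlamaIndex, where index is an existing VectorStoreIndex and j=20, k=3 are illustrative values (LLMRerank uses an LLM to rescore candidates before they reach the user-facing LLM):

        from llama_index.core.postprocessor import LLMRerank

        # embedding search returns j = 20 candidate chunks cheaply, then an LLM
        # reranks them down to the k = 3 chunks passed to the answering LLM
        query_engine = index.as_query_engine(
            similarity_top_k=20,
            node_postprocessors=[LLMRerank(top_n=3)],
        )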

  • @chiggly007 · 8 months ago

    Great video! Is the diagram anywhere to reference?

    • @hubel-labs · 8 months ago

      I’ll post it up tomorrow - need to fix an error

    • @grabani · 8 months ago

      @hubel-labs Any update on the diagram, please? Thanks! Great vid by the way.

    • @hubel-labs · 8 months ago · +1

      I added it to the description. Here is the link: link.excalidraw.com/readonly/m6DK7oyEFpyQnuw55DVP?darkMode=true

  • @user-jj1ff2gz7u · 8 months ago

    df

  • @jannessantoso1405 · 7 months ago

    It is 1784 in teahistory.txt and 1794 in chinahistory.txt, so it's a bit confusing.
    But anyway, great tutorial. Thanks!