5 tiers of long-term memory and personalization for LLM applications (in-person workshop)

Introducing Exa Websets - A breakthrough toward perfect search

كورس هندسة التلقين | Prompt Engineering MasterClass

กินขนมมั้ยจ้ะน้อง หนมน้า😝

มายคราฟ แต่ ผมห้ามตาย..!!! #minecraft #พี่เก้า #มายคราฟ #minecraftmtr

#อึ้ง!เหลือจะเชื่อ!ไทยพลิกนรกดับสิงคโปร์คาบ้าน ทะลุเข้ารอบรองชนะเลิศ! คารวะอิชิอิโคตรการเปลี่ยนแปลง!

Don't naive RAG do hybrid search instead (Pinecone Weaviate or pgvector + full text search & rerank)

LLMs for Devs

มุมมอง 10 737

เพิ่มลงใน
- เพลย์ลิสต์ของฉัน
- ดูภายหลัง
แชร์

แชร์

ฝัง

ขนาดวิดีโอ:

แสดงแผงควบคุมโปรแกรมเล่น

เล่นอัตโนมัติ

เล่นใหม่

เผยแพร่เมื่อ 7 ม.ค. 2025

ความคิดเห็น •

@devlearnllm 5 หลายเดือนก่อน ⁺⁴
Hey yall, in case you didn't get good full text search results like me, the CEO of Supabase (Paul Copplestone) sent me this to use instead: supabase.com/docs/guides/database/extensions/pgroonga
@alienPear 4 หลายเดือนก่อน ⁺¹
Thanks for sharing, bro! Greetings from Colombia
@devlearnllm 4 หลายเดือนก่อน
My pleasure!
@pabloarroyo7952 3 หลายเดือนก่อน ⁺²
Watching this 2 months later. Great video, thanks for sharing
@devlearnllm 3 หลายเดือนก่อน
Glad you enjoyed it!
@blackswann9555 2 หลายเดือนก่อน ⁺¹
3 Months im here and enjoying
@UnemployMan396-xd7ov หลายเดือนก่อน
4 months here love it gonna put this in my graduate thesis
@JamesRBentley 5 หลายเดือนก่อน ⁺¹
Nice video sir. I have already been experimenting with the colab - sincerest thanks
@devlearnllm 5 หลายเดือนก่อน
Great to hear!
@gregmeldrum 5 หลายเดือนก่อน ⁺¹
Very informative! A great resource. Thanks for sharing your wealth of knowledge!!
@ironbondar 5 หลายเดือนก่อน ⁺¹
very good workshop. straight to the point
@Phoenix-gi3gu 4 หลายเดือนก่อน ⁺¹
For experimenting I would recommend using no database at all. You can simply use the cosine similarity (i.e. from torch functional) or quickly implement it and you are nearly done. Just use some argsort to get the best matches. It's like five lines of code or so. For easy store/load you can use pickle to serialize/unserialize the object that holds the embeddings. It is fast on CPU too, but of course you can run it on GPU without any bigger changes.
No services required.
@devlearnllm 4 หลายเดือนก่อน
good point
@oamarkanji3153 3 หลายเดือนก่อน
Incredible content. Thank you.
@devlearnllm 3 หลายเดือนก่อน
Much appreciated!
@ThoughtfullySo 5 หลายเดือนก่อน ⁺¹
You should've tried Qdrant.
@ofrylivney367 5 หลายเดือนก่อน ⁺²
Nice workshop! I'll definitely try out the hybrid search. Do you recon it'll work with nomic text embeddings and ollama?
@devlearnllm 5 หลายเดือนก่อน
Most likely!
@magnusjensen5867 3 หลายเดือนก่อน ⁺¹
Nice workshop, thank you for sharing! You mentioned early on that you tried decomposing your queries if they were multi-hop queries / abstract queries. Would you still suggest that approach or is there any new research specifically on this matter? Imagine a query in which a user want to retrieve information from multiple documents at get a comparison or summarization.
@devlearnllm 2 หลายเดือนก่อน ⁺²
I'm still doing the same for my app, and what I'm hoping to do eventually is to prompt the query expansion step so it's expanding in a coherent way. E.g question is about how X affects Y -> find X, find Y
@magnusjensen5867 2 หลายเดือนก่อน
@@devlearnllm Thank you for your response. How exactly would you go about this? Have you played with knowledge graph (GraphRAG) like Neo4j etc?
@samarthsaraogi6088 3 วันที่ผ่านมา
How can we store the fitted model? I want to use the fitted BM25 model repeatedly on my app. Is there a way to keep it?
@SandeeshCroos 2 หลายเดือนก่อน
Hey, great content! Thanks for sharing your knowledge. However, instead of just using tsvector in PostgreSQL, you can leverage sparse vector search by utilizing the pg_search extension, right?
@devlearnllm 2 หลายเดือนก่อน ⁺¹
yup, they're both full text search. Or use pgroonga
@ajkdrag 4 หลายเดือนก่อน ⁺¹
Hi. I have a video request. Is there a way to contact you?
@devlearnllm 4 หลายเดือนก่อน
tally.so/r/n9djRQ
@ajkdrag 4 หลายเดือนก่อน
@@devlearnllm done. Thanks
@vijishmadhavan6093 4 หลายเดือนก่อน
what happens if we use all the 25000 cases, will it work?
@devlearnllm 4 หลายเดือนก่อน
Most likely. Pinecone, Weaviate and pgvector are very performant.
@zuowang5185 5 หลายเดือนก่อน
Is Openai embedding v3 model better than Bert?
@devlearnllm 5 หลายเดือนก่อน
Hard to tell unless experiments are run.
huggingface.co/spaces/mteb/leaderboard
@artur50 5 หลายเดือนก่อน
is it possible to run it with Ollama?
@devlearnllm 5 หลายเดือนก่อน
Most likely
@ArunKumar-bp5lo 5 หลายเดือนก่อน ⁺¹
great
@flor.7797 5 หลายเดือนก่อน
I just use Google 🙃

ต่อไป

เล่นอัตโนมัติ

5 tiers of long-term memory and personalization for LLM applications (in-person workshop)

5 tiers of long-term memory and personalization for LLM applications (in-person workshop)

Introducing Exa Websets - A breakthrough toward perfect search

Introducing Exa Websets - A breakthrough toward perfect search

كورس هندسة التلقين | Prompt Engineering MasterClass

كورس هندسة التلقين | Prompt Engineering MasterClass

กินขนมมั้ยจ้ะน้อง หนมน้า😝

กินขนมมั้ยจ้ะน้อง หนมน้า😝

มายคราฟ แต่ ผมห้ามตาย..!!! #minecraft #พี่เก้า #มายคราฟ #minecraftmtr

มายคราฟ แต่ ผมห้ามตาย..!!! #minecraft #พี่เก้า #มายคราฟ #minecraftmtr

#อึ้ง!เหลือจะเชื่อ!ไทยพลิกนรกดับสิงคโปร์คาบ้าน ทะลุเข้ารอบรองชนะเลิศ! คารวะอิชิอิโคตรการเปลี่ยนแปลง!

#อึ้ง!เหลือจะเชื่อ!ไทยพลิกนรกดับสิงคโปร์คาบ้าน ทะลุเข้ารอบรองชนะเลิศ! คารวะอิชิอิโคตรการเปลี่ยนแปลง!

Apko konsa RC Bus Accah laga

Apko konsa RC Bus Accah laga

How to scrape the web for LLM in 2024: Jina AI (Reader API), Mendable (firecrawl) and Scrapegraph-ai

How to scrape the web for LLM in 2024: Jina AI (Reader API), Mendable (firecrawl) and Scrapegraph-ai

Comparing 10 different models, including Gemini Flash 2 0, Grok, Claude, GPT, Llama for OCR

Comparing 10 different models, including Gemini Flash 2 0, Grok, Claude, GPT, Llama for OCR

Why Build Enterprise RAG with Postgres?

Why Build Enterprise RAG with Postgres?

Marek Jílek: Hey ADCS, gimme DA!

Marek Jílek: Hey ADCS, gimme DA!

Setup your first LLM observability traces with LangSmith and iterate on prompts with Quotient AI

Setup your first LLM observability traces with LangSmith and iterate on prompts with Quotient AI

Agentically scrape the web with Firecrawl & LangGraph (LangChain)

Agentically scrape the web with Firecrawl & LangGraph (LangChain)

The missing pieces to your AI app (pgvector + RAG in prod)

The missing pieces to your AI app (pgvector + RAG in prod)

Weaviate Explained: Understanding Schema & Querying: Optimizing Results Using Querying & Filtering

Weaviate Explained: Understanding Schema & Querying: Optimizing Results Using Querying & Filtering

I Made a FAST Search Engine

I Made a FAST Search Engine

【หนังพากย์ไทย】ยอดฝีมือสังหารนักโทษ แต่นักโทษเป็นปรมาจารย์กังฟูที่ซ่อนอยู่ เขาจัดการทั้งหมดในทันที

【หนังพากย์ไทย】ยอดฝีมือสังหารนักโทษ แต่นักโทษเป็นปรมาจารย์กังฟูที่ซ่อนอยู่ เขาจัดการทั้งหมดในทันที

#เดอะตุ๊ก !! เจาะเดือด ทีมชาติ ผ่าฟอร์ม !! ทีมชาติไทย มันส์ เปิด สาเหตุ !! ระบบ+แท็คติก

#เดอะตุ๊ก !! เจาะเดือด ทีมชาติ ผ่าฟอร์ม !! ทีมชาติไทย มันส์ เปิด สาเหตุ !! ระบบ+แท็คติก

ทำผิดกฏหมาย 100 ข้อ ในวันเดียว!!

ทำผิดกฏหมาย 100 ข้อ ในวันเดียว!!

#นายกแพทองธาร ลงพื้นที่มอบถุงยังชีพ บริเวณ ซ.พัฒนาการคูขวาง ๑๐ (ถ.ท่าโพธิ์) จ.นครศรีธรรมราช

#นายกแพทองธาร ลงพื้นที่มอบถุงยังชีพ บริเวณ ซ.พัฒนาการคูขวาง ๑๐ (ถ.ท่าโพธิ์) จ.นครศรีธรรมราช

How Strong Is Tape?

How Strong Is Tape?

#โด่งดัง!ญี่ปุ่นซูฮก บอลอาเซียนเร้าใจ!! โค๊ชสิงคโปร์พูดแบบนี้ถึงไทย!! มาเลย์ขอบคุณไทยที่ให้ชีวิต..?

#โด่งดัง!ญี่ปุ่นซูฮก บอลอาเซียนเร้าใจ!! โค๊ชสิงคโปร์พูดแบบนี้ถึงไทย!! มาเลย์ขอบคุณไทยที่ให้ชีวิต..?

รวม10 เจ้าพ่อบ้านใหญ่! ลุ้น "โกทร" เกมหรือรอด? : 14-12-67 | iNN Top Story

รวม10 เจ้าพ่อบ้านใหญ่! ลุ้น "โกทร" เกมหรือรอด? : 14-12-67 | iNN Top Story

ตรวจหวยงวดวันที่ 16 ธันวาคม 2567 พร้อมรางวัล N3 รางวัลพิเศษ รางวัล 2 ตัว : Matichon Online

ตรวจหวยงวดวันที่ 16 ธันวาคม 2567 พร้อมรางวัล N3 รางวัลพิเศษ รางวัล 2 ตัว : Matichon Online