Build Contextual Retrieval with Anthropic and Pinecone

Comments • 11

  • @kenchang3456
    @kenchang3456 6 days ago +2

    Truly impressive.

  • @ylazerson
    @ylazerson 1 day ago +1

    amazing!

  • @IzzyLazerson-q3z
    @IzzyLazerson-q3z 1 day ago +1

    awesome!

  • @BrentPeluso
    @BrentPeluso 8 days ago +1

    This is amazing. Do you have more tutorials, or is there somewhere I can find more resources on Anthropic, Pinecone, and n8n? Thank you!

    • @pinecone-io
      @pinecone-io  6 days ago

      We're coming out with more content all the time! Is there a specific subtopic or use case for this combo that you're enthusiastic about?

  • @PedroNihwl
    @PedroNihwl 22 days ago +1

    I don't know if I'm implementing 'prompt caching' incorrectly, but in my case processing each chunk takes too long (about 20s) on a 30-page file. At that speed, this approach becomes unfeasible.

    • @emiliod90
      @emiliod90 22 days ago +1

      Hi Pedro, if I may ask: how big is each chunk you are sending to Claude (in tokens), and in what format is it sent (text or images)? Also, what kind of task are you asking it to perform when generating context (e.g., a simple summary versus a more intensive analysis)? Have you tried experimenting with smaller versus larger chunks, or with constraints on the outputs? Have you tried Haiku versus Sonnet? All of these factors, and more, play a role in the latency you see.
      You may also find speed optimisations in your application code. If you are looking for lightning-fast latency, you can run the calls asynchronously or use a thread pool for concurrent requests, and, if your use case allows it, run them in parallel with a library like multiprocessing (see the asyncio sketch after this thread). The limiting factor will be the rate limits of the Claude API.

    • @PedroNihwl
      @PedroNihwl 22 days ago

      @@emiliod90 Hi, thanks for responding.
      The entire document (35 pages) has about 14K tokens, and each chunk has around 490 tokens. I'm trying to perform a simple task like "Chat with your PDF" by adding context to each chunk, but the processing time for each request makes it unfeasible. I agree that I could run the requests in parallel, but it's still strange that each request takes about 20 seconds, even with the "Prompt Caching" feature enabled. I believe I'm doing something wrong during the calls, and the cache isn't working.
      Is there any example project I can run locally?

    • @emiliod90
      @emiliod90 22 days ago

      @@PedroNihwl Hey Pedro, no problem. We are building a similar chat-with-your-PDF knowledge base at my work, so I am also very interested. I will share what we have learnt and hopefully it can help you (a prompt-caching sketch follows this thread). I unfortunately can't share our code base, but almost anything you find on YouTube and Google in the PDF-chat and RAG domain is good. I found the channels Prompt Engineering, TwoSetAI and AI Engineer helpful, particularly Jerry Liu.
      Now, to answer your specific question: just so I understand, do you need the pre-processing (extracting context, then embedding it into a vector store) to be faster, or are you referring to the time needed once you've already retrieved a chunk and sent it to the LLM to await a response?
      It might also be worth describing your overall architecture so I can understand how you're tackling the issue.
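Since the cache question above goes unresolved in the thread, here is a minimal sketch of how the per-chunk contextualization call is commonly structured with the Anthropic Python SDK so that the full document is cached and only the chunk varies between requests. The model name, prompts, and file handling are illustrative assumptions rather than the exact setup from the video, and older SDK versions may additionally require the prompt-caching beta header or namespace.

```python
# Sketch: generate situating context for each chunk while caching the full
# document prefix, so repeated calls read from the cache instead of
# re-processing the whole ~14K-token document every time.
# Assumes the `anthropic` SDK is installed and ANTHROPIC_API_KEY is set;
# model name, file name, and prompt wording are illustrative.
import anthropic

client = anthropic.Anthropic()

document_text = open("document.txt").read()   # the full document
chunks = ["...chunk 1...", "...chunk 2..."]   # your ~490-token chunks

def contextualize(chunk: str) -> str:
    response = client.messages.create(
        model="claude-3-5-sonnet-20241022",
        max_tokens=200,
        system=[
            {
                "type": "text",
                "text": f"<document>\n{document_text}\n</document>",
                # Mark the document prefix as cacheable. It must be
                # byte-for-byte identical on every call, or the cache is missed.
                "cache_control": {"type": "ephemeral"},
            }
        ],
        messages=[
            {
                "role": "user",
                "content": (
                    "Here is a chunk from the document above:\n"
                    f"<chunk>\n{chunk}\n</chunk>\n"
                    "Write a short context situating this chunk within the "
                    "document, to improve search retrieval. Answer with the "
                    "context only."
                ),
            }
        ],
    )
    # Verify caching actually happened: the first call should report
    # cache_creation_input_tokens > 0, later calls cache_read_input_tokens > 0.
    print(response.usage)
    return response.content[0].text

contextualized_chunks = [f"{contextualize(c)}\n\n{c}" for c in chunks]
```

If the usage numbers never show cache reads, common causes are a cached prefix below the minimum cacheable size (around 1,024 tokens for Sonnet), a prefix that changes between calls, or gaps of more than roughly five minutes between calls, after which the ephemeral cache expires.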
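And a companion sketch of the asynchronous approach suggested in the thread: the SDK's async client with a semaphore so the number of in-flight requests stays within your rate-limit tier. The concurrency limit, model name, and prompt wording are placeholders.

```python
# Sketch: contextualize many chunks concurrently with AsyncAnthropic.
# MAX_CONCURRENCY is a placeholder; tune it to your rate-limit tier.
import asyncio
import anthropic

client = anthropic.AsyncAnthropic()
MAX_CONCURRENCY = 5
semaphore = asyncio.Semaphore(MAX_CONCURRENCY)

async def contextualize(document_text: str, chunk: str) -> str:
    async with semaphore:  # cap in-flight requests
        response = await client.messages.create(
            model="claude-3-5-sonnet-20241022",
            max_tokens=200,
            system=[{
                "type": "text",
                "text": f"<document>\n{document_text}\n</document>",
                "cache_control": {"type": "ephemeral"},
            }],
            messages=[{
                "role": "user",
                "content": f"Situate this chunk within the document:\n{chunk}",
            }],
        )
        return response.content[0].text

async def main(document_text: str, chunks: list[str]) -> list[str]:
    # Tip: issue one call first to warm the cache before fanning out;
    # otherwise the first wave of parallel calls may each pay the full
    # uncached input cost.
    await contextualize(document_text, chunks[0])
    return list(await asyncio.gather(
        *(contextualize(document_text, c) for c in chunks)
    ))

# contexts = asyncio.run(main(document_text, chunks))
```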

  • @DownunderGraham
    @DownunderGraham 2 months ago +2

    Contextual retrieval looks like it will definitely help with one of the downsides of RAG (i.e., losing context in a chunk); a short sketch of this contextualize-then-embed step follows this thread. The other approach I have been looking into quite a bit is the combination of RAG and knowledge graphs. I wonder if combining knowledge graphs with contextual retrieval for RAG would be even better.

    • @pinecone-io
      @pinecone-io  2 months ago

      you might be interested in th-cam.com/video/ubtLxr7B1Vc/w-d-xo.html
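For completeness, a condensed sketch of the contextual-retrieval step discussed in this thread: the context generated for each chunk is prepended to the chunk before embedding and upserting into Pinecone, so the stored vector reflects what the chunk means within the whole document. The index name, namespace, and `embed()` helper are placeholders, not the exact setup from the video.

```python
# Sketch: prepend the generated context to each chunk, embed, and upsert to
# Pinecone. `embed()` stands in for whatever embedding model you use; the
# index name and namespace are illustrative.
from pinecone import Pinecone

pc = Pinecone(api_key="YOUR_API_KEY")
index = pc.Index("contextual-retrieval-demo")

def embed(text: str) -> list[float]:
    raise NotImplementedError("plug in your embedding model here")

def upsert_contextualized_chunks(chunks: list[str], contexts: list[str]) -> None:
    vectors = []
    for i, (chunk, context) in enumerate(zip(chunks, contexts)):
        # The context generated by Claude is prepended so the embedding
        # captures the chunk's role in the full document.
        contextualized = f"{context}\n\n{chunk}"
        vectors.append({
            "id": f"chunk-{i}",
            "values": embed(contextualized),
            "metadata": {"text": chunk, "context": context},
        })
    index.upsert(vectors=vectors, namespace="example")
```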