Pgai Tutorial - Vector Embeddings in PostgreSQL Made Easy

Dave Ebbelaar

มุมมอง 5 169

เพิ่มลงใน
- เพลย์ลิสต์ของฉัน
- ดูภายหลัง
แชร์

แชร์

ฝัง

ขนาดวิดีโอ:

แสดงแผงควบคุมโปรแกรมเล่น

เล่นอัตโนมัติ

เล่นใหม่

เผยแพร่เมื่อ 3 ก.พ. 2025

ความคิดเห็น • 21

@rembautimes8808 2 หลายเดือนก่อน
I’ve been waiting for a video like this as I was quite interested in using timescale db
@ClimateDS 2 หลายเดือนก่อน
If you already have a Postgresql database, this makes sense, but if you are starting from scratch, you could also use vector databases that allow hybrid search, which most do. and this also allows for more options in respect to embedding models which could be very critical for a given use case.
@Arviddehaas 2 หลายเดือนก่อน
Interesting take on embedding management! Love the sync mechanics. What would you say are the main commonalities and differences between pgvector, pgvectorscale and pgai?
@MarkBekooy หลายเดือนก่อน
Cool video! At around 8:35 you have a Python file open and select some lines which are then run and shown in an interactive Python environment on the right side. This is also awesome! How can this be done? Is this a VS Code extension?
Edit: Or do you run the lines by highlighting them and the pressing shift + enter?
@AlexSurf2008 2 หลายเดือนก่อน
Excellent! 🙏for sharing!
@daveebbelaar 2 หลายเดือนก่อน
My pleasure!
@jonaaapschl1204 2 หลายเดือนก่อน
Great video! Can you make a video about security for RAG-Applications, options to avoid exploits etc?
@zeelthumar 2 หลายเดือนก่อน
Keep it up bro
@sriganeshnagaraj9626 2 หลายเดือนก่อน
Awesome
@sndrstpnv8419 2 หลายเดือนก่อน ⁺³
where is lexical search ? even a separate lexical search index ?
@daveebbelaar 2 หลายเดือนก่อน
Check out this video: th-cam.com/video/TbtBhbLh0cc/w-d-xo.html
@dantedocerto หลายเดือนก่อน
is there a way to not use OPENAI for the embedding?
@durand101 2 หลายเดือนก่อน
This is really great! I've been running into so many limitations using chromadb and redis (with redisearch). ChromaDB in particular uses a lot of memory as the number of embeddings increases, which gets very expensive to host. Is that also the case with pgai? Is it affordable? Also, random question, but what vscode extension are you using to run the script as a jupyter notebook at th-cam.com/video/8oTnUtFYAes/w-d-xo.htmlsi=J21f6uJF09ipZ7sj&t=903?
@Bøølæn 2 หลายเดือนก่อน
How efficient and feasible pgvector store i wanna scale it to production?!
@daveebbelaar 2 หลายเดือนก่อน ⁺¹
It can handle production workloads with millions of vectors using pgvectorscale
@Bøølæn 2 หลายเดือนก่อน
@daveebbelaar also if i use timescale db image, wouldn't it cause any trouble down the line?
@daveebbelaar 2 หลายเดือนก่อน
@ nope, its just Postgres in the backend with added extensions.
@mamadou-diandjalo6723 2 หลายเดือนก่อน ⁺¹
First #notificationGang
@daveebbelaar 2 หลายเดือนก่อน
You fast haha!
@lLvupKitchen 21 วันที่ผ่านมา
Can I do this with celery worker with more control over chunking logic / metadata handling or is this something that pgai can already do?
I'm using Supabase, which doens't support pgvectorscal nor pgai. Not sure if I should host just plain postgreSQL with these extensions rather than using Supabase with pgvector extension, which they support.
Seems like handling embedding related coding is more managable than handling vanilla postgreSQL though.
@lLvupKitchen วันที่ผ่านมา
Turns out, celerey worker can do this. Although, it's quite tricky to implement when the database gets updated very frequently.

ต่อไป

เล่นอัตโนมัติ

OHANA บ้าพลัง EP.134 : เกมการ์ดโอฮาน่า X วัยหนุ่ม 2544