Search with LLMs and vector embeddings

แชร์
ฝัง
  • เผยแพร่เมื่อ 18 ต.ค. 2024

ความคิดเห็น • 20

  • @howardlock6430
    @howardlock6430 ปีที่แล้ว +1

    Vivek - Thank you for this 'first principles' explanation. Very helpful indeed!

  • @The_Pavanputra
    @The_Pavanputra ปีที่แล้ว

    The best video on llm so far

  • @renhit2005
    @renhit2005 ปีที่แล้ว

    I went through few videos, where people directly jump in to code and go through line by line, It took me 2 - 3 videos to understand underlying design pattern. Your explanation has summed it up again for me with better clarity. Thank you.

    • @VivekHaldar
      @VivekHaldar  ปีที่แล้ว

      Thanks for the kind words -- glad it helped!

  • @mrinalde
    @mrinalde ปีที่แล้ว +1

    Thanks Vivek. It is important that everyone understands this new paradigm better.

  • @chromashift
    @chromashift ปีที่แล้ว +2

    informative; other videos have stated how VDB work, via embeddings;
    but the "design pattern" as you've taught it was the missing piece;
    sure it can be untuited, but having the clarity you've brought will likely save many some time.
    thanks.

  • @sawri
    @sawri ปีที่แล้ว +1

    Awesome, this approach is very promising!

  • @nomad_citizen
    @nomad_citizen ปีที่แล้ว +1

    amazing explanation. Simple and clean! thanks! subscribed :D

  • @shuoyang9085
    @shuoyang9085 ปีที่แล้ว +3

    Great video, Vivek! I wonder if we can make further tuning to make this Q&A use case into an article drafting use case. Love to hear your thoughts or even a video sharing your idea on that.

  • @ghassenhedhili2708
    @ghassenhedhili2708 10 หลายเดือนก่อน

    Great video, Mr. Vivek! Your insights on using Language Models for search are fantastic. I'm working on a project and considering applying this approach to enhance resume matching with job descriptions. Any quick tips for this application?
    Thanks!

    • @VivekHaldar
      @VivekHaldar  10 หลายเดือนก่อน

      That would be a cool application. You could try something along the lines of treating the job descriptions as docs (put them in the vector db) and resumes as "queries", but maybe after extracting key skills etc from them using a summarization prompt.

  • @user-ig5go6qo3n
    @user-ig5go6qo3n ปีที่แล้ว

    This was fantastic thanks Vivek, I'm completely non technical yet was able to understand most of this.
    One question if I may: Let's say I've got a database that contains information on 100,000 companies (roughly 500 words on each company). Is there any reason why I wouldn't be able to query that database with something like 'give me a list of all companies in the database that are in the engineering industry' and get an accurate answer? Or would it still not be 100% accurate.

    • @VivekHaldar
      @VivekHaldar  ปีที่แล้ว

      Thanks for the kind words!
      What you will get is a measure of "similarity" (vector similarity, to be precise) between words in your query and words in your rows of your database. That embedding (mapping of words to an n-dimensional vector) is learned, not a deterministic lookup like a SQL query, and the query is usually open-ended, so there isn't a concept of "100% accurate".

    • @user-ig5go6qo3n
      @user-ig5go6qo3n ปีที่แล้ว

      @@VivekHaldar Thanks very much for the reply Vivek, and please keep these quality videos coming!

  • @saiveeramalla5507
    @saiveeramalla5507 7 หลายเดือนก่อน

    Are you going to launch any course on Deep Learning involving everything? From end to end...if yes would like to join..

    • @VivekHaldar
      @VivekHaldar  6 หลายเดือนก่อน

      Flattered that you asked but probably not.

  • @koltracker
    @koltracker ปีที่แล้ว

    why don't we directly send the query to LLM, what's the benefit here using a vector DB?

    • @VivekHaldar
      @VivekHaldar  ปีที่แล้ว +2

      So what we can answer queries over a corpus of documents that was 1) not in the training set of the LLM and 2) is too large to fit into the context window.

    • @koltracker
      @koltracker ปีที่แล้ว

      ah, so it extends the original query, what would be example for this? if i may ask

  • @vijaybrock
    @vijaybrock 6 หลายเดือนก่อน

    Give me the best RAG pipeline approach to chat with 10-K reports of different companies for the past 10 to 15 years accumulated count can range between 50 - 100 pdf files of 10K reports.