LlamaIndex Sessions: Practical Tips and Tricks for Productionizing RAG (feat. Sisil @ Jasper)

แชร์
ฝัง
  • เผยแพร่เมื่อ 17 ก.พ. 2024
  • In this video, Sisil Mehta (ML eng @, Jasper) walks through practical tips and tricks that his team implemented for productionizing a RAG system at Jasper.ai, backed by LlamaIndex abstractions.
    These tricks include the following:
    1. Picking a proper PDF parser that can maintain semantic structure, parse text from tables/images, and be represented as XML or Markdown
    2. Adding the right "layers" of metadata; besides global document context, also inject summary context from "sub-documents" to more precisely localize context.
    3. Hybrid fusion between different retrieval methods
    4. LLM-powered reranking. Reduce token usage by reranking summaries that reference underlying chunks.
    5. Use XML and emotion prompting to get well-structured outputs free of hallucinations

ความคิดเห็น • 14

  • @philipagenmonmen
    @philipagenmonmen 2 หลายเดือนก่อน

    Super comprehensive. Thanks for this.

  • @seanbergman8927
    @seanbergman8927 4 หลายเดือนก่อน +4

    This was an excellent talk. Thanks so much for sharing your experience and this RAG framework. If there could be a follow up sometime with a sample notebook that uses these techniques, and a code walkthrough video , I’m sure many people would greatly benefit from it.

  • @user-hq6or3fh9d
    @user-hq6or3fh9d 5 หลายเดือนก่อน +1

    Sisil's presentation was exemplary, addressing key pain points with innovation.
    Congratulations on your work!. We as a company also trying to solve all pain points you have mentioned in the pdf docs area.
    Thanks to Jerry for spotlighting talent like Sisil. Excited for more content!

  • @bszhang2115
    @bszhang2115 5 หลายเดือนก่อน +1

    a showcase sample notebook would be deeply appreciated!

  • @falven
    @falven 3 หลายเดือนก่อน

    Would have loved if you shared the slides in the description. :)

  • @chenrannight2272
    @chenrannight2272 5 หลายเดือนก่อน

    amazing!!! Sisil really explained the difference between benchmarks and real world openended questions!
    btw, could you include the datasets name Sisil talked about regarding "retrieval" and "reranking" on evaluation?

  • @karthikmukundakrishnan7853
    @karthikmukundakrishnan7853 5 หลายเดือนก่อน +1

    Is there a sample code you can post a link to, specifically for indexing the subdocs, then the chunks and retrieving them?

  • @jeevan999able
    @jeevan999able 5 หลายเดือนก่อน

    this was fantastic, can you provide a tutorial notebook?

  • @jeevan999able
    @jeevan999able 5 หลายเดือนก่อน +1

    how does lexical indexing work for subdocs

  • @qingsongyao4974
    @qingsongyao4974 5 หลายเดือนก่อน +2

    I wonder what is the PDF parse Sisil 's team is using?

    • @bram1223
      @bram1223 5 หลายเดือนก่อน +1

      Adobe Extract PDF API or something. I haven't tested it but it works pretty well from what I've heard.

    • @user-tb5il8dm2r
      @user-tb5il8dm2r 4 หลายเดือนก่อน

      There is an opensource version as well (Works in Linux Distro) called unstructured. You just need to do pip install unstructured[all].

  • @user-rb8de9ds3l
    @user-rb8de9ds3l 5 หลายเดือนก่อน +2

    I didn't catch what was the pdf parser used? Can you name it please?

    • @jochenade
      @jochenade 5 หลายเดือนก่อน

      I think it was adobe api