How to Compare Multiple Large PDF Files Using AI (w/ Jerry Liu, Co-Founder of LlamaIndex)

แชร์
ฝัง
  • เผยแพร่เมื่อ 3 พ.ย. 2024

ความคิดเห็น • 29

  • @chatwithdata
    @chatwithdata  ปีที่แล้ว +3

    Timestamps:
    00:04 PDF demo (analysis of two annual reports)
    01:30 Visual overview of flawed architecture
    06:00 Jerry Liu, Llama index RAG overview
    18:34 Code walkthrough

  • @loicbaconnier9150
    @loicbaconnier9150 ปีที่แล้ว +3

    Great video
    but what if we have 200 documents and we woulld like to compare strategy of bank for example.
    let's suppose we have 20 banks in it. The retreiver will return to much document to deal with prompt max token number.
    How to deal with such use case please ?
    regards

  • @meetnehahere
    @meetnehahere ปีที่แล้ว +3

    But here you are comparing only two documents. Have you tried the performance of the model when asked to compare between more than 2 documents, say around 10-15 documents? Or even just asking if a particular information is present in a xyz document and just interacting with a single document when the vector database has been already uploaded with the embeddings from multiple documents with each document being super long (500-600 pages)?

    • @aaroldaaroldson708
      @aaroldaaroldson708 7 หลายเดือนก่อน

      this approach won’t work in this case. You need to look for agents, maybe?

  • @toluladeademisoye
    @toluladeademisoye ปีที่แล้ว +1

    Does this code work with other LLMs outside OpenAI's? Because i keep getting this error -Not Implemented Error for; response = s_engine.query("Compare and contrast the risk factors of Uber and Lyft?")
    print(str(response))

  • @nuclear_AI
    @nuclear_AI ปีที่แล้ว +3

    This is AMAZING content!
    Thanks for sharing 🤜🤛

  • @julianwelzel2328
    @julianwelzel2328 ปีที่แล้ว +1

    hey, just mentioning, looking at the code an openai key is exposed, not sure if this intentional. Thanks for the video!

  • @LuisRodriguez-j5n
    @LuisRodriguez-j5n 10 หลายเดือนก่อน

    Is this strictly used to compare the documents? I have 4 different PDF documents with information in them. The total is only 50 pages. Would it make sense to put them all into one PDF and use your approach that you used in your GPT-4 Tutorial: How to Chat With Multiple PDF Files (~1000 pages of Tesla's 10-K Annual Reports) video?

  • @e.l.5402
    @e.l.5402 ปีที่แล้ว +2

    Love your videos

  • @santosh_kumar_123
    @santosh_kumar_123 2 หลายเดือนก่อน

    How is the question being divided into sub question , wasn't clear about that

  • @KnightPeg
    @KnightPeg ปีที่แล้ว +1

    Hey, will the other stuff about tabular data etc. be coming in a future video?

  • @mikew2883
    @mikew2883 10 หลายเดือนก่อน

    Excellent video!👍

  • @AdinAlihodzic
    @AdinAlihodzic 11 หลายเดือนก่อน

    hey guys, does any one know of exmaples of llamaindex use cases where the source if returned/referenced in teh llm response. im building a complex multi document RAG and need to show the source of each answer even if taken across multiple documents?

  • @geekazoid47
    @geekazoid47 ปีที่แล้ว

    Great guest. Let your guest speak.

  • @PizzaLord
    @PizzaLord ปีที่แล้ว

    what is the link for the collab ?

    • @chatwithdata
      @chatwithdata  ปีที่แล้ว

      colab.research.google.com/drive/1Y_lUUKMdC627J5EP0dK1H8NveovpYisM?usp=sharing

    • @itsRevela
      @itsRevela 7 หลายเดือนก่อน

      did you manage to find what versions of langchain and llama-index to use? I'm having version dependency issues and I can't seem to resolve them

  • @thomasbrunelouellet4370
    @thomasbrunelouellet4370 ปีที่แล้ว

    wut ! what a surprize ! haha

  • @PizzaLord
    @PizzaLord ปีที่แล้ว

    This video is only 6 days old and the code is already giving warnings or am I missing something
    C:\Users\JJ\AppData\Local\Programs\Python\Python311\Lib\site-packages\langchain\__init__.py:39: UserWarning: Importing BasePromptTemplate from langchain root module is no longer supported.
    warnings.warn(
    Because of this line
    from llama_index import SimpleDirectoryReader, ServiceContext, VectorStoreIndex
    How do we get rid of these warnings?

    • @JerryLiu-ff8xv
      @JerryLiu-ff8xv ปีที่แล้ว

      yeah we'll have a fix for these warnings soon, thanks for flagging. but this shouldn't break your current flows

    • @PizzaLord
      @PizzaLord ปีที่แล้ว

      @@JerryLiu-ff8xv It doesn't. Please let us know when the update to make these warnings goes away is live though as it does not look good for client demos when you are showing other people

    • @PizzaLord
      @PizzaLord ปีที่แล้ว

      @@JerryLiu-ff8xv BTW where are the links to the slides?

  • @michamichalik9719
    @michamichalik9719 9 หลายเดือนก่อน +1

    Hey,
    Anyone worked with the notebook recently ?
    Im getting this error for following line:
    from llama_index import SimpleDirectoryReader, ServiceContext, VectorStoreIndex
    "ImportError: cannot import name 'BaseCache' from 'langchain.cache' (/usr/local/lib/python3.10/dist-packages/langchain/cache.py)"

    • @PrakharPartha-nz5qw
      @PrakharPartha-nz5qw 8 หลายเดือนก่อน +1

      Even I am getting the same error

    • @itsRevela
      @itsRevela 7 หลายเดือนก่อน

      i am getting the same error. haven't been able to find the correct versions yet

    • @san2sreshta
      @san2sreshta 7 หลายเดือนก่อน

      @michamichalik9719 - did you find a way to fix the issue?