Ollama with Vision - Enabling Multimodal RAG

  • Published on Nov 25, 2024

Comments • 16

  • @throwaway-g9f
    @throwaway-g9f 18 days ago +4

    This video got me hyped; I was waiting for ollama multi-modal for a long time.

  • @TheRealHassan789
    @TheRealHassan789 18 days ago +3

    this video and tools have so much value! ...people will sleep on it tho

  • @samsquamsh78
    @samsquamsh78 18 days ago +1

    great video and cool project! will check it out!! thanks!

  • @faucetcryptos8148
    @faucetcryptos8148 18 days ago +1

    Very cool

  • @stunspot
    @stunspot 18 days ago

    Neat!

  • @HappyDancerInPink
    @HappyDancerInPink 18 days ago +1

    Nice, what GPU do you use for these tests?

    • @engineerprompt
      @engineerprompt 17 days ago +2

      I have a MacBook Pro M2 Max with 96GB unified memory

  • @ChristopherMcKinley-c1s
    @ChristopherMcKinley-c1s 18 days ago

    Is API calling planned for this project? I would love to be able to use it as a replacement/upgrade from fine-tuning models and running them from Ollama.

    • @truthwillout1980
      @truthwillout1980 18 days ago

      ???

    • @ChristopherMcKinley-c1s
      @ChristopherMcKinley-c1s 18 days ago

      @truthwillout1980 The idea in my head is that I can host this on the LAN and have other programs just make an API call, so they don't have to go through a GUI. Is that already an option and I missed it?

    • @truthwillout1980
      @truthwillout1980 18 days ago

      @ChristopherMcKinley-c1s Yes, you should already be able to do that. I think there's a section in the video that explains it, in fact (though I'm going off memory; I haven't watched it again). Just spin it up on a port number and call it.
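
For the LAN scenario discussed in this thread, here is a minimal sketch of calling Ollama's own REST endpoint (`/api/generate`) from another program. The thread does not say which endpoints the project itself exposes, so this talks to Ollama directly. The host address, the model name `llama3.2-vision`, and the helper names `build_payload` and `ask` are assumptions for illustration only.

```python
import base64
import json
import urllib.request


def build_payload(prompt, image_bytes=None, model="llama3.2-vision"):
    """Build the JSON body for Ollama's /api/generate endpoint.

    The model name is an assumption; use whichever vision model is pulled.
    """
    payload = {"model": model, "prompt": prompt, "stream": False}
    if image_bytes is not None:
        # Ollama expects images as a list of base64-encoded strings.
        payload["images"] = [base64.b64encode(image_bytes).decode("ascii")]
    return payload


def ask(host, prompt, image_bytes=None, timeout=120):
    """POST the prompt to an Ollama server at `host` and return its reply text."""
    request = urllib.request.Request(
        f"http://{host}:11434/api/generate",  # 11434 is Ollama's default port
        data=json.dumps(build_payload(prompt, image_bytes)).encode("utf-8"),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read())["response"]
```

A caller on another machine might run `ask("192.168.1.50", "Describe this page", open("page.png", "rb").read())`, where the IP address is a placeholder. By default Ollama listens only on localhost, so the server would likely need `OLLAMA_HOST=0.0.0.0` set before other machines on the LAN can reach it.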

  • @Masoud2xm
    @Masoud2xm 11 days ago

    I am getting an error during indexing saying "Torch not compiled with CUDA enabled". I am using Mac M4. Could you help with this, please?

    • @timstevens3361
      @timstevens3361 9 days ago

      get an rtx 3060 12 gig gpu or rtx 4060 16 gig
      they run a lot of different models really well !!!
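
The "Torch not compiled with CUDA enabled" error usually means some code asked PyTorch for the `cuda` device on a build that has no CUDA support, which is always the case on Apple Silicon. A minimal sketch of a device fallback follows, assuming the indexing code can be pointed at a device string; whether the project exposes such a setting is not stated in this thread, and `pick_device` and `detect_device` are hypothetical helper names.

```python
def pick_device(cuda_available: bool, mps_available: bool) -> str:
    """Prefer CUDA, then Apple's Metal backend (MPS), then plain CPU."""
    if cuda_available:
        return "cuda"
    if mps_available:
        return "mps"
    return "cpu"


def detect_device() -> str:
    """Ask PyTorch what is available; fall back to CPU if torch is missing."""
    try:
        import torch
    except ImportError:
        return "cpu"
    # torch.backends.mps is absent on older PyTorch builds, so look it up safely.
    mps = getattr(torch.backends, "mps", None)
    mps_ok = bool(mps is not None and mps.is_available())
    return pick_device(torch.cuda.is_available(), mps_ok)
```

On an M-series Mac with a recent PyTorch, `detect_device()` would be expected to return `"mps"`, and a model could then be moved with `model.to(detect_device())` instead of a hard-coded `"cuda"`.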

  • @mr.gk5
    @mr.gk5 17 days ago

    Can it generate graphs or reports on tabular data?

    • @engineerprompt
      @engineerprompt 17 days ago +1

      At the moment it can't, but I think it's possible to integrate it with a code interpreter for plots or table generation.

  • @Know_Ur_World
    @Know_Ur_World 17 days ago

    So can you help me with my use case?
    My use case is to extract the relevant text and images from a PDF. When a prompt is given, the relevant text along with the images should be displayed in the response in sequential order, not with the images in one place and the text in another.
    Query: Give steps in RSA agent installation
    Answer:
    1.Text1
    Image1
    2.Text2
    3.Image2
    Text 3
    4.Image4
    Text4
    5.image5
    Image 6
    Text5
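
One way to get the interleaved output described above is to keep each extracted text chunk and image together with its position in the PDF, then sort by that position before rendering. This is only a sketch under stated assumptions: the `Block` type, its fields, and the `[image]` marker are hypothetical, and it assumes the retrieval step already returns page numbers and vertical offsets for each item.

```python
from dataclasses import dataclass


@dataclass
class Block:
    kind: str     # "text" or "image"
    content: str  # the text itself, or a path/identifier for the image
    page: int     # page number in the source PDF
    top: float    # vertical offset on the page; smaller means higher up


def interleave(blocks):
    """Return blocks in reading order: by page, then top to bottom."""
    return sorted(blocks, key=lambda b: (b.page, b.top))


def render(blocks):
    """Render text and images as one numbered sequence."""
    lines = []
    for number, block in enumerate(interleave(blocks), start=1):
        marker = "[image] " if block.kind == "image" else ""
        lines.append(f"{number}. {marker}{block.content}")
    return "\n".join(lines)
```

A front end would replace the `[image]` marker with the actual picture; the point of the sketch is only that ordering is decided by document position rather than by content type.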