GraphRAG Ollama: 100% Local Setup, Keeping your Data Private

แชร์
ฝัง
  • เผยแพร่เมื่อ 5 ก.ย. 2024

ความคิดเห็น • 83

  • @anubisai
    @anubisai 2 หลายเดือนก่อน +6

    Great work as usual. Humble. Concise. Helpful. Perfect. 👌

  • @maxs6128
    @maxs6128 หลายเดือนก่อน +10

    Hey! Cool video. I actually built a full local solution using Ollama, no need for LM Studio at all. Here's what I did: I created a proxy that translates between OpenAI API embeddings and Ollama's format, both ways.
    The cool thing is, it works flawlessly for both global and local queries. I'd be happy to share the script with you if you're interested!

    • @matthewsoltis3383
      @matthewsoltis3383 หลายเดือนก่อน +1

      Please.

    • @MervinPraison
      @MervinPraison  หลายเดือนก่อน +3

      Yes please

    • @maxs6128
      @maxs6128 หลายเดือนก่อน +1

      ollama embeddings proxy on git hub

    • @user-ul6fz8fw3l
      @user-ul6fz8fw3l หลายเดือนก่อน +1

      Please.

    • @maxs6128
      @maxs6128 หลายเดือนก่อน

      ollama embeddings proxy on git hub

  • @SullyOrchestration
    @SullyOrchestration 2 หลายเดือนก่อน +10

    Can you please show a way to visualize the knowledge graph with an interactive UI?

  • @nikhielsingh748
    @nikhielsingh748 2 หลายเดือนก่อน +2

    best GenAI TH-camr, I mean it .

  • @MattJonesYT
    @MattJonesYT หลายเดือนก่อน

    Looking forward to more on this, it is the most interesting cutting edge tech in AI and almost no one else on youtube is talking about it

  • @ignaciopincheira23
    @ignaciopincheira23 หลายเดือนก่อน +1

    It is essential to conduct a thorough preprocessing of the documents before entering them into the RAG. This involves extracting the text, tables, and images, and processing the latter through a vision module. Additionally, it is crucial to maintain content coherence by ensuring that references to tables and images are correctly preserved in the text. Only after this processing should the documents be entered into a LLM.

  • @MeinDeutschkurs
    @MeinDeutschkurs 2 หลายเดือนก่อน +1

    A start, great!

  • @Gurdershan
    @Gurdershan 2 หลายเดือนก่อน +1

    I was eagerly waiting for this, big thanks

  • @NimaAmini2000
    @NimaAmini2000 2 หลายเดือนก่อน

    Another great video about GraphRAG, good job.

  • @yingzhu505
    @yingzhu505 หลายเดือนก่อน

    Great tutorial! Thank you!

  • @girijeshthodupunuri1300
    @girijeshthodupunuri1300 2 หลายเดือนก่อน

    This is not all feasible on my computer but I would love move graph rag videos aiming more at how we can get this technology production ready.

  • @Gwaboo
    @Gwaboo หลายเดือนก่อน +1

    can you please show or explain how to get the visualization of the data ? looks verry good, and thanks for the tutorial

  • @alamgirqazi1
    @alamgirqazi1 2 หลายเดือนก่อน +2

    You are amazing 🎉

  • @GeertBaeke
    @GeertBaeke หลายเดือนก่อน

    Good stuff. As expected, on a Mac M2, indexing and global queries are quite slow. Local queries are doable because it's usually just one LLM call after the similarity & graph search.

  • @sharankumar31
    @sharankumar31 2 หลายเดือนก่อน

    Thank you for this tutorial. Very useful..

  • @JV61
    @JV61 2 หลายเดือนก่อน

    This is what I was looking for

  • @macjonesnz
    @macjonesnz 2 หลายเดือนก่อน +8

    Mervin, Hi from New Zealand, I see that took 20 minutes to index…. what are the specs of your machine?

    • @d.d.z.
      @d.d.z. 2 หลายเดือนก่อน +2

      Same question. My computer runs Gemma 2 quite slow and I prefer to use Llama3 or Phi. The results will be the same?

    • @dudicrous
      @dudicrous 2 หลายเดือนก่อน

      @@d.d.z. What's your pc spec?

    • @d.d.z.
      @d.d.z. 2 หลายเดือนก่อน +1

      @@dudicrous Intel core i5 8th gen 8gb RAM I have a HP pavilion 13-an0012la.

    • @MervinPraison
      @MervinPraison  หลายเดือนก่อน

      It’s very slow for me too, I use Mac M2 32GB
      In the video I had to cut that part, because it took 26 mins just for indexing and considering it’s small chunk of data

  • @xinzhang3502
    @xinzhang3502 หลายเดือนก่อน +1

    So compared to GPTs, his search generation effect will be better?

  • @nz6291
    @nz6291 หลายเดือนก่อน

    Thanks this really helped!

  • @MattJonesYT
    @MattJonesYT หลายเดือนก่อน

    At 7:10 I believe the reason it's giving errors is the url in the settings file is missing the word embeddings at the end. It probably tested some different urls until it figured it out.

  • @lesptitsoiseaux
    @lesptitsoiseaux 2 หลายเดือนก่อน

    What a perfect video to wake up to after yesterday's video :) I'm starting to think that we're abusing graphRAG here, all of us. You see, and I may be wrong I'm still a n00b here, we are not using semantic chunking and also, for those of us with thousands of files, say transcripts, feeding graphRAG a summary and tags might be good enough for a recommendation engine and if the user wants to dive in, then you use rag but you create a rag for each main collection of documents. So the graph rag may be able to list say what cooking classes you can take much faster and then querying each class that is its own rag for details should be also much faster and overall cheaper? What do you think?

    • @MervinPraison
      @MervinPraison  หลายเดือนก่อน +1

      Basic RAG is fine for basic tasks but this GraphRAG is for advanced and more meaningful response.

  • @SonGoku-pc7jl
    @SonGoku-pc7jl หลายเดือนก่อน

    thanks. coming soon local vs. global

  • @BatukeshwarVats
    @BatukeshwarVats หลายเดือนก่อน

    Nice and useful video, but still not getting one thing. You made this video around 3 weeks ago, but in april , ollama released some embedding models. Then how we are saying it is not having embedding compatibility?.

  • @nikhielsingh748
    @nikhielsingh748 2 หลายเดือนก่อน +1

    quick question, I already have a folder of embeddings and chunks, can I just pass the documents and embeddings to GraphRAG ?

  • @codelucky
    @codelucky หลายเดือนก่อน

    Can you create a video on how to use GraphRAG with the GROQ API? Looks like nobody has done it yet. Thank you.

  • @song8775
    @song8775 หลายเดือนก่อน

    Hi, how do you fix the issues with running local search using command line?

  • @mllearning-qc6dt
    @mllearning-qc6dt หลายเดือนก่อน

    What is the average query time that you were experiencing with the global/local search?

  • @debarghyadasgupta1931
    @debarghyadasgupta1931 2 หลายเดือนก่อน

    Would you recommend GraphRAG for structured data as well like Postgres or MySql? I am still stuck between LLM SQL agent vs Vectors. I did explored Vanna already and like it. Appreciate your thoughts on this.

  • @bradmiles358
    @bradmiles358 2 หลายเดือนก่อน

    Great stuff, id be really interested to see csv rather than txt import as it looks like it might possibly give higgher reasoning by leveraging the structure. Ie better temporal reasoning etc...
    source_column: "author" # the column containing the source/author of the data text_column: "message" # the column containing the text of the data timestamp_column: "date(yyyyMMddHHmmss)" # optional, the column containing the timestamp of the

  • @jcalonsoc
    @jcalonsoc 2 หลายเดือนก่อน

    Great video! Can you export a CSV file to visualize the graph using an external tool like Gephi?

  • @eduardoconcepcion4899
    @eduardoconcepcion4899 2 หลายเดือนก่อน

    Can't we use nomic-embed-text provided by ollama for the embeddings?

  • @Yannick-ei2tz
    @Yannick-ei2tz 2 หลายเดือนก่อน

    Thx for the prez. It is about graph so is it possible to get a grip on the underlying graph db and vizualise it using a ds tool ?

  • @mohsenghafari7652
    @mohsenghafari7652 หลายเดือนก่อน

    Thanks

  • @chending5419
    @chending5419 หลายเดือนก่อน

    After a few tries, my conclusion on graphRAG was that it is buggy when running local, took too long to process - as such that it is not practical to run locally.

  • @KS-tj6fc
    @KS-tj6fc 2 หลายเดือนก่อน

    Do a video that finds the balance of speed (local takes forever) and cost by using DeepSeek-V2, which is only $0.14/million tokens input and $0.28/million tokens output.
    Once you get your results, I would say that at RAG = 2 the Deepseek would be at least equal to or slightly better than GPT 4o at =1.
    Then add an additional python step, prompt to take overall points and run limited amounts of tokens via GPT 4o or even Gemini Flash 1.5, which is quite good, for improvement to 1.6~1.7 level outputs at almost “free token costs”.

  • @changtimwu
    @changtimwu หลายเดือนก่อน

    Is LM Studio necessary? I believe OpenWebUI should also suffice, potentially creating a completely open-source graphrag solution.

  • @Gwaboo
    @Gwaboo หลายเดือนก่อน

    how can we use graph rag on data that's in another language? i have hundreds of documents i want to put into a rag but i cant get a good result out of the rag. the only things that come out are generic and often even things that i didn't ask about. do i have to use a German llm and embeddings model and translate all the prompts into German or do i have to translate all the data into English and live with it being in English?

  • @shiweiluo
    @shiweiluo หลายเดือนก่อน

    Why do you read the settings.yaml file by default when you create an index, but mine reads the .env file?

  • @mrchongnoi
    @mrchongnoi 2 หลายเดือนก่อน

    Good video. Please do work out the error you encountered. Do you have a GPU on your laptop. 20+ minutes makes this unusable for a company with 100s or Thousands of documents.

  • @rockypunk91
    @rockypunk91 หลายเดือนก่อน

    I want to understand, how can we use it in a actual application. I reality users will upload their documents anytime they want, If I run indexer for different documents seperately it creates a seperate timestamp based folders in output, now how will the graph rag work when we have multiple artifacts? Our do I have to run indexer on entire documents even if one new document is added? and how do we trigger it programitically

  • @mamadousow7323
    @mamadousow7323 2 หลายเดือนก่อน

    Didn’t understand the final sentence, running things in llmstudio , what about many pdf documents

  • @lemon268
    @lemon268 หลายเดือนก่อน

    is this completely free? or are there open api calls?

  • @dove8998
    @dove8998 2 หลายเดือนก่อน +2

    You need an OpenAI API Key to run it. It is unfortunatly not 100% local. Is there a way to run it without an OpenAI paid subscription?

    • @linmichael7935
      @linmichael7935 หลายเดือนก่อน

      yes you can, just export the "string or some text" for the OpenAI API Key. And then you can run it locally.

  • @ShadowyLane
    @ShadowyLane หลายเดือนก่อน

    Something is off. It works with openapi, no problem... But local models and embedding models - no chance. Followed all instructions.
    EDIT: httpS made me problems for local model, amateur issue. Sorry. It works locally now. Thank you.

  • @dawn_of_Artificial_Intellect
    @dawn_of_Artificial_Intellect 2 หลายเดือนก่อน

    can you use NIM to keep your Data Private?

  • @JrTech-rw6wj
    @JrTech-rw6wj 2 หลายเดือนก่อน +1

    can i use gemini model with graph rag ??

    • @MervinPraison
      @MervinPraison  2 หลายเดือนก่อน +2

      Yes, but not straight out of the box as I can see. You might need to modify the code slightly

    • @takshitmathur2761
      @takshitmathur2761 2 หลายเดือนก่อน +2

      @@MervinPraison can you make a video on this please.

  • @PradeepKumar-zy6cd
    @PradeepKumar-zy6cd 2 หลายเดือนก่อน

    The problem is LM studio getting error

  • @iham1313
    @iham1313 2 หลายเดือนก่อน +2

    as long as ms does not add other formats besides txt and csv, this graphrag is useless to me and all the business use cases i know and have.

    • @eggmaster88
      @eggmaster88 2 หลายเดือนก่อน +1

      Why? Can't you transform the data?

    • @iham1313
      @iham1313 2 หลายเดือนก่อน +1

      @@eggmaster88 can you? including images, diagrams, tables, metadata like page numbers, … please tell me, if there is a good solution

    • @eggmaster88
      @eggmaster88 2 หลายเดือนก่อน

      @@iham1313 for tables you can use .csv from what i know but it seems that you don't really need graph database for your data, more like sql.

    • @littledaddi3
      @littledaddi3 2 หลายเดือนก่อน

      @@iham1313 look up LangChain GraphRag vids on the topic - they show how to do images diagrams tables and metadata.

  • @zz-dy7bz
    @zz-dy7bz 26 วันที่ผ่านมา

    Are you serious? It's not working at all.

  • @zhengwu-jw6fm
    @zhengwu-jw6fm 2 หลายเดือนก่อน

    GRAPHRAG_API_KEY= "ollama"?big thanks

    • @matthewsoltis3383
      @matthewsoltis3383 หลายเดือนก่อน

      I got it to run without adding any key

    • @gagnongui78
      @gagnongui78 หลายเดือนก่อน

      @@matthewsoltis3383 how please?