Best Tool For Getting Your Data Ready For RAG

แชร์
ฝัง
  • เผยแพร่เมื่อ 27 ม.ค. 2025

ความคิดเห็น • 21

  • @TooyAshy-100
    @TooyAshy-100 8 หลายเดือนก่อน

    Congratulations on reaching 10K, and soon it will be even greater.

  • @aalamansari8643
    @aalamansari8643 6 หลายเดือนก่อน +1

    Sir, using partition_pdf not able to get the bulllet points from pdf like (- Any bullet point). How to get the bullet points, need help sir!

  • @d3mist0clesgee12
    @d3mist0clesgee12 8 หลายเดือนก่อน

    Congratulations on ur 10K !!!!!!!

  • @mchl_mchl
    @mchl_mchl 8 หลายเดือนก่อน

    Great video! I have been using unstructured data connectors to do hydrid searches and text embedding with elasticsearch - would love to see if you have some tips for the JSON mapping there or anything else. Would love to get a function for all data types that can handle all the edge cases

  • @awakenwithoutcoffee
    @awakenwithoutcoffee 6 หลายเดือนก่อน +1

    HI sir, have you tried Azure AI Document Intelligence ? we are figuring out which data parser is the most suitable for production RAG apps. Cheers

  • @yazanrisheh5127
    @yazanrisheh5127 8 หลายเดือนก่อน

    Hello Sudarshan. Can you please make a video of the RAG on several PDFs where these PDFs have all text, images, and tables please.

    • @datasciencebasics
      @datasciencebasics  8 หลายเดือนก่อน

      Will take that in my to do list ✅

  • @SantK1208
    @SantK1208 8 หลายเดือนก่อน

    Thanks Sudarshan, could you please make a video on fine tuning llama 3 model ???

    • @datasciencebasics
      @datasciencebasics  8 หลายเดือนก่อน

      You are welcome. Will note that in my to do list !!

  • @Rahulrawat-ht4fy
    @Rahulrawat-ht4fy 18 วันที่ผ่านมา

    Do i need to download the poppler and tessaract to use the unstructured api for pdf files?

    • @datasciencebasics
      @datasciencebasics  17 วันที่ผ่านมา

      If you use locally then yes, if you use the unstructured client then no.

    • @Rahulrawat-ht4fy
      @Rahulrawat-ht4fy 16 วันที่ผ่านมา

      @@datasciencebasics Thanks a lot for replying

  • @CC-zg4el
    @CC-zg4el 8 หลายเดือนก่อน

    Hi Sudarshan, I have been trying to follow your unstructured tutorials, but I keep getting an erro at the beginning because, apparently, my virtual environment lacks something which I cannot figure out. I also forked and cloned your repository locally, in hope that there is a spec-file.txt file to clone your environment. However, It seems there is not such file. Would you mind sharing a spec-file.txt yo clone your environment and try your notebook? If you have another video where you have already instruct your subscribers how to follow along your tutorial, please just point me in the right direction. Thank you very much for your time!

    • @datasciencebasics
      @datasciencebasics  8 หลายเดือนก่อน

      You are welcome. Installing unstructured python sdk might be challenging as it might need some system level package installation. This video has some ideas and link in the notebook, please follow it there.
      th-cam.com/video/hQu8WN8NuVg/w-d-xo.htmlsi=V3N2VjcguzLPMiII

  • @drmetroyt
    @drmetroyt 8 หลายเดือนก่อน

    How to install as docker container?

    • @datasciencebasics
      @datasciencebasics  8 หลายเดือนก่อน

      You can read and follow feom the official documentation -> docs.unstructured.io/open-source/installation/docker-installation

    • @drmetroyt
      @drmetroyt 7 หลายเดือนก่อน +1

      ​@@datasciencebasicsI'm not a tech person sir , I'm a medical student i have many PDFs which i want to feed in RAG so i want to install unstructured io in docker but there is no video on internet , although they mention in the documents to install as docker but there is no proper understandble guide

    • @awakenwithoutcoffee
      @awakenwithoutcoffee 6 หลายเดือนก่อน

      @@drmetroyt ask GPT! It should be able to walk you trough the process or at-least point you in the right direction. It is not an easy subject for a beginner (I am also new to Docker).