Please subscribe and like the video to help me keep motivated to make awesome videos like this one. :)
Thank you for this video. I needed this information at this very moment.
I am really happy it helped!
Fantastic tutorial, so much simplified ...great job
You are welcome, AbdoulAhad-Family!
The voice is very clear and crisp, easy for non-English viewers to understand. The content is excellent and explained exquisitely. Could you let us know whether, using RAG and prompts, we can join tables found in a PDF or Word document, filter data from those tables, and perform other such operations with Gen AI?
Thank you for your response!
My course on Multimodal RAG below would be a good option to explore and might be what you are looking for to answer your questions.
th-cam.com/video/D5iKsvK7cXg/w-d-xo.htmlsi=scyeB48fyvrn9nsU
@@techwithzoum thanks good to explore
@@SMCGPRA you are welcome!
Excellent tutorial, thank you. Your video was referenced on AI Jason's channel and I am glad he did.
This is awesome, great work!
Thank you, AI JASON!
Please, feel free to share with your audience for better outreach :)
Great work. Absolute genius!
Thank you!
Thanks so much :) Great work!
Thanks!
Thanks so much. Can we have an example of data extraction from a table in an image?
Thanks, @thibauteka4046, and that sounds like a great idea!
Hello, what version of langchain are you using? I think mine doesn't work.
Can you please share the error you are getting?
How does LlamaIndex PDF-to-text conversion compare to these methods?
Hey, I'm getting an error:
~\anaconda3\lib\genericpath.py in isfile(path)
28 """Test whether a path is a regular file"""
29 try:
---> 30 st = os.stat(path)
31 except (OSError, ValueError):
32 return False
TypeError: stat: path should be string, bytes, os.PathLike or integer, not JpegImageFile
Can you help me with a fix?
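Judging from the traceback alone, `os.stat` received an opened in-memory image (`JpegImageFile`) where a file path was expected, so the loader probably wants a path string rather than an image object. Here is a minimal sketch of one workaround, not a confirmed fix. `as_path` and `FakeImage` are hypothetical names used only for illustration, and the stand-in class mimics nothing but the `save` method that PIL images provide.

```python
import os
import tempfile


def as_path(obj, suffix=".jpg"):
    """Return a filesystem path for obj, saving it to a temp file if needed."""
    # Strings, bytes and os.PathLike objects are already valid for os.stat.
    if isinstance(obj, (str, bytes, os.PathLike)):
        return obj
    # Assumption: image-like objects expose save(path), as PIL images do.
    if hasattr(obj, "save"):
        fd, path = tempfile.mkstemp(suffix=suffix)
        os.close(fd)  # close the handle so save() can reopen the file
        obj.save(path)
        return path
    raise TypeError(f"cannot turn {type(obj).__name__} into a path")


class FakeImage:
    """Hypothetical stand-in for an opened image; only save() is mimicked."""

    def save(self, path):
        with open(path, "wb") as fh:
            fh.write(b"not a real image")


image = FakeImage()

# Passing the object itself reproduces the kind of error in the traceback.
try:
    os.stat(image)
    raised = False
except TypeError:
    raised = True

# Passing a path works.
path = as_path(image)
exists = os.path.isfile(path)
os.remove(path)  # clean up the temporary file
```

If the image was opened from disk in the first place, passing that original path string to the loader avoids the temporary file entirely.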
Which of these has the best accuracy?
The Unstructured library seems to do a great job since it has the features to handle any type of file without much difficulty.
Great Work, thank you ! 😀
You're welcome!
Important question: If a pdf does have a picture in it, when converting to picture, firstly, is that picture added as text or is it skipped?
Secondly, is there a way to *know* that the extracted text is coming from an image within the pdf? Some sort of metadata at least to get that info?
Thanks for the video, nice content with overall breadth, wish you could answer my question.
Hi, could you solve this issue?
@@piyushchhawachharia_0068 Hi, not really. I realised that what the Python libraries usually do is picture-ize the entire PDF page. That means text inside pictures within PDFs is read as characters, but obviously the semantics of visual representations are not transcribed. Thankfully the order in which the text appears is good enough; I haven't seen anything terribly wrong in the order of the extracted text. How do you approach this yourself?
Please suggest: I have multiple PDFs, and some of them contain images. I don't want to apply multimodal AI separately to the PDFs that have images. I want to proceed like this: load the documents -> split into multiple chunks -> embed each chunk and store it in a vector store. Please suggest the process.
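The load -> split -> embed -> store flow described above can be sketched in plain Python. This is a toy illustration under stated assumptions, not a recommended stack: the text is assumed to be already extracted from each PDF (text layer plus OCR of any images), `embed` is a word-count stand-in for a real embedding model, and `InMemoryStore` is a hypothetical stand-in for a real vector store. The file names and texts are made up.

```python
import math
from collections import Counter


def split_into_chunks(text, chunk_size=200, overlap=50):
    """Split text into overlapping character windows."""
    if overlap >= chunk_size:
        raise ValueError("overlap must be smaller than chunk_size")
    step = chunk_size - overlap
    return [text[i:i + chunk_size] for i in range(0, len(text), step)]


def embed(text):
    """Toy embedding: lower-cased word counts (a real model goes here)."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two word-count vectors."""
    dot = sum(a[w] * b[w] for w in a if w in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


class InMemoryStore:
    """Hypothetical stand-in for a vector store."""

    def __init__(self):
        self.items = []  # list of (vector, chunk, source) tuples

    def add(self, chunk, source):
        self.items.append((embed(chunk), chunk, source))

    def search(self, query, k=1):
        q = embed(query)
        ranked = sorted(self.items, key=lambda it: cosine(q, it[0]), reverse=True)
        return [(chunk, source) for _, chunk, source in ranked[:k]]


# Made-up extracted text, keyed by a made-up source file name.
documents = {
    "report.pdf": "Quarterly revenue grew by ten percent. Costs stayed flat.",
    "scan.pdf": "Invoice total due on receipt. Payment by bank transfer.",
}

store = InMemoryStore()
for source, text in documents.items():
    for chunk in split_into_chunks(text, chunk_size=40, overlap=10):
        store.add(chunk, source)

best_chunk, best_source = store.search("invoice payment", k=1)[0]
```

Because every chunk keeps its source file name, PDFs with and without images go through the same loop; only the extraction step before it differs.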
Great work. Can we extract information from charts such as histograms or bar plots?
Susmitsekhar, thank you!
Yes, we can extract any textual information from a histogram or bar plot. Give it a try; I would be happy to hear about your findings.
@@techwithzoum Thanks for your quick response. Unfortunately, it's not working for histograms, bar plots and similar charts. Do you have any solutions?
@@susmitsekhar5100 I haven't explored a case like this, so I can't tell right now.
@@techwithzoum Thanks for your response. Kindly let us know if you come across anything similar.
@@susmitsekhar5100 sure!
Don't forget to subscribe and share the link of the channel with your friends!
Do these work for scanned PDFs?
Yes, they do.
@zoumdatascience Can we return an image along with the text, based on the question?
Yes, you can do that.
@@techwithzoum How can I do that? Do you have any reference?
Actually, I need to build a chat-with-PDF app and want the answers to include both text and images.
Good one!
Thanks!
What is the name of the web page where you write the code?
Hi there, I plan on using the EasyOCR library for some sensitive documents. Is it safe, that is, can any data leaks occur? Also, is there any documentation for the library that I can refer to?
Thanks!!
Same question. Can anyone please answer?
I run into the following error when I try langchain's UnstructuredImageLoader:
TypeError: stat: path should be string, bytes, os.PathLike or integer, not JpegImageFile
Did you find a fix for this?
I am getting a "list index out of range" error with langchain. Can you suggest something?
Can you please share the complete error you are getting?
Sure
IndexError: list index out of range.
It happens in the part where I convert a multi-page PDF and then access data[index].page_content.
I am getting the error on that line.
@@anubhav963 please let me know if you need further help!
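One possible cause, assuming the loader returned fewer items than there are pages (for example a single document for the whole PDF): `data[index]` then fails for any index past the end. Here is a minimal sketch of checking the length before indexing. `Page` and `page_text` are hypothetical names, and the one-item `data` list is made up for illustration.

```python
from dataclasses import dataclass


@dataclass
class Page:
    """Hypothetical stand-in for a loader's document object."""
    page_content: str


def page_text(data, index):
    """Return the text at position index, or None if index is out of range."""
    if 0 <= index < len(data):
        return data[index].page_content
    return None


# Made-up loader output: one document holding the text of every page.
data = [Page("all pages merged into a single document")]

count = len(data)            # check how many items the loader produced
first = page_text(data, 0)
second = page_text(data, 1)  # data[1] would raise IndexError here

# Iterating avoids hard-coded indices altogether.
all_text = [page.page_content for page in data]
```

Printing `len(data)` right after loading usually shows whether the loader split the PDF per page or returned it as one piece.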
I used a third-party tool's API to extract text and tables, but the image part is not working; it doesn't even recognise the images. If I use the Python libraries instead, they recognise the images and I can save them to other folders and work on them later, but I guess table extraction won't work with the Python libraries.
I use a neural network to extract data from tables.
Hi sir, I am from India and this is very useful. I am also working on the same kind of project and need your help. Can you help, please?
I am glad it helps. How may I help?
Hey, I am also from India. What kind of project are you working on?
Hello, are you Malian? Great tutorial, thanks for sharing.
I am Ivorian with Malian origins!
@@techwithzoum I am Malian. Is there a way to contact you? I am starting my master's program in data science and machine learning soon, in September to be precise. I need to ask questions of someone who is already in the industry.
All the methods for converting the files failed, and I don't know why.
٨
Ok
That was very useful for me, but I ran into one problem: langchain's UnstructuredImageLoader says PNGImage is not allowed. I don't get it. I tried to resolve it but couldn't. Please, if there is any way to contact you, I'd appreciate it. Thank you.