You have actually played a safe game in the video without resolving the extraction issues
It was nice. Please keep doing sessions so that our learning curve doesn't stop.
These types of classes are really nice.
Please do upload advanced topics on Tesseract in future videos.
Thank you so much for this, one of my most awaited videos.
This is a very helpful session for me... Can you please make a video on how to convert an image to CSV, if possible?
Thank you so much 👍🤝
Great sir 👍, before this video I couldn't imagine that Python could do this type of extraction too.
I like this type of session sir thank you for such a great session
Thank you Krish for the video. Really interesting and useful..!!
Hi sir,
I am currently working on a project Text Extraction from CPG(Consumer packaged goods) Product Images. Can we use Pytesseract to do the same?
You are really helping me a lot
Thank you very much
Thank you so much! It's really helpful
Sir, please make video on custom training and fine tuning! Please!
thank you so much sir...
Sir please take a class about how to save the model created using cnn for future use using hdf5
Sir, how do I extract data from a PDF, separate the names and phone numbers, and save them in an Excel file?
You found the way dude?
Have you got the information
Thank you so much sir
Live or recorded Both ways are good, sir
you are awesome .. Nice video.
Nice topic , krish
Thanks a lot sir ..
Please make a video on post-processing the text extracted by OCR. It is very important: when the document design changes, hard-coded checks such as "contains string" stop working, so I guess we need to use NLP.
I'm following the same steps, but it only shows text for simple images, not for complex images like invoices, traffic signs, etc. What may be the reason? Please guide.
Hi Krish, thanks a lot for your videos. I also want to know how to create a container in AWS.
Hello sir. Could you please make a video on segmenting a handwritten text image into characters? 🙏
You saved me
Do we have any library that can extract text from structured documents like a passport, Aadhaar card or PAN card?
use opencv library
Yes yes
Sir can you have lecture on OCR USING DEEP LEARNING
Yes
Great
@krish Naik sir could you please tell some way to extract address from a large text corpus? How can tesseract help to extract address from docs?
Sir, can we extract Arabic and English text with pytesseract? If so, can you discuss it in tomorrow's session or put up a video on it?
Have you tried that?
Hi Sir, my challenge is reading text inside images with wavy lines. The image was taken with a cell phone and just inserted as an image in a PDF file. Is there any special library for this? Pytesseract didn't work very well; it didn't capture the wavy lines well.
Hello thank you for the video. Is there a way to get the image preprocessed by the tesseract algorithm? When running tesseract in cmd I can get it by setting tessedit_write_images = 1, but in python I couldn't find a way to get preprocessed image.
Hi @Krish, I want to extract text from YOLOv8 prediction results on scanned documents. The predicted images have bounding boxes with classes such as header, footer, subheading and paragraph, and I want to extract the text for each class name along with its confidence score.
Bro, can you try
image_to_boxes
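To make that suggestion a bit more concrete: `pytesseract.image_to_boxes` returns a string with one line per recognised character in the form `char left bottom right top page`, with the origin at the bottom-left of the image. Below is a rough sketch of how that output could be matched against a detector box. It assumes the detector box is given as `(x1, y1, x2, y2)` with a top-left origin; `parse_boxes` and `text_in_region` are hypothetical helper names, not part of pytesseract.

```python
def parse_boxes(box_text, image_height):
    """Parse image_to_boxes output into dicts using top-left-origin coordinates."""
    boxes = []
    for line in box_text.splitlines():
        # Split from the right so an unusual character in column 0 stays intact.
        parts = line.rsplit(" ", 5)
        if len(parts) != 6:
            continue  # skip blank or malformed lines
        char = parts[0]
        left, bottom, right, top = (int(v) for v in parts[1:5])
        boxes.append({
            "char": char,
            "left": left,
            "right": right,
            # Tesseract measures y from the bottom edge; flip to top-left origin.
            "top": image_height - top,
            "bottom": image_height - bottom,
        })
    return boxes


def text_in_region(boxes, region):
    """Join the characters whose centre falls inside region = (x1, y1, x2, y2)."""
    x1, y1, x2, y2 = region
    chars = []
    for b in boxes:
        cx = (b["left"] + b["right"]) / 2
        cy = (b["top"] + b["bottom"]) / 2
        if x1 <= cx <= x2 and y1 <= cy <= y2:
            chars.append(b["char"])
    return "".join(chars)


# Example with hand-written box text (what image_to_boxes would return as a string).
sample = "H 10 80 20 95 0\ni 22 80 26 95 0\nX 200 10 210 25 0"
header_box = (0, 0, 50, 30)  # hypothetical detector box for a "header" class
print(text_in_region(parse_boxes(sample, image_height=100), header_box))
```

Note that the box output has no spaces between words and no confidence values. If a score is needed, `pytesseract.image_to_data` reports a word-level `conf` column, or each detected region can simply be cropped and passed to `image_to_string` on its own.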
Sir, a recorded video would have been more helpful than a livestream
Krish, can you do this on real-time video?
What is the name of your writing pad
Sir how can we train or retrain the model for new symbol ....
So that it can detect the symbol ....
Sir, I am working with Tesseract and OpenCV to build an ML-based application for an invoice system. The project is basically a system where you automate the data extraction and convert it into Excel by training on 10 invoice bills.
Please help sir
Hi you can contact me regarding OCR on invoice projects
@@manjubadiger2902 Thanks. But my project has been shut down. I asked Krish sir for help many times, and other people on LinkedIn too, but no one helped. For this reason I lost my job. Since then I have stopped watching all these YouTubers' videos. Thanks for your favour.
@@shubairabbas5480 Could you clarify about the project, and why was it closed?
@@manjubadiger2902 hey buddy, I need some help: how do I extract tables along with other data from any scanned document?
This is amazing. Thanks. Can we extract tabular info from image as tables? how?
I want to know how to do this as well....
Sir, how can we do this for multiple images and save the extracted text as a .txt file, like in Notepad?
Do you create one environment for all the installations, or create a new environment each time and install again? Please clarify.
Hello Krish,
I tried to upload the same images you are uploading, i.e. the traffic image and the invoice. I chose the exact same images from Google, but on running, the image is displayed and no text is printed. When I take a screenshot of Wikipedia text, it works absolutely fine. What could be the problem?
Sir, please tell how to implement this for multiple images
Can this also read invoices or bank statements? I think should be able to help my wife who is a CA
Yes I have shown the example
Oh sorry, did I miss it? I was getting my food.
How to send this data to excel files?
What is the name of the writing pad
Sir, please build a handwritten OCR recogniser using CNN...
Sir, have you found any solution for your query? I also need an OCR using deep learning tutorial
I am unable to join your membership, can you guide me on how to join?
Sir, I tried pytesseract on number plate detection and it's not showing great results. Can you please make a video on number plate detection as well?
How to know, what's the accuracy of my ocr model ?
How can I generate character level confidence score using tesseract??
Sir Debit Card is not working for getting membership ( Rs. 59 ) of your channel. Please help sir.
Dear Sir, I am your Subscriber
I want to create a tool that finds text errors in the image.
For Example:
I forgot to write CONTACT US, BUY NOW, CONTACT NUMBER, SPELLING MISTAKE, etc... in my social media post.
The tool should find the errors and suggest what is missing or incorrect in the social media post.
🙏 Please guide me and suggest what course I need to buy or what I need to learn to create this tool
Thank you
Sir, can you put up a new video on text extraction in Azure for Arabic and English ID cards?
I am getting error:
ImportError: cannot import name 'image_to_string' from 'pytesseract' (c:\python37\lib\site-packages\pytesseract\__init__.py)
Just after importing tesseract and giving the path.
Please help!!
Hey! Can you create a model for extracting the PAN number from a PAN card?
If we draw a circle over a text and take a snap of it then How will we extract that only content which is inside the circle.?
Did you find answer for this?
Please let me know how we can install it in Linux
Hey, follow this th-cam.com/video/-fIlUcp69xo/w-d-xo.html.
Can it read Doctor's Handwriting?
It shows me module not found, sir
Hey, I'm trying to build a PDF chatbot, but I want to add OCR to it so that it recognises text in images too. Can someone guide me please?
When I execute import pytesseract....
This is not working on tabular data in scanned images
Can we use pytesseract to read Kannada text?
What about other languages
Getting error Exec format error tesseract-ocr-w64-v5.exe
Running code in colab
I think this doesn't work on Colab because we need to install the Tesseract exe file on our local system to use it. So use this in a Jupyter notebook on your local desktop.
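On the Colab point: Colab notebooks run on Linux, so a Windows installer such as `tesseract-ocr-w64-v5.exe` cannot run there, which would explain an `Exec format error`. On Debian/Ubuntu-style systems the engine is normally installed with `apt-get install tesseract-ocr` (in a notebook cell, prefixed with `!`), and pytesseract then only needs to know where the binary lives. A minimal sketch under that assumption; `find_tesseract` and `configure_pytesseract` are hypothetical helper names, and the candidate paths are just common defaults, not guaranteed locations:

```python
import os
import shutil


def find_tesseract(candidates=("/usr/bin/tesseract", "/usr/local/bin/tesseract")):
    """Return the path of the tesseract binary, or None if it cannot be found."""
    # First choice: whatever is on PATH.
    found = shutil.which("tesseract")
    if found:
        return found
    # Fallback: a few common install locations (assumed, adjust for your system).
    for path in candidates:
        if os.path.isfile(path) and os.access(path, os.X_OK):
            return path
    return None


def configure_pytesseract():
    """Point pytesseract at the located binary; raise if tesseract is missing."""
    path = find_tesseract()
    if path is None:
        raise RuntimeError(
            "tesseract binary not found; install it first, "
            "e.g. apt-get install tesseract-ocr on Debian/Ubuntu"
        )
    import pytesseract  # imported here so the lookup above works without it

    pytesseract.pytesseract.tesseract_cmd = path
    return path


print(find_tesseract())
```

The same lookup should work on a plain Ubuntu box, where the binary usually ends up on `PATH` and no explicit `tesseract_cmd` is needed at all.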
hi krish
What if the language is Hindi or Sanskrit, will it work?
How to extract Hindi text in Tesseract?
what about ubuntu path
please help us with captcha images reading
Hi, I am looking for medical prescriptions dataset where I read the handwritten text using OCR, anyone can share with me this dataset?
I want to just read particular part from images after classification
like only read names from all aadhar cards photos
Hi
Hi
Tesseract only works when the image background and text are clear. I tried to use Tesseract on LCD panels and it gave bad results.
There is another way to do it: first convert the original image to a binary image, which basically separates the text from the non-text part, and then feed that into Tesseract. The results will improve.
@@adis6867 Can you elaborate the steps for it? It would be quite helpful.
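Not the original commenter, but here is one way the binarisation step could look. This is only a sketch: it assumes the image is already grayscale (values 0 to 255) and picks the threshold with Otsu's method, which is one common choice and not necessarily what was meant above. It is written in plain Python so the logic is visible; `otsu_threshold` and `binarize` are hypothetical helper names.

```python
def otsu_threshold(pixels):
    """Return the gray level that best separates dark and light pixels (Otsu)."""
    hist = [0] * 256
    for v in pixels:
        hist[v] += 1
    total = len(pixels)
    sum_all = sum(level * count for level, count in enumerate(hist))

    best_t, best_var = 0, -1.0
    weight_bg, sum_bg = 0, 0
    for t in range(256):
        weight_bg += hist[t]          # pixels with value <= t
        sum_bg += t * hist[t]
        if weight_bg == 0:
            continue
        weight_fg = total - weight_bg  # pixels with value > t
        if weight_fg == 0:
            break
        mean_bg = sum_bg / weight_bg
        mean_fg = (sum_all - sum_bg) / weight_fg
        # Between-class variance; the threshold maximising it is Otsu's choice.
        var_between = weight_bg * weight_fg * (mean_bg - mean_fg) ** 2
        if var_between > best_var:
            best_var, best_t = var_between, t
    return best_t


def binarize(rows):
    """Map a 2D list of gray values to pure black (0) and white (255)."""
    flat = [v for row in rows for v in row]
    t = otsu_threshold(flat)
    return [[255 if v > t else 0 for v in row] for row in rows]


# Tiny example: dark "text" pixels (30) on a light background (200).
gray = [[200, 200, 30],
        [200, 30, 200]]
print(binarize(gray))
```

In practice OpenCV does the same thing in one call, `cv2.threshold(gray, 0, 255, cv2.THRESH_BINARY + cv2.THRESH_OTSU)`, and the resulting image can be passed straight to `pytesseract.image_to_string`. Whether it helps depends on the image; uneven lighting usually needs an adaptive threshold instead.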
Sir, I have a linux box. What are the steps for me? I have installed tesseract-ocr and pytesseract both the packages
Hey, follow this th-cam.com/video/-fIlUcp69xo/w-d-xo.html.