Extract Text From Images in Python (OCR)

แชร์
ฝัง
  • เผยแพร่เมื่อ 26 ธ.ค. 2024

ความคิดเห็น •

  • @StephenBeale
    @StephenBeale ปีที่แล้ว +28

    Page segmentation modes:
    O Orientation and script detection (OSD) only
    1 Automatic page segmentation with OSD. ‘
    2 Automatic page segmentation, but no OSD, or OCR.
    3 Fully automatic page segmentation, but no OSD. (Default)
    4 Assume a single column of text of variable sizes.
    5 Assume a single uniform block of vertically aligned text.
    6 Assume a single uniform block of textJ
    7 Treat the image as a single text line.
    8 Treat the image as a single word.
    9 Treat the image as a single word in a circle.
    10 Treat the image as a single character.
    11 Sparse text. Find as much text as possible in no particular order.
    12 Sparse text with OSD.
    13 Raw line. Treat the image as a single text line, bypassing hacks that are Tesseract-specific.

    • @hpbbx
      @hpbbx 7 หลายเดือนก่อน

      wheres the code

  • @mishudhar3746
    @mishudhar3746 9 หลายเดือนก่อน +1

    Your video helped a lot than university. Thank you so much. I can't express how much it helped.

  • @foxii5573
    @foxii5573 ปีที่แล้ว +11

    I never thought Nacho would teach me Python

  • @desmondchia2013
    @desmondchia2013 2 ปีที่แล้ว +4

    hi, the cv2.fucntion was highlighted and cannot run. no errors either. import cv2 done. opencv-python installed....all done. how to resolve?

  • @zackjennings4355
    @zackjennings4355 ปีที่แล้ว +4

    do you have any videos on training tesseract to better recognize say a different font?

  • @2ru2pacFan
    @2ru2pacFan 3 ปีที่แล้ว +12

    Thank you dude! Exactly what I was looking for! Suggestion, do you think you can make a video on Google Drive and Python?

  • @kaitaojiang5839
    @kaitaojiang5839 2 ปีที่แล้ว +25

    According to my tests, PaddleOCR works better than tesseract in most scenes.

    • @samajanjua9625
      @samajanjua9625 2 ปีที่แล้ว +2

      Plzz share code

    • @coyoteden8111
      @coyoteden8111 ปีที่แล้ว

      It's all about tesserocr

    • @aimalrafi7718
      @aimalrafi7718 10 หลายเดือนก่อน

      im looking for library that can take cursive writing to readable writing. Do you think paddleocr can do the job

  • @borisveselov8421
    @borisveselov8421 10 หลายเดือนก่อน +1

    Thank you for your tutorials!
    I have a question: what would be the best way to extract a specific information, such as invoice number or purchase order number. Let’s say there are many different vendors, maybe over 100 vendors. And every vendor has invoice number in different parts of the document. What technique would be used to perform this kind of ocr?

    • @NeuralNine
      @NeuralNine  10 หลายเดือนก่อน +1

      That's quite difficult. I don't think there is a very simple solution to this. But one idea is to use regular expressions if you know the format of the numbers for each vendor.

    • @borisveselov8421
      @borisveselov8421 10 หลายเดือนก่อน +1

      @@NeuralNine thank you. I agree that regex is a decent way to achieve this. What do you think about creating a template for each vendor?

  • @invinciblenoman
    @invinciblenoman 3 ปีที่แล้ว +2

    I am facing this error. Anyone Can help me? PermissionError: [WinError 5] Access is denied

  • @DhruvalBhinsara
    @DhruvalBhinsara 3 ปีที่แล้ว +2

    you always bring this awesome videos .

  • @mars-cs4uk
    @mars-cs4uk 2 ปีที่แล้ว +7

    Awesome video. It would be nice if we can use the phone to do the same photo to text conversion.

  • @szymonandrzejewski6087
    @szymonandrzejewski6087 11 หลายเดือนก่อน

    Can I put this kind of a program on the web server, get data from webhooks, process them with OpenCV + Tesseract, and then send processed data back to the input server?
    Actually I mean if the tesseract and whole program can work on server environment?

  • @Than_31
    @Than_31 ปีที่แล้ว +4

    how do you make it extract text from what is currently on ur screen? and store it in a variable which changes each time because each time whats stored on ur screen is different

    • @daveg3651
      @daveg3651 ปีที่แล้ว +1

      Screenshot screen, scan screenshot, store to file

  • @danielrichard6894
    @danielrichard6894 9 หลายเดือนก่อน

    Am running tbe code in jupyter Notebook but am having insure installing the different packages like tesseract

  • @MananGandhi
    @MananGandhi 3 ปีที่แล้ว +3

    Your videos are amazing

  • @ArianAlijani
    @ArianAlijani 18 วันที่ผ่านมา

    can you train it for new fonts or images ?

  • @matheus.tecchio
    @matheus.tecchio 10 หลายเดือนก่อน

    Can I use that to read numerical values? I'm creating a project that I can read every day my energy usage with a camera

  • @ryanjcole
    @ryanjcole ปีที่แล้ว

    Hey mate- -- any chance these files might be in a repo somewhere for reference?

  • @ammadkhan4687
    @ammadkhan4687 11 หลายเดือนก่อน

    Hi, do you have any video on Azure Function App using Python as a Scripting Language to run the python script in Azure Function app?

  • @anilsharma32g
    @anilsharma32g ปีที่แล้ว

    Dear Sir, I am your Subscriber
    I want to create a tool that finds text errors in the image.
    For Example:
    I forgot to write CONTACT US, BUY NOW, CONTACT NUMBER, SPELLING MISTAKE, etc... in my social media post.
    that the tool finds error and suggests what are missing or what is incorrect in social media post.
    🙏 Please guide me and suggest what course I need to buy or what I need to learn to create this tool
    Thank you

  • @SD-rg5mj
    @SD-rg5mj ปีที่แล้ว +1

    hello could you advise me an artificial intelligence, I have photos of jewelry and I would like this one to give me the description in SEO and then the generated text goes directly into a Google sheet
    anyway thank you very much for your videos

  • @arek7198
    @arek7198 ปีที่แล้ว

    Thanks, nice example, good work.

  • @hemantchauhan6437
    @hemantchauhan6437 9 หลายเดือนก่อน

    NEED HELP! I am making a website where user can upload a pdf but I want that pdf to upload only if that pdf has images of only HANDWRITTEN text. Thank you for reading.

    • @llmll6447
      @llmll6447 8 หลายเดือนก่อน

      I am working on the same project as u

    • @llmll6447
      @llmll6447 8 หลายเดือนก่อน

      Did u find anything ?

    • @hemantchauhan6437
      @hemantchauhan6437 8 หลายเดือนก่อน

      Nope🥲..Please let me know if you find anything

  • @otiopo7459
    @otiopo7459 3 ปีที่แล้ว +1

    I was looking for this

  • @timahern1017
    @timahern1017 3 หลายเดือนก่อน

    would this work on handwritten text?

  • @2862_AKASHDEEP
    @2862_AKASHDEEP 7 หลายเดือนก่อน

    Please tell me if I use what I learn in this video in any commercial app of my own, is it legal? I am in 1st year of college and need to know these rules

    • @sujanmuppavarapu8281
      @sujanmuppavarapu8281 7 หลายเดือนก่อน

      Why would you think its illegal. Its an open source repo of course its legal. My suggestion learn any tech stack and implement it and deploy

  • @MedoHamdani
    @MedoHamdani 8 หลายเดือนก่อน +2

    Is it possible to export it into an online document automatically, then it get checked by A.I to detect any mistakes or misspelling words, so the user can automatically and quickly fix them up one by one, by jumping from one word to another.
    Would this work on manuscripts such as Arabic language, and is there a way to train the machine to recognise the characters of Arabic including Tashkeel.
    Thanks

  • @kapilbhatia1012
    @kapilbhatia1012 ปีที่แล้ว

    hi there. awesome video. i wanted to ask that if i want particular data only from an image is it possible?

    • @honkersandslug8134
      @honkersandslug8134 ปีที่แล้ว

      did you figure it out

    • @mukilanru
      @mukilanru 5 หลายเดือนก่อน

      ​@@honkersandslug8134 Did you?

  • @viviannjuguna3127
    @viviannjuguna3127 2 ปีที่แล้ว

    wish to test this code on my localhost,where can i find it?

  • @chinagopireddyravuri4007
    @chinagopireddyravuri4007 2 ปีที่แล้ว +1

    I need a help from you. I think you know ICR right (INTELLIGENT CHARACTER RECOGNITION) . We are trying to implement that in deep learning.
    But we don't have segmentation for image.
    Our algorithm is good for normal hand writting but cursive handwriting which is having high storks.
    Please help me out for word segmentation brother,
    Thankyou, hope you will help me.

  • @samuraigaming0
    @samuraigaming0 ปีที่แล้ว

    tesseract thinks 7 is 1 for me, is there anything i can do, its also not so good at detecting dots

  • @gpulogic4ai
    @gpulogic4ai ปีที่แล้ว +1

    OK followed that and used a very CLEAR input image and VERY VERY VERY clear human readable font. Tesseract in all its paramaters totally face plants. It totally totally fails it is so bad!

  • @DrU387
    @DrU387 10 หลายเดือนก่อน

    Can anyone tell me why someone use python or paddle or these things that require coding and does not use a ocr program like pdf element or pdf acrobat or any online ocr , why i use coding thing ???

    • @abhimonarch9596
      @abhimonarch9596 10 หลายเดือนก่อน

      first of all it is free and let u automate task the only reason to use ocr....i have to say after extraction of 1000 images each image data i want to properly arrange in excel too. So python helps a lot working with ocr

    • @DrU387
      @DrU387 10 หลายเดือนก่อน

      @@abhimonarch9596
      I have a lot of scanned images maybe reach 10000 images and the number will inceease i want to convert all of that to microsoft word files with the same original formatting and tables and pictures all that i want to keep it ,can you tell me the best way to do that ?

  • @KhalilYasser
    @KhalilYasser 3 ปีที่แล้ว +2

    Awesome. Thanks a lot. Can you share the code as I am lost at some point?

    • @NeuralNine
      @NeuralNine  3 ปีที่แล้ว +4

      I don't share code for small tutorials. I explain why in my Q&A.

    • @Foxlaren
      @Foxlaren ปีที่แล้ว

      @@NeuralNine can u help me continue this project by exporting to csv file.

  • @shepherdchitedze6169
    @shepherdchitedze6169 2 ปีที่แล้ว

    I am looking for something like this but which scans passport mrz

    • @Black-wj1tu
      @Black-wj1tu 10 หลายเดือนก่อน

      If you found sth send help please I’m stuck :(

  • @eiskaffe0
    @eiskaffe0 2 ปีที่แล้ว

    Thanks. Great video!

  • @donnaesperas3739
    @donnaesperas3739 5 หลายเดือนก่อน

    How about it is in the '.pdf' document but contains image?

    • @NeuralNine
      @NeuralNine  5 หลายเดือนก่อน

      First extract the image(s) from the PDF and then perform the OCR.

  • @varraakhil9869
    @varraakhil9869 ปีที่แล้ว

    Will this work for captcha images? can anyone please answer

  • @JuanDelaCruz-uq1vt
    @JuanDelaCruz-uq1vt 9 หลายเดือนก่อน +8

    Not for everyone. Too complex.

    • @andrefelix9794
      @andrefelix9794 4 หลายเดือนก่อน +8

      if coding eas easy, everyone would do it :)

  • @vivekan97
    @vivekan97 3 ปีที่แล้ว +5

    Nice tutorial. Can you post a Tutorial on how to extract tabular data in OCR Python

    • @kewalmishra7870
      @kewalmishra7870 2 ปีที่แล้ว

      Hi did you find any solution for tabular data extraction using ocr?

    • @abdulwajid6725
      @abdulwajid6725 2 ปีที่แล้ว

      Hi did you find any solution for tabular data extraction using ocr?

    • @stefano8840
      @stefano8840 ปีที่แล้ว

      Hi did you find any solution for tabular data extraction using ocr?

    • @honkersandslug8134
      @honkersandslug8134 ปีที่แล้ว

      plz find one@@stefano8840

    • @ttgboi6734
      @ttgboi6734 ปีที่แล้ว

      Hi did you find any solution for tabular data extraction using ocr?

  • @saivivek5563
    @saivivek5563 3 ปีที่แล้ว

    Bro please post a video that extract tabular data from image and pdf except using tabula, camelot,table -ocr,... All libraries I used none of them are working plzzz..

  • @sulekha3771
    @sulekha3771 2 ปีที่แล้ว

    How would I go about it using a MacBook ?

    • @axscs1178
      @axscs1178 2 ปีที่แล้ว +1

      You can use a jupyter Notebook and use pytesserect, similar to what he does at 02:58

  • @shashidhardevraj
    @shashidhardevraj ปีที่แล้ว

    I am working handwritten text extraction from the image which is non english. I tried using tesseract for extract. but the accuracy is very poor. Could any one please let me know if there is any lib available for extraction?

    • @HamzaSaleem-x2o
      @HamzaSaleem-x2o ปีที่แล้ว +1

      no there is not , I have the same problem , I have tried many packages none of them is advanced enough to detect handwritten text until it is written in very clear handwriting.

  • @Dildo_Schwangerschaft
    @Dildo_Schwangerschaft 3 หลายเดือนก่อน

    Kommst du etwa aus Mannheim mein guter?

  • @poojabhandari631
    @poojabhandari631 ปีที่แล้ว

    i want to convert pdf to excel ....how i can do

    • @rayanibrahimbenatallah7278
      @rayanibrahimbenatallah7278 ปีที่แล้ว

      to a csv file, u need to extract the text first, then save the string in a file, and u can read the file with pandas and give it the separator.

  • @kaleblub
    @kaleblub ปีที่แล้ว

    I am getting into Django development, so I'm a little new to that, but do you think it's possible to run this feature on a Django site? It should be possible right? Basically anything python can do, just on the web..

  • @hamyaff7321
    @hamyaff7321 2 ปีที่แล้ว

    How to extract pixels for a TIFF image and be white and black and they do inside his matrix use Python

  • @Who_____Am_____i
    @Who_____Am_____i ปีที่แล้ว

    how to protect text from ocr scanning ?? 😢😢

    • @cavalfou
      @cavalfou ปีที่แล้ว

      make bad looking text, write it by hand ? do weird letters XD

    • @Who_____Am_____i
      @Who_____Am_____i ปีที่แล้ว

      @@cavalfou No brother its not written by hand.... i am doing legal drafting work.... some people ask me for pdf or JPEG files for drafting chekout...... now days someone know about ocr scanning and so many apps... i dont want to copy my drafting work so.....

  • @bhojpurishorts2109
    @bhojpurishorts2109 ปีที่แล้ว

    How to create txt extractor bot please help ❤❤

  • @CoolBoy_Official
    @CoolBoy_Official ปีที่แล้ว

    21:04

  • @danielniels22
    @danielniels22 3 ปีที่แล้ว

    hello i wanna ask please...

  • @mr.borchick6087
    @mr.borchick6087 วันที่ผ่านมา

    dattaaa 23:46

  • @FrankLena-i8r
    @FrankLena-i8r 3 หลายเดือนก่อน +1

    Elise Landing

  • @anormalzz
    @anormalzz 3 หลายเดือนก่อน

    thank you verymuch

  • @WesleyKerr-n7l
    @WesleyKerr-n7l 3 หลายเดือนก่อน

    Hadley Highway

  • @joswadsouza8362
    @joswadsouza8362 3 ปีที่แล้ว +1

    Second 🔥🔥🔥best project

  • @manlawan
    @manlawan ปีที่แล้ว

    seems like Pipboy is not impossiible

  • @METHUPETERMACHARIA
    @METHUPETERMACHARIA ปีที่แล้ว

    you look a little bit like joel kinnaman

  • @bhavyamverma70
    @bhavyamverma70 3 ปีที่แล้ว +2

    First

  • @philtoa334
    @philtoa334 2 ปีที่แล้ว

    Thx_.

  • @SPOOKIEMOOKe
    @SPOOKIEMOOKe 2 ปีที่แล้ว

    lol not for the amateurs. why are all coders terrible teachers

  • @ahmedgamberli2250
    @ahmedgamberli2250 3 ปีที่แล้ว

    Forth

    • @NeuralNine
      @NeuralNine  3 ปีที่แล้ว +1

      Forth or fourth? 👀

    • @ahmedgamberli2250
      @ahmedgamberli2250 3 ปีที่แล้ว

      @@NeuralNine Was ihre Meinung?

    • @thomasgoodwin2648
      @thomasgoodwin2648 3 ปีที่แล้ว +1

      Forth was an intriguing language imho. I can understand why it ultimately failed, but it still had some cool ideas.

  • @aminachak1498
    @aminachak1498 ปีที่แล้ว

    hey
    i have an image that contains persian numbers. i want to reed those numbers. the ocr didnt helped me so much. but i have an idea. what if i could use "opencv" and give it a set of persian numbers and signs that the image contains then match number digit by digit. numbers are big >99,999.999