How to Preprocess Images for Text OCR in Python (OCR in Python Tutorials 02.02)

  • Published on 6 Apr 2021
  • If you enjoy this video, please subscribe.
    ✅Be my Patron: / wjbmattingly
    ✅PayPal: www.paypal.com/cgi-bin/webscr...
    How to Open an Image with OpenCV: 4:10
    01. Invert an Image: 9:47
    03: Binarization: 13:33
    04: Noise Reduction: 20:40
    05: Dilation and Erosion: 28:33
    06: Rotation and Deskewing: 35:07
    07: Removing Borders: 42:18
    08: Missing Borders: 49:09
    GitHub Notebook: github.com/wjbmattingly/ocr_p...
    If there's a specific video you would like to see or a tutorial series, let me know in the comments and I will try and make it.
    If you liked this video, check out www.PythonHumanities.com, where I have Coding Exercises, Lessons, on-site Python shells where you can experiment with code, and a text version of the material discussed here.
    You can follow me at:
    / wjb_mattingly

Comments • 162

  • @ds5015
    @ds5015 7 months ago +2

    Hey thank you so much! The video is about an hour long, so I’m really happy that your voice is so pleasant and happy to listen to! This video helped me a lot on my project.

  • @marcerum6963
    @marcerum6963 a year ago +1

    What a legend, only one ad at the beginning. You're so damn underrated.

  • @saifabusrour
    @saifabusrour a year ago +1

    I appreciate your step-by-step explanations

  • @balkiprasanna1984
    @balkiprasanna1984 a year ago +2

    Amazing, I am going to recommend this to my colleagues. Great work.

  • @jadenizard9955
    @jadenizard9955 a year ago +7

    This is awesome! Really helpful for my dissertation research about OCR in the accountability field. Many thanks from Paris!

    • @python-programming
      @python-programming a year ago

      So happy to hear that! No problem!

    • @rishis11
      @rishis11 2 months ago

      Why are we working in Jupyter all of a sudden?

  • @eduardmoldoveanu4245
    @eduardmoldoveanu4245 a year ago

    Thank you, I've been looking for a long time

  • @sriyaniliyanage6495
    @sriyaniliyanage6495 a year ago +1

    Thank you so much. I just upgraded from Making Waves, so the concepts are familiar, but this is a whole new world! Look forward to

  • @Superdooperhero
    @Superdooperhero 3 years ago +14

    Doing all the steps finally got me to 98% accuracy. Thanks so much!

  • @FernandoFischer6048
    @FernandoFischer6048 a year ago

    Man, you are an amazing teacher! I love the way you explain things! Thank you, BTW!

  • @sabririhab9383
    @sabririhab9383 3 years ago +27

    I can't believe I found this amazing channel! I've been working on an OCR mobile app to extract fields from scanned invoices, and these videos are exactly what I'm looking for. A million thanks, man!

    • @python-programming
      @python-programming 3 years ago +4

      Awesome! Glad you are finding them useful! I have 10 more videos planned for this series. I am getting over a cold and have not been able to post for a while.

    • @sabririhab9383
      @sabririhab9383 3 years ago +1

      @python-programming I hope you get well soon, and I'm looking forward to these videos.

    • @petianikolova3466
      @petianikolova3466 2 years ago

      same!

    • @cuneytsn
      @cuneytsn a year ago

      Did you finish your work?

    • @sabririhab9383
      @sabririhab9383 a year ago +1

      @cuneytsn Yes, I did.

  • @sinantk4036
    @sinantk4036 a year ago

    Found this actually helpful for people starting from scratch.

  • @michaelangelomarcos1824
    @michaelangelomarcos1824 a year ago

    Thank you man for sharing this stuff

  • @Superdooperhero
    @Superdooperhero 3 years ago +2

    Inverting took me from garbage to great recognition. Thanks!

    • @python-programming
      @python-programming 3 years ago

      Awesome! Glad these are useful and working. I was not sure if I explained it well enough.
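
For readers wondering what inversion actually does to the pixels, here is a minimal sketch. It assumes an 8-bit grayscale image stored as plain nested lists, and the helper name `invert_gray` is made up for illustration. In the video the same effect comes from `cv2.bitwise_not`, which for 8-bit images amounts to `255 - value` per pixel.

```python
def invert_gray(rows):
    """Invert an 8-bit grayscale image given as a list of pixel rows."""
    # For 8-bit values, flipping every bit is the same as 255 - value,
    # so dark text on a light page becomes light text on a dark page.
    return [[255 - value for value in row] for row in rows]


# A tiny 3x3 "page": white background with one dark ink pixel.
page = [
    [255, 255, 255],
    [255,   0, 255],
    [255, 255, 255],
]

inverted = invert_gray(page)
print(inverted[1])
```

Inverting twice returns the original image, so it is cheap to try both polarities and keep whichever one the OCR engine reads better.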

  • @ibrahimbourkiba9935
    @ibrahimbourkiba9935 2 years ago

    you really helped me, thank you

  • @vivekan97
    @vivekan97 2 years ago +1

    Great content and clear explanation

  • @user-ys7st8kk5e
    @user-ys7st8kk5e a month ago

    you saved my life....thanks

  • @dotzoidcom5322
    @dotzoidcom5322 a year ago

    yo bro, really thankya. Big respect

  • @shiyason213
    @shiyason213 2 years ago +1

    just amazing, thank you!

  • @minghui9525
    @minghui9525 a year ago

    fire video, thanks bro

  • @tuphamngoanh5515
    @tuphamngoanh5515 2 years ago +2

    Thank you for the very helpful tutorial. I am wondering whether preprocessing a SCENE-TEXT dataset differs from preprocessing an OCR dataset. In my view, the black-and-white (BW) step would not be needed for SCENE-TEXT, since there are various colors in an image. Thanks again.

  • @marco_graciano
    @marco_graciano 2 years ago

    Awesome!

  • @sager56
    @sager56 6 months ago +1

    Thank you! This will help me save my work :>

  • @netsereabhadis4351
    @netsereabhadis4351 7 months ago +1

    good job man!

  • @musinguzibenard2670
    @musinguzibenard2670 8 months ago +1

    Great thanks from Uganda, the pearl of Africa.

  • @cerioscha
    @cerioscha a year ago

    Great videos, thanks for sharing! Would you be able to tackle the problem of removing an image watermark from scanned PDFs, please?

  • @user-zv7yn9ly5b
    @user-zv7yn9ly5b a year ago +1

    Hi! First, thanks for the tutorial. In my case I had a problem correcting the rotated image with the function. The link gives a really cool solution: if anyone has the same problem, just comment out the line that sets the angle and put these lines in instead, and it will work perfectly.
    middleContour = contours[len(contours) // 2]
    angle = cv2.minAreaRect(middleContour)[-1]
    As I said, this comes from the link given, but here it is in case you're in a rush.
    Thanks for the channel and the videos, for real. I would have liked to have teachers like you while I was studying!

    • @sh00t01
      @sh00t01 4 months ago

      Thanks for your input. I tried every option, but the image always ends up rotated by roughly -90° to -30°. I cannot find where the problem is. I'm testing with a picture of my own, like the ones I will process.

  • @venes3713
    @venes3713 a year ago

    Thank you for this awesome tutorial! One question though: can we just manually crop the image instead of removing the borders?

  • @Rahul_Singh_Rajput_04
    @Rahul_Singh_Rajput_04 2 years ago +1

    Thank you so much for this video. This is a great video.

  • @RATANAGARWALITINFORMER
    @RATANAGARWALITINFORMER a year ago

    Wow good

  • @manchikatlasravan8264
    @manchikatlasravan8264 3 years ago +1

    Can you please share the display method that you found on Stack Overflow?

  • @nemka9119
    @nemka9119 10 months ago

    Hi, what if I have a region of interest of about 150x50 containing a single number, say 1235, that I want to OCR? The number takes up a large portion of the ROI image, about 60% of the height and 50% of the width. The image itself is a frame from a video capture, so it is good quality. What preprocessing do you think should be done? I am using Tesseract for the OCR. Thanks!

  • @user-dt4nl7po7e
    @user-dt4nl7po7e a year ago +2

    Hi! Thanks for the amazing explanation, it has helped me a lot! I am currently working on optimizing the Tesseract engine for my specific needs. However, I am having some problems with rotated documents. For some reason, removing the borders is not working. Since the document is rotated, so are the borders (so I am seeing black triangle-shaped borders). Furthermore, if the document is rotated counter-clockwise just a little bit, the correction rotates the image 90 degrees counter-clockwise rather than to the closest 90-degree angle.
    Any ideas for the problems I am having?

    • @sympathique7512
      @sympathique7512 7 months ago

      How did you manage to rotate the image? It rotates the image by 90 degrees every time, and I don't know how to fix that!
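
The "it always rotates by 90 degrees" reports in this thread are consistent with an angle-convention issue. As far as I know, the range of angles that `cv2.minAreaRect` reports depends on the OpenCV version (older releases use [-90, 0), newer ones (0, 90]), so code written for one convention can over-rotate under the other. Below is a minimal sketch of one possible guard, not the notebook's own code: the helper `normalize_skew_angle` is hypothetical, and it assumes the input angle is in degrees between -90 and 90.

```python
def normalize_skew_angle(angle):
    """Fold a rectangle angle in degrees into the range [-45, 45]."""
    # A rectangle looks the same after a 90 degree turn, so an angle of
    # -88 describes the same box edges as an angle of 2.
    if angle < -45:
        return angle + 90
    if angle > 45:
        return angle - 90
    return angle


# Angles as they might come back from a min-area rectangle fit.
for raw in (-88.0, -3.0, 4.5, 87.0):
    print(raw, "->", normalize_skew_angle(raw))
```

This only keeps the correction small; whether the result must be negated before rotating still depends on the convention in use, so it is worth testing on an image whose skew direction is known.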

  • @JasonVerro
    @JasonVerro 23 days ago

    Thanks for the video. I was having issues with the "getSkewAngle" function.
    I found an easy workaround though.
    I changed the last line
    "return -1.0 * angle"
    to just
    "return angle"
    Hope this helps anyone else with this problem.

  • @vandhyashreehs9435
    @vandhyashreehs9435 a year ago

    Thank you.
    Sir, is there any way to separate handwritten text and printed text in documents?

  • @lucyledezma709
    @lucyledezma709 3 years ago

    Hello, can you help me please?
    How can I detect inverted text or letters in an image, and then rotate the image back to its normal orientation? Thanks.

  • @rohinimaidamwar6787
    @rohinimaidamwar6787 a year ago

    Great tutorial. How do I deskew an image when the page is rotated upside down? Or at least, how do I get the angle of rotation?
    If anyone can help, please suggest a method or approach.

  • @meleseayichlie5645
    @meleseayichlie5645 a year ago

    Thank you for your valuable tutorial, but I have a question: how do I save the extracted text and the corresponding image name in Excel?
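
One possible answer to the Excel question, sketched with the standard library only: write a CSV file, which Excel can open. The function name `write_ocr_rows`, the column names, and the sample results are all made up for illustration; the real text would come from whatever OCR call is used.

```python
import csv
import io


def write_ocr_rows(results, handle):
    """Write (image_name, text) pairs as CSV rows with a header line."""
    writer = csv.writer(handle)
    writer.writerow(["image_name", "text"])
    for image_name, text in results:
        # The csv module quotes fields that contain commas or newlines.
        writer.writerow([image_name, text])


# Hypothetical results; in practice the text comes from the OCR step.
results = [
    ("page_01.jpg", "Hello world"),
    ("page_02.jpg", "Line one\nLine two"),
]

buffer = io.StringIO()
write_ocr_rows(results, buffer)
print(buffer.getvalue())
```

To write a real file instead of an in-memory buffer, pass a handle opened with `open("ocr_results.csv", "w", newline="", encoding="utf-8")`.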

  • @souravthakur6222
    @souravthakur6222 2 years ago

    How do I extract different kinds of text, such as handwritten, tabular, and free text, from different scanned documents, PDFs, and images using ML OCR and its preprocessing techniques?

  • @123mat1231
    @123mat1231 3 months ago

    For anyone getting an error message when importing pylot, here are a few solutions:
    1. Installation problem (error in python-slugify setup command: use_2to3). Solution: update your setuptools in the terminal with the command "setuptools==58", then try "pip install pylot".
    2. Error in import. Solution: change the command to "import matplotlib.pyplot as plt".

  • @prasannaimmaneni6888
    @prasannaimmaneni6888 3 years ago +3

    I'm unable to access the notebook. Can you please share the notebook?

  • @user-xe7ww4py7u
    @user-xe7ww4py7u a month ago

    Are the functions you built only suitable for this specific document? Or will they be able to handle new input documents?

  • @nossonweissman
    @nossonweissman a year ago

    Great video. But I need to point out that `/` is actually a forward slash

  • @theorager15
    @theorager15 3 years ago +5

    Hey, any idea why I got "ValueError: not enough values to unpack (expected 3, got 2)" at 15:43? I'm using Python 3.8.2 and OpenCV 4.5.1.
    Also, please check patron messages.
    Thanks!

    • @python-programming
      @python-programming 3 years ago +13

      Good question. I don't think I explained this in the video. The problem is likely from the display function with the height, width, (sometimes depth). Make sure that im_data.shape[:2] is in your display function. I updated this for GitHub so this error wouldn't pop up, but I don't think I mentioned it in the video. Here is the corrected display function:
      import matplotlib.pyplot as plt

      def display(im_path):
          dpi = 80
          im_data = plt.imread(im_path)
          # Take only the first two dimensions so grayscale and color images both work.
          height, width = im_data.shape[:2]
          # What size does the figure need to be in inches to fit the image?
          figsize = width / float(dpi), height / float(dpi)
          # Create a figure of the right size with one axes that takes up the full figure
          fig = plt.figure(figsize=figsize)
          ax = fig.add_axes([0, 0, 1, 1])
          # Hide spines, ticks, etc.
          ax.axis('off')
          # Display the image.
          ax.imshow(im_data, cmap='gray')
          plt.show()

    • @theorager15
      @theorager15 3 years ago +2

      @python-programming Thank you for your response. I had already found it and fixed it via SO, though. The point was that I didn't get the idea.
      Here is my current thinking: the reason behind it is that the code tries to unpack more values from an object than actually exist. So either we have to remove some of the requested values, or return the 'requested' value (before actually requesting it) to the program and then request the results.
      I know this is a bit more detailed and off topic for your video's comment section, but a clear answer would be greatly appreciated.
      Learning without gaps is better in the long run, don't you agree?

    • @python-programming
      @python-programming 3 years ago +2

      Precisely the problem! It was a good catch and I'm glad this comment is here. No worries about it being out of topic. I think it is very much on topic.
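
To make the cause of that ValueError concrete, here is a small sketch that needs no image at all. It assumes only that image arrays report a NumPy-style shape tuple: (height, width, channels) for color and (height, width) for grayscale. The helper name `image_size` and the sample sizes are illustrative.

```python
def image_size(shape):
    """Return (height, width) from a NumPy-style image shape tuple."""
    # Color images have shape (height, width, channels); grayscale images
    # have shape (height, width). Slicing keeps the first two either way.
    height, width = shape[:2]
    return height, width


color_shape = (480, 640, 3)   # e.g. im_data.shape for a color image
gray_shape = (480, 640)       # e.g. im_data.shape after grayscale conversion

print(image_size(color_shape))
print(image_size(gray_shape))

# Unpacking three names from a two-element shape reproduces the error.
try:
    height, width, depth = gray_shape
except ValueError as error:
    print("unpacking failed:", error)
```

Slicing with `[:2]` never raises here, which is why the corrected display function works for both kinds of image.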

  • @azmainatefsamy9566
    @azmainatefsamy9566 a year ago

    Ok nice

  • @Superdooperhero
    @Superdooperhero 3 years ago +6

    from numpy import ones, uint8
    kernel = ones((1, 1), uint8)
    is a lot faster than importing the entire numpy module
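
A note on the speed claim: as far as I know, `from module import name` still loads and runs the whole module, and only changes which names end up in your namespace, so any import-time saving is likely small. The sketch below checks this with the standard-library `json` module as a stand-in for numpy, so it runs without third-party packages.

```python
import sys

# Import a single name from a standard-library module.
from json import dumps

# The whole module was still loaded and cached, not just the one function.
print("json" in sys.modules)

# The imported name works as usual.
print(dumps({"kernel_shape": [1, 1]}))
```

The same check can be repeated with `"numpy" in sys.modules` after `from numpy import ones, uint8`. Importing only the names you need is still a reasonable style choice.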

  • @hamidsafiullahawan3433
    @hamidsafiullahawan3433 a year ago +1

    Thank you very much, sir. It helped me a lot in my OCR project.

  • @hemantchauhan6437
    @hemantchauhan6437 5 months ago

    Hello sir, I am working on a project where users should be able to upload a PDF only if the images in it contain handwritten text, not computer-typed text. Is there any Python library that can help me with this?

  • @ram_qr
    @ram_qr 5 months ago

    (14:49) Why do we use a grayscale function when we can simply use the built-in cvtColor function?

  • @akzork
    @akzork a year ago +1

    Rescaling code is not filled in the final notebook on GitHub.

  • @daves4026
    @daves4026 a year ago +1

    Wow awesome video, totally subscribing 🙂, I would like a transparency example, as we have invoices scanning from thin paper with duplex pages bleeding through the image. I'm assuming the transparency will help?

    • @python-programming
      @python-programming a year ago

      Thanks! Can you DM me on Twitter with some example images?

    • @daves4026
      @daves4026 a year ago

      @python-programming Hi, sorry, I don’t do Twitter or social media. But simply put, sometimes a scanner will produce an image which has some ghosting of the other side coming through.

    • @daves4026
      @daves4026 a year ago

      I am working through the video stack and trying to comprehend it all. I'm trying to develop an auto-scan OCR solution of sorts. What you and others have shared is really helpful.

  • @sanjeebsarkar6484
    @sanjeebsarkar6484 2 years ago +1

    Hi, thank you so much for such a detailed explanation and for making a series out of this.
    I have one question:
    Can you please tell me, or refer me to a link on, how to get the threshold of an image, so that we can use it as a reference for other images?

    • @python-programming
      @python-programming 2 years ago

      Hi! Interesting question. If I understand you right, that is not possible as far as I know. The threshold is a change to the image based on a certain pixel number. Are you instead thinking of trying to get an understanding of the average range of pixels in an image?

    • @sanjeebsarkar6484
      @sanjeebsarkar6484 2 years ago

      @python-programming Uh, yes, something like that. Instead of setting the threshold on an image and correcting it by trial and error, I was wondering whether it is possible to get the threshold of a perfect image and apply those values to a problematic image. Actually, I was trying your tutorial, and you mentioned that if we use dilation and erosion on a perfectly fine image, it will ruin it, and that's what was happening. So I got to thinking about whether there is any way to get a threshold value of an image.
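
One technique that may be relevant to this thread is Otsu's method, which derives a threshold from an image's own histogram instead of trial and error. OpenCV exposes it through the `cv2.THRESH_OTSU` flag of `cv2.threshold`. The pure-Python sketch below is only an illustration of the idea, not the video's code: it assumes 8-bit grayscale pixels passed as a flat list, and the function name is made up.

```python
def otsu_threshold(pixels):
    """Return the 8-bit threshold that maximises between-class variance."""
    hist = [0] * 256
    for value in pixels:
        hist[value] += 1

    total = len(pixels)
    sum_all = sum(level * count for level, count in enumerate(hist))

    best_threshold, best_variance = 0, -1.0
    weight_low = 0   # number of pixels at or below the candidate threshold
    sum_low = 0      # summed intensity of those pixels
    for level in range(256):
        weight_low += hist[level]
        sum_low += level * hist[level]
        if weight_low == 0:
            continue
        weight_high = total - weight_low
        if weight_high == 0:
            break
        mean_low = sum_low / weight_low
        mean_high = (sum_all - sum_low) / weight_high
        variance = weight_low * weight_high * (mean_low - mean_high) ** 2
        if variance > best_variance:
            best_threshold, best_variance = level, variance
    return best_threshold


# Synthetic page: dark ink pixels around 10, light paper pixels around 200.
pixels = [10] * 50 + [200] * 50
print(otsu_threshold(pixels))
```

A threshold computed this way describes the image it came from, so reusing it on a differently lit or differently scanned page may not work; computing it per image is usually the safer choice.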

  • @osumanaaa9982
    @osumanaaa9982 2 years ago +2

    Thanks a lot !! That's exactly the kind of material I was looking for. I have a question about deskewing though. Although it straightens the document well, there are cases where it completely flips it 90 degrees (the original skew was quite small actually), so I had to set a limit of 35/-35 in the angle to ignore what's outside the range. However, what bothers me more is that the OCR results get worse after applying the deskew. I mean, the results before are quite good, but if I apply it, it fails to detect many words that I can detect without applying it and even detects another language not existing in the document (I used 2 languages because in some documents, they both appear). Again, these two issues don't happen when I don't deskew

    • @python-programming
      @python-programming 2 years ago +1

      Thanks for the comment! Glad you liked the video. That is odd indeed! I am not sure why that would be the case.

    • @coolcovers2397
      @coolcovers2397 2 years ago

      The reason this problem happens is that the rotation function repeats steps we already did, such as converting the picture to gray, dilation, etc. If you do them again, the quality changes, not in your favor this time, which makes rotating the picture impossible.
      So you simply need to start by rotating the picture, THEN move on to the next steps.

  • @51_prathamniphadkar96
    @51_prathamniphadkar96 11 months ago

    Hello!!
    When I try to convert the image and run these commands
    inverted_image = cv2.bitwise_not(img)
    cv2.imwrite('PYTESSERACT/inverted.png', inverted_image)
    the output shows "False".
    Please tell me how to fix this.
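
A possible explanation, offered as a guess rather than a diagnosis: `cv2.imwrite` returns False instead of raising when it cannot write the file, and a missing target folder is one common cause. The sketch below uses only the standard library and a temporary folder; the helper `ensure_parent_dir` is hypothetical and the `cv2.imwrite` call itself is left as a comment.

```python
import os
import tempfile


def ensure_parent_dir(path):
    """Create the folder that `path` lives in, if it is missing."""
    parent = os.path.dirname(path)
    if parent:
        os.makedirs(parent, exist_ok=True)
    return parent


# Demonstrated in a temporary folder so nothing is left behind.
with tempfile.TemporaryDirectory() as root:
    target = os.path.join(root, "PYTESSERACT", "inverted.png")
    # The folder is missing at first, then exists after the helper runs.
    print(os.path.isdir(os.path.dirname(target)))
    ensure_parent_dir(target)
    print(os.path.isdir(os.path.dirname(target)))
    # In the notebook, cv2.imwrite(target, inverted_image) would follow here.
```

It is also worth checking that the relative path is resolved from the folder the notebook is actually running in.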

  • @ethanyoung8971
    @ethanyoung8971 a year ago +1

    What happened to using PIL to open an image..?

  • @hritiksth764
    @hritiksth764 2 years ago

    What is the kernel in the deskewing function?

  • @jashcontact7476
    @jashcontact7476 2 years ago

    Your display function does not work on grayscale images, because the function expects three values ("width", "height", "depth"), but a grayscale image gives only two.

  • @soted
    @soted a year ago

    At 41:15, line 395, I get an error saying: name 'new' is not defined (I'm working with PyCharm).

  • @vincentmanlesis5354
    @vincentmanlesis5354 2 years ago

    I tried the code for rotation and deskewing, but it doesn't change anything. I used a Raspberry Pi 4 to try it. Is it possible that it cannot run on a Raspberry Pi? Please reply.

  • @vaibhavpawar5807
    @vaibhavpawar5807 3 years ago +1

    Can you suggest an algorithm or a library for separating text in an image that contains text in different languages? I have an image with text in both German and English, and I want to separate the text for text summarization. Any help will be appreciated.

    • @python-programming
      @python-programming 3 years ago +1

      OpenCV. I have 5 videos coming out on how to do this, starting next week. You will want to use bounding boxes.

    • @nagarajumuthyala5798
      @nagarajumuthyala5798 3 years ago

      There are lots of modules you can use: Fitz, PyPDF2, pdfplumber, complot, pytesseract.

  • @tieman3790
    @tieman3790 2 years ago +2

    Good video. Not completely relevant for my case, as I don't know what the image will look like, but still useful!

  • @nagarajumuthyala5798
    @nagarajumuthyala5798 3 years ago

    Please, please make a video on how to detect a table in an image.

  • @nagarajumuthyala5798
    @nagarajumuthyala5798 3 years ago +2

    Please make a video on how to detect a table and how to calculate the accuracy for the image.

  • @talhaabdulqayyum193
    @talhaabdulqayyum193 3 years ago +2

    Which theme are you using for Jupyter? Please share the method, so that it doesn't affect IntelliSense as well.

    • @python-programming
      @python-programming 3 years ago +1

      I'm using the standard dark theme in JupyterLab

  • @websoftwaredeveloperijtiha3093
    @websoftwaredeveloperijtiha3093 3 months ago

    Which IDE did you use in the video?

  • @Fakkboi
    @Fakkboi 2 years ago +1

    Did you make a video for rescaling? :)

  • @jeevajanu1708
    @jeevajanu1708 2 years ago +1

    Could you please tell me where you write this code? I mean, is it in Python or IDLE?

  • @pcb5135
    @pcb5135 11 months ago

    Wait, we don't need to invert the image with Tesseract 4.0?

  • @justBeOrDontB7568
    @justBeOrDontB7568 2 years ago

    I am learning tons of stuff from this OCR series! It has helped me massively in developing my own project! But the display (im_path) function you are using to print every image is never working for me; I keep getting the same error ---> "ValueError: not enough values to unpack (expected 3, got 2)". Can you please help me with this?

    • @taylordunn5672
      @taylordunn5672 2 years ago +2

      Not sure if you have solved this or not, but this is how I got around that issue. Please comment if there is a better way. When converted to grayscale you lose the depth dimension. I put an if statement in the function to specify the number of dimensions to be expected. I hope this helps
      # 3 dimensions for RGB, 2 dimensions for grayscale
      import matplotlib as mpl
      import matplotlib.pyplot as plt

      def display(im_path, dimensions):
          if dimensions == 3:
              dpi = mpl.rcParams['figure.dpi']
              im_data = plt.imread(im_path)
          elif dimensions == 2:
              dpi = mpl.rcParams['figure.dpi']
              im_data = plt.imread(im_path)

    • @Booza1981
      @Booza1981 a year ago

      Weird. I agree with you, but if the shape is only two-dimensional for grayscale, it is odd that the error didn't come up in the video.

  • @FreeTrial93
    @FreeTrial93 5 months ago +1

    First I'd like to thank you for the tutorial. I have a question. In the 28th min you are talking about pros and cons of noise removal. And you said that for this particular image you wouldn't use it for stated reasons. So what if you have a thousand images you want to process with maximum rate of success. How would you then implement noise removal to those images that actually benefit from it and skip it for those that don't and would in case of using it result in worse performance?

    • @python-programming
      @python-programming 5 months ago +1

      Thanks for the comment! I'm glad you liked the video! There are a few solutions here, perhaps a small image classifier to detect those pages that have a lot of noise and flag them for manual adjustments. It likely wouldn't take many examples to train a binary model to do this. Another approach could be with Open-CV but it may take longer for some data to get that up and running effectively. It really comes down to the images you are using, though.
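
As a rough illustration of the "flag noisy pages" idea, here is a crude heuristic rather than the classifier mentioned in the reply. It assumes a binarized image stored as nested lists where 1 marks ink, and counts how many ink pixels have no ink neighbour. The function name and the `NOISE_LIMIT` cut-off are invented for the sketch and would need tuning on real scans.

```python
def isolated_pixel_ratio(rows):
    """Fraction of foreground pixels (value 1) with no foreground neighbour."""
    height = len(rows)
    width = len(rows[0]) if rows else 0
    foreground = 0
    isolated = 0
    for y in range(height):
        for x in range(width):
            if rows[y][x] != 1:
                continue
            foreground += 1
            has_neighbour = False
            # Look at the eight surrounding pixels, staying inside the image.
            for dy in (-1, 0, 1):
                for dx in (-1, 0, 1):
                    if dy == 0 and dx == 0:
                        continue
                    ny, nx = y + dy, x + dx
                    if 0 <= ny < height and 0 <= nx < width and rows[ny][nx] == 1:
                        has_neighbour = True
            if not has_neighbour:
                isolated += 1
    return isolated / foreground if foreground else 0.0


clean = [[0, 0, 0, 0], [0, 1, 1, 0], [0, 1, 1, 0], [0, 0, 0, 0]]
noisy = [[1, 0, 0, 1], [0, 0, 0, 0], [0, 1, 0, 0], [0, 0, 0, 1]]

NOISE_LIMIT = 0.2  # hypothetical cut-off, not a recommended value
for name, image in (("clean", clean), ("noisy", noisy)):
    ratio = isolated_pixel_ratio(image)
    print(name, ratio, "denoise" if ratio > NOISE_LIMIT else "skip")
```

On real pages, punctuation and dots on letters also produce small isolated marks, so a score like this is at best a first filter before manual review.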

  • @zealotbloodlust6089
    @zealotbloodlust6089 3 years ago +1

    Hi
    I just copied the same exact code that you have used for deskewing and also used the image that you used, but it doesn't work correctly, could you guide me through this?
    Thank you

    • @python-programming
      @python-programming 3 years ago +1

      What's the error?

    • @zealotbloodlust6089
      @zealotbloodlust6089 3 years ago +2

      @python-programming
      I had imported the rotated image with borders accidentally.
      My bad.
      Thank you for sharing your knowledge. I am learning a ton of new things.

    • @python-programming
      @python-programming 3 years ago +2

      No worries at all! Glad you figured it out and I am glad you are getting a lot from this channel!

  • @prithatasmin9966
    @prithatasmin9966 2 years ago

    How can I detect yes/no checkbox answers?

  • @octoplay_movies8847
    @octoplay_movies8847 2 years ago

    Which IDE is used in this?

  • @tiberiugeorgescu4459
    @tiberiugeorgescu4459 a year ago

    Border removal does not work with white borders. Does anyone have any idea why?
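
I have not checked the notebook line by line, so treat this as a guess: contour-based border removal usually relies on `cv2.findContours`, which treats non-zero (bright) pixels as foreground, so a white border may be read as part of the page rather than as a frame. Inverting the image first is one thing to try. As a simpler stand-in that is not the video's method, the sketch below trims any uniform border whose value you name explicitly; `crop_uniform_border` is a hypothetical helper working on nested lists.

```python
def crop_uniform_border(rows, border_value):
    """Trim outer rows and columns made up entirely of border_value."""
    keep_rows = [y for y, row in enumerate(rows)
                 if any(value != border_value for value in row)]
    if not keep_rows:
        return []  # the whole image is border
    width = len(rows[0])
    keep_cols = [x for x in range(width)
                 if any(row[x] != border_value for row in rows)]
    top, bottom = keep_rows[0], keep_rows[-1]
    left, right = keep_cols[0], keep_cols[-1]
    return [row[left:right + 1] for row in rows[top:bottom + 1]]


# The same helper handles a white frame (255) and a black frame (0).
white_framed = [
    [255, 255, 255, 255],
    [255,   0, 128, 255],
    [255,  64,   0, 255],
    [255, 255, 255, 255],
]
black_framed = [
    [0,   0, 0],
    [0, 200, 0],
    [0,   0, 0],
]

print(crop_uniform_border(white_framed, 255))
print(crop_uniform_border(black_framed, 0))
```

This only works for perfectly uniform frames; scanned borders are rarely that clean, which is one reason the thresholding and contour route is used in practice.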

  • @mariaoviedo6348
    @mariaoviedo6348 a year ago

    The only thing I learnt myself in soft soft is pressing tab in the keyboard to bring up the channel rack

  • @naitiktalati7330
    @naitiktalati7330 2 years ago

    not able to access the notebook!

  • @suphawatwong9438
    @suphawatwong9438 2 years ago

    Bro, did you forget to fill in the image rescaling section on GitHub? Anyway, thank you.

  • @Rachel-uh9lh
    @Rachel-uh9lh 2 years ago +1

    Hi sir, could you do a session on the Hough line transform?

  • @pacengan7843
    @pacengan7843 a year ago

    What is the name of the editor application? Thank you.

  • @akashhprabhu3763
    @akashhprabhu3763 2 years ago

    I need OCR for the Kannada language. How can I do that?

  • @sheebavinod7204
    @sheebavinod7204 4 months ago

    Will it work with scanned PDFs?

  • @fatimahehab7986
    @fatimahehab7986 10 months ago

    I get this error when trying to get the gray version of the image at 18:12
    Cell In[4], line 4, in display(img_path)
    2 dpi = 80
    3 img_data = plt.imread(img_path)
    ----> 4 height, width, depth = img_data.shape
    6 img_size = width / float(dpi), height/ float(dpi)
    8 fig = plt.figure(figsize=img_size)
    ValueError: not enough values to unpack (expected 3, got 2)

    • @remicornut9643
      @remicornut9643 8 months ago

      Copy and paste the function, call it "display2" for example, and remove the "depth". Then use display2 whenever display doesn't work.

  • @user-xu9qc7em5c
    @user-xu9qc7em5c a year ago

    I get the following error, can anyone suggest a solution ? :
    error: OpenCV(4.5.4) :-1: error: (-5:Bad argument) in function 'dilate'
    > Overload resolution failed:
    > - src is not a numpy array, neither a scalar
    > - Expected Ptr for argument 'src'
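
That "src is not a numpy array" message commonly means the value passed to `cv2.dilate` is not pixel data at all. Two frequent culprits are a file path string passed instead of a loaded image, and `None`, which `cv2.imread` returns when it cannot read the file. Below is a small guard, sketched without OpenCV so it runs anywhere; the helper name `require_image` and the sample path are made up.

```python
def require_image(img, source="<unknown>"):
    """Fail early with a readable message if img is clearly not pixel data."""
    if img is None:
        raise FileNotFoundError(
            f"no image data loaded from {source!r}; check the path")
    if isinstance(img, (str, bytes)):
        raise TypeError(
            f"got a path ({img!r}) instead of pixel data; read the file first")
    return img


# Both typical mistakes are caught before any OpenCV call is made.
for candidate in (None, "data/page_01.jpg"):
    try:
        require_image(candidate, source="data/page_01.jpg")
    except (FileNotFoundError, TypeError) as error:
        print(type(error).__name__, "-", error)
```

In a notebook, calling such a check right after loading the image makes a wrong path show up at the load step instead of several cells later.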

  • @aeeweb6464
    @aeeweb6464 2 years ago

    Amazing video.
    I would like the skew code, please.

  • @MijanurRahman-jo1st
    @MijanurRahman-jo1st 11 months ago +1

    Can I use it for handwritten text recognition, please?

    • @python-programming
      @python-programming 11 months ago

      HTR is a different problem. Much more custom solutions are needed. Transkribus or OCR4All (open source) are both good options to consider.

    • @MijanurRahman-jo1st
      @MijanurRahman-jo1st 11 months ago

      @python-programming Thanks

  • @sachinramesh5071
    @sachinramesh5071 a year ago

    Broooooo!

  • @modeltrainer1246
    @modeltrainer1246 2 years ago +3

    HOW TO RETRAIN OCR? I WANT TO DETECT LIVE LICENCE PLATE NUMBER

    • @python-programming
      @python-programming 2 years ago

      That would be object detection first, then OCR.

    • @modeltrainer1246
      @modeltrainer1246 2 years ago +1

      @python-programming I already did the object detection part. I trained the license plate detector in YOLO v4. I cropped the license plates from the images and am having a hard time using EasyOCR to get the text from the plates.

    • @python-programming
      @python-programming 2 years ago

      @modeltrainer1246 Excellent. Would you mind DMing me on Twitter with some sample images?

    • @shubhamnayak9682
      @shubhamnayak9682 10 months ago

      Hi bro, did you figure it out? How can I retrain my model to implement OCR once object detection is done?

  • @anilsharma32g
    @anilsharma32g 8 months ago

    Dear Sir, I am your subscriber.
    I want to create a tool that finds text errors in an image.
    For example:
    if I forget to write CONTACT US, BUY NOW, or a CONTACT NUMBER, or make a SPELLING MISTAKE, etc. in my social media post,
    the tool should find the error and suggest what is missing or incorrect in the post.
    🙏 Please guide me and suggest what course I need to buy or what I need to learn to create this tool.
    Thank you!

  • @trishamaemendoza1819
    @trishamaemendoza1819 a year ago

    not while quarantine but how r u doing is that hard ?

  • @leutrimiTBA
    @leutrimiTBA 22 days ago

    rescaling is not added

  • @niroshansilva8700
    @niroshansilva8700 a year ago

    misconceptions that the comment is supposed to soft like I am in love with him or something.

  • @32-sangnguyen11
    @32-sangnguyen11 a year ago

    I actually tried to play around on my own before watching this and finally knowing what many of the buttons I randomly clicked on an is

  • @user-cg1ug1hb4c
    @user-cg1ug1hb4c a year ago

    display('photos/gray.jpg')
    ValueError Traceback (most recent call last)
    Cell In[83], line 1
    ----> 1 display1('photos/gray.jpg')
    Cell In[65], line 4, in display1(im_path)
    2 dpi = 80
    3 im_data = plt.imread(im_path)
    ----> 4 height, width, depth = im_data.shape
    6 figsize = width / float(dpi), height / float(dpi)
    8 fig = plt.figure(figsize=figsize)
    ValueError: not enough values to unpack (expected 3, got 2)

  • @Champe19
    @Champe19 7 months ago

    Jupyter lab is saying “No module named matplotlib”
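
This kind of error often means the package was installed for a different Python than the one the notebook kernel runs. A small standard-library check can confirm that from inside the notebook; the helper name `is_installed` is made up for this sketch.

```python
import importlib.util
import sys


def is_installed(module_name):
    """Return True if the running interpreter can find the module."""
    return importlib.util.find_spec(module_name) is not None


# Shows which Python the kernel uses and which modules it can see.
print("Kernel interpreter:", sys.executable)
for name in ("json", "matplotlib"):
    print(name, "installed" if is_installed(name) else "missing")
```

If matplotlib shows up as missing, installing it with that same interpreter (for example `python -m pip install matplotlib` using the path printed above, or `%pip install matplotlib` in a notebook cell) should make it visible to the kernel.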

  • @toiatikhawla9278
    @toiatikhawla9278 a year ago

    It took 1,5 hours to finish watching this 18 MINUTES video while doing all the same steps on soft soft myself. My brain is fried and

  • @CryptixYT
    @CryptixYT a year ago

    guilty, I feel like being honest here is going to be the most meaningful.

  • @ahil6958
    @ahil6958 a year ago

    Not working on Windows... I am stuffed up with this!

  • @sucacxuyendem
    @sucacxuyendem a year ago

    too I made like s on garage band and thought it be easier in softsoft. nope

  • @subtleintuitions6524
    @subtleintuitions6524 2 years ago +1

    need to have pdf scanned into text

  • @WABCodeLab
    @WABCodeLab a year ago

    Tutorial*

  • @foreswanbe403
    @foreswanbe403 a year ago

    Nasty charge never said it did but ok