@@nathantonning Nice of you to say so -thanks again. Q: have you used the program in a batch process. I've got very many (100's) archived pdfs on my Synology NAS That are not searchable that I'd like to be searchable. The process looks daunting. Doing individual pdfs is doable but would take a year of months, many hours to complete. Any pointers? Thanks again. cheers ATB
@@TheClembo I'm sorry! I didn't see your comment until just now! YT notifications can be kinda dumb. This answer is probably late, but you might be able to use a bash script with a loop that allows you perform the operation. All you would need to do is to count your files, store that value to a variable, create the loop and call the ocr process on each file.
yo this may be the dumbest question you've ever heard, but how do I run it? like I have python (and all the pre-reqs for the package) installed, and I'm opening a powershell window where my pdf is. but, It's always throwing me a command not recognized error, I think maybe I'm missing something about how python packages are even run and installed? Any help is appreciated!
I haven't attempted to use the program in windows, but my recommendation would be to use WSL2, if you are able to set up that environment, and use the linux version of ocrmypdf. I have some videos that show how to get started with WSL, and there are many fine tutorials out there as well. Hope that's helpful!
So I got the same problem, also for the same reason. If you installed ocrmypdf with -pip in Anaconda's Python you need to use Anaconda Prompt to run ocrmypdf. Open Anaconda Prompt and go where the pdf file is with "cd ", then you'll be able to run commands as "ocrmypdf -l por hernan_sampieri.pdf hernan_sampieri_ocr.pdf".
thanks a lot, very helpful content
just what i was looking for -thank you.
You are welcome!
@@nathantonning Nice of you to say so -thanks again. Q: have you used the program in a batch process. I've got very many (100's) archived pdfs on my Synology NAS That are not searchable that I'd like to be searchable. The process looks daunting. Doing individual pdfs is doable but would take a year of months, many hours to complete. Any pointers? Thanks again. cheers ATB
@@TheClembo I'm sorry! I didn't see your comment until just now! YT notifications can be kinda dumb. This answer is probably late, but you might be able to use a bash script with a loop that allows you perform the operation. All you would need to do is to count your files, store that value to a variable, create the loop and call the ocr process on each file.
Very nice. Thank you.
yo this may be the dumbest question you've ever heard, but how do I run it? like I have python (and all the pre-reqs for the package) installed, and I'm opening a powershell window where my pdf is. but, It's always throwing me a command not recognized error, I think maybe I'm missing something about how python packages are even run and installed? Any help is appreciated!
I haven't attempted to use the program in windows, but my recommendation would be to use WSL2, if you are able to set up that environment, and use the linux version of ocrmypdf. I have some videos that show how to get started with WSL, and there are many fine tutorials out there as well. Hope that's helpful!
So I got the same problem, also for the same reason. If you installed ocrmypdf with -pip in Anaconda's Python you need to use Anaconda Prompt to run ocrmypdf. Open Anaconda Prompt and go where the pdf file is with "cd ", then you'll be able to run commands as "ocrmypdf -l por hernan_sampieri.pdf hernan_sampieri_ocr.pdf".
can you use this on a 550 page tech manual that has graphs and pictures
in my case the ocr is useless and very poor for languages other then english i use it on windows but ocrmypdf doesnt work for me
Which ocr program do you use for non-english text sources?