Best Way to OCR a PDF in Python - spaCy Layout

Python Tutorials for Digital Humanities

2,223 views

Published on 9 Feb 2025
In this video, I'm going to show you the best way to OCR a PDF in Python with the new spaCy Layout package. The best part about this package is that it gives you access to all the important metadata generated from a spaCy pipeline alongside layout detection and OCR. This means you will have bounding boxes for the labeled regions of text on a given image. You can also do table detection.
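A minimal sketch of the workflow described above, not the code from the video. It assumes the `spacy-layout` package is installed and that, as in its README, layout regions are exposed through `doc.spans["layout"]`, each region's bounding box through `span._.layout` (with `x`, `y`, `width`, `height`, `page_no`), and detected tables through `doc._.tables`. The helper names `region_to_dict` and `ocr_pdf` and the file path in the usage note are hypothetical.

```python
def region_to_dict(span):
    """Flatten one layout span into a plain dict: label, text, page, bbox."""
    box = span._.layout  # assumed to carry x, y, width, height, page_no
    return {
        "label": span.label_,  # region type, e.g. "title" or "text"
        "text": span.text,
        "page": box.page_no,
        "bbox": (box.x, box.y, box.width, box.height),
    }


def ocr_pdf(path):
    """Run spaCy Layout on a PDF; return the spaCy Doc and its flattened regions."""
    # Imported lazily so region_to_dict can be used without these packages.
    import spacy
    from spacy_layout import spaCyLayout

    nlp = spacy.blank("en")    # a blank pipeline is enough for layout + text
    layout = spaCyLayout(nlp)
    doc = layout(path)         # layout detection and OCR happen here
    regions = [region_to_dict(span) for span in doc.spans["layout"]]
    return doc, regions
```

Calling `doc, regions = ocr_pdf("example.pdf")` would then give a regular spaCy `Doc` plus one dict per labeled region; `doc._.tables` should hold the spans detected as tables.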
spaCy Layout: github.com/exp...
GitHub Repo: github.com/wjb...
Join this channel to get access to perks:
/ @python-programming
If you enjoy this video, please subscribe.
✅Be my Patron: / wjbmattingly
✅PayPal: www.paypal.com...
If there's a specific video you would like to see or a tutorial series, let me know in the comments and I will try and make it.
If you liked this video, check out www.PythonHumanities.com, where I have Coding Exercises, Lessons, on-site Python shells where you can experiment with code, and a text version of the material discussed here.
You can follow me at:
/ wjb_mattingly

Comments • 16

@msickand 2 hours ago
Man, this is an amazing video. So helpful, a big THANK YOU. A Table video would also be fantastic, and thanks in advance for that!😉
@VentureMLops 18 days ago ⁺²
Interesting. Waiting for a video on tables!
@python-programming 17 days ago
Thanks! I'll work on that table video in the near future. As for the math formulae, I don't work with those often, but I have seen some promising models, specifically fine-tunes of tf-id
@sheikhakbar2067 27 days ago ⁺³
Thanks; I have been looking for such a tool.
@python-programming 27 days ago ⁺¹
Glad I could help!
@flyingzeppo 27 days ago ⁺²
Very interesting. Thank you.
@python-programming 27 days ago
Glad you liked it!
@GuidoAmabili 5 days ago
Very interesting, thank you! How would you go about improving the accuracy and training your models to work on specific types of documents? What are the main steps using these new capabilities?
@kn8u 4 days ago
I'm working on a small academic helper chatbot. Can I use this to prepare my documents which are just scans of textbooks? I'll be using the output in the RAG workflow.
@Osman-dy5br 19 days ago ⁺¹
Would this be able to support extracting mathematical formulae?
@python-programming 17 days ago ⁺¹
Good question! Formula is one of the labels. There are a lot of quality models that can convert formulae to LaTeX, so even if the OCR is bad, you could use the bboxes to crop that region and feed the image to a better-quality model for formulae.
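One way to sketch the cropping step mentioned in this reply: convert a region's bounding box into a pixel crop box for a rendered page image. This is an illustration under assumptions, not the author's code: it assumes the box is given as top-left `(x, y)` plus `width` and `height` in the same units as the page size, and that the page image may have been rendered at a different resolution. The function name `bbox_to_crop_box` is hypothetical.

```python
import math


def bbox_to_crop_box(x, y, width, height,
                     page_width, page_height,
                     image_width, image_height):
    """Map a layout bbox to (left, top, right, bottom) in image pixels.

    Assumes (x, y) is the top-left corner of the region and that the bbox
    uses the same units as page_width / page_height.
    """
    sx = image_width / page_width    # horizontal scale: page units -> pixels
    sy = image_height / page_height  # vertical scale: page units -> pixels

    # Round outward so the crop never cuts into the region, then clamp
    # to the image bounds.
    left = max(0, math.floor(x * sx))
    top = max(0, math.floor(y * sy))
    right = min(image_width, math.ceil((x + width) * sx))
    bottom = min(image_height, math.ceil((y + height) * sy))
    return left, top, right, bottom
```

The returned tuple is in the (left, upper, right, lower) order that Pillow's `Image.crop` expects, so the cropped formula region could then be passed to whichever formula-to-LaTeX model is chosen.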
@smazorize 16 days ago
I am struggling to extract tilted and vertical text from PDF documents and embed it back into the PDF so that it is searchable. Do you have a solution for that? The OCRmyPDF library doesn't help. Would spaCy and CV help with this?
@nitondeauricergeson 11 days ago ⁺¹
Can you make a table video?
@python-programming 8 days ago
I definitely will!
@science_electronique 10 days ago ⁺¹
Use Gemini OCR with a good prompt.
@traveling-historian 10 days ago
Thanks for the comment! That's a good suggestion for some use cases, but not all. If bounding boxes and labels are important, then this is better, assuming you have standard typed text. This approach is also faster and runs locally, and it aligns the output as a spaCy Doc, which gives you linguistic analysis too.
