LlamaOCR - Building your Own Private OCR System

EASIEST Way to Fine-Tune a LLM and Use It With Ollama

LlamaParse: Convert PDF (with tables) to Markdown

Nec Red Rockets Kawasaki vs. LP Bank Ninh Binh - Pool B | Highlights | Club World Champs 2024

หัวหน้าแก๊งพาลูกสาวไปกินไก่ทอด เจอกลุ่มนักเลงหาเรื่อง เลยจัดการพวกนั้นจนพ่ายแพ้

LIVE🔴 : Singapore vs Thailand | ASEAN Championship 2024 | 17.12.24

How to Train Tesseract OCR Engine 5 on Custom Data

SL7 Tech

มุมมอง 5 358

เพิ่มลงใน
- เพลย์ลิสต์ของฉัน
- ดูภายหลัง
แชร์

แชร์

ฝัง

ขนาดวิดีโอ:

แสดงแผงควบคุมโปรแกรมเล่น

เล่นอัตโนมัติ

เล่นใหม่

เผยแพร่เมื่อ 22 ม.ค. 2025

ความคิดเห็น • 20

@SL7Tech 3 หลายเดือนก่อน ⁺¹
Important: The name of your image and ground truth file must match without the extension while preparing the dataset. Otherwise the trainer will throw an error.
@nicolasvegaquevedo2006 3 หลายเดือนก่อน
excellent video, thank you
@Arun-ku7kq 5 วันที่ผ่านมา
By far the best explanation of tesseract training.. 👌🏼
@ArT-yt3ng 6 ชั่วโมงที่ผ่านมา
Thanks a lot bro. You are literally my savior for today. Thanks a bunch.
@aritradeb1935 หลายเดือนก่อน
MOst of my data has two lines. What to do in that case?
@DangKhang2811 14 วันที่ผ่านมา
can i use file png and box in data bro ?
@appsscope2487 2 หลายเดือนก่อน
If I need to train in Arabic numbers, can I do it in the same way? because there is no Arabic number dataset to download!!
@SL7Tech 2 หลายเดือนก่อน
@appsscope2487 you can create dataset yourself and yes follow this procedure for fine tuning. remember to pass language type as RTL.
@inkmaze 2 หลายเดือนก่อน
I got combine_tessdata failed at 12:39 pls help
@SL7Tech 2 หลายเดือนก่อน ⁺¹
@@inkmaze can you share the log
@inkmaze 2 หลายเดือนก่อน
@@SL7Tech Sure
You are using make version: 4.4.1
combine_tessdata -u ../tessdata//deu_latf.traineddata data/deu_latf/engplus
process_begin: CreateProcess(NULL, combine_tessdata -u ../tessdata//deu_latf.traineddata data/deu_latf/engplus, ...) failed.
make (e=2): The system cannot find the file specified.
make: *** [Makefile:207: data/deu_latf/engplus.lstm-unicharset] Error 2
@inkmaze 2 หลายเดือนก่อน ⁺¹
@@SL7Tech Oh I forgot to add Tesseract to path LOL
@SidhuOp 3 หลายเดือนก่อน
Since pytesseract is terrible with alphanumeric words, can we train it with those kind of datasets
@st1np 3 หลายเดือนก่อน
true, I've been trying for a long time to train for the Consolas alphanumeric font, but tesseract it's very inaccurate. HELP
@markmacharia5187 2 หลายเดือนก่อน
I ran into this error"$ make training MODEL_NAME=kernsys START_MODEL=eng TESSDATA=../tessdata/ MAX_ITERATIONS=2000 LEARNING_RATE=0.001
You are using make version: 4.4.1
tesseract "data/kernsys-ground-truth/image_001.png" data/kernsys-ground-truth/image_001 --psm 13 lstm.train
No box data found in 'data/kernsys-ground-truth/image_001.box'.
Failed to read boxes from data/kernsys-ground-truth/image_001.png
Error during processing.
make: *** [Makefile:248: data/kernsys-ground-truth/image_001.lstmf] Error 1
"
@SL7Tech 2 หลายเดือนก่อน
make sure that ground truth file is not empty
@markmacharia5187 2 หลายเดือนก่อน
@SL7Tech it is not empty
@paulp4061 หลายเดือนก่อน
Ran into same error. In my case it was an empty (zero bytes) file with .box extension which was apparently created during one of the previous failed attempts to run the command. After deleting the file it worked.

ต่อไป

เล่นอัตโนมัติ

LlamaOCR - Building your Own Private OCR System

LlamaOCR - Building your Own Private OCR System

EASIEST Way to Fine-Tune a LLM and Use It With Ollama

EASIEST Way to Fine-Tune a LLM and Use It With Ollama

LlamaParse: Convert PDF (with tables) to Markdown

LlamaParse: Convert PDF (with tables) to Markdown

Nec Red Rockets Kawasaki vs. LP Bank Ninh Binh - Pool B | Highlights | Club World Champs 2024

Nec Red Rockets Kawasaki vs. LP Bank Ninh Binh - Pool B | Highlights | Club World Champs 2024

หัวหน้าแก๊งพาลูกสาวไปกินไก่ทอด เจอกลุ่มนักเลงหาเรื่อง เลยจัดการพวกนั้นจนพ่ายแพ้

หัวหน้าแก๊งพาลูกสาวไปกินไก่ทอด เจอกลุ่มนักเลงหาเรื่อง เลยจัดการพวกนั้นจนพ่ายแพ้

LIVE🔴 : Singapore vs Thailand | ASEAN Championship 2024 | 17.12.24

LIVE🔴 : Singapore vs Thailand | ASEAN Championship 2024 | 17.12.24

【พากย์ไทย】ฮ่องเต้เมาและหลับไปกับนางใน แต่นางในตั้งท้องมังกรทันที จึงได้รับการแต่งตั้งเป็นพระมเหสี

【พากย์ไทย】ฮ่องเต้เมาและหลับไปกับนางใน แต่นางในตั้งท้องมังกรทันที จึงได้รับการแต่งตั้งเป็นพระมเหสี

Training Tesseract 5 for a New Font

Training Tesseract 5 for a New Font

Tesseract OCR: What is it and is it the BEST for you?

Tesseract OCR: What is it and is it the BEST for you?

Google’s New AI Is Recreating the Whole World to Unlock Superhuman Intelligence

Google’s New AI Is Recreating the Whole World to Unlock Superhuman Intelligence

C# Оптимизация оперативной памяти

C# Оптимизация оперативной памяти

Optical Character Recognition (OCR) - Computerphile

Optical Character Recognition (OCR) - Computerphile

How to Build Effective AI Agents (without the hype)

How to Build Effective AI Agents (without the hype)

How to Auto Label Your Custom Dataset with Roboflow in 2 Minutes

How to Auto Label Your Custom Dataset with Roboflow in 2 Minutes

Llama 3.2-vision: The best open vision model?

Llama 3.2-vision: The best open vision model?

YOLOv8 | How to Train for Object Detection on a Custom Dataset | Computer Vision

YOLOv8 | How to Train for Object Detection on a Custom Dataset | Computer Vision

MARK 마크 '프락치 (Fraktsiya) (Feat. 이영지)' MV

MARK 마크 '프락치 (Fraktsiya) (Feat. 이영지)' MV

#นายกแพทองธาร ลงพื้นที่มอบถุงยังชีพ บริเวณ ซ.พัฒนาการคูขวาง ๑๐ (ถ.ท่าโพธิ์) จ.นครศรีธรรมราช

#นายกแพทองธาร ลงพื้นที่มอบถุงยังชีพ บริเวณ ซ.พัฒนาการคูขวาง ๑๐ (ถ.ท่าโพธิ์) จ.นครศรีธรรมราช

【หนังพากย์ไทย】ยอดฝีมือสังหารนักโทษ แต่นักโทษเป็นปรมาจารย์กังฟูที่ซ่อนอยู่ เขาจัดการทั้งหมดในทันที

【หนังพากย์ไทย】ยอดฝีมือสังหารนักโทษ แต่นักโทษเป็นปรมาจารย์กังฟูที่ซ่อนอยู่ เขาจัดการทั้งหมดในทันที

ศึกมวยไทยพันธมิตร 16/12/2024

ศึกมวยไทยพันธมิตร 16/12/2024

LIVE🔴 : Singapore vs Thailand | ASEAN Championship 2024 | 17.12.24

LIVE🔴 : Singapore vs Thailand | ASEAN Championship 2024 | 17.12.24

🎄✨ Puff is saving Christmas again with his incredible baking skills! #PuffTheBaker #thatlittlepuff

🎄✨ Puff is saving Christmas again with his incredible baking skills! #PuffTheBaker #thatlittlepuff

แมนยู Corner : คุยหลังเกม แมนฯซิตี้ 1-2 แมนฯยู ชัยชนะมาจากอโมริมกล้าตัด แรชฟอร์ด , การ์นาโช

แมนยู Corner : คุยหลังเกม แมนฯซิตี้ 1-2 แมนฯยู ชัยชนะมาจากอโมริมกล้าตัด แรชฟอร์ด , การ์นาโช

Live!🔴 สิงคโปร์ VS ทีมชาติไทย เชียร์สดฟุตบอลฟุตบอล ASEAN Mitsubishi Electric Cup™ 2024

Live!🔴 สิงคโปร์ VS ทีมชาติไทย เชียร์สดฟุตบอลฟุตบอล ASEAN Mitsubishi Electric Cup™ 2024