Tesseract OCR - Create Trained data for Seven segment (Sample)

  Views: 31,698

Automation Control Hub

Hi everyone,
I want to share how to create trained data for Tesseract OCR.
After many attempts, I found a simple way to do this (my own version, of course, lol). Your approach might be better than mine, but I hope this will be useful for anyone facing problems while creating trained data.
Here are the tools required to create the trained data:
Serak Trainer V0.4 for Tesseract 3.0
code.google.co...
jTessBoxEditor 2.3.0 and VietOCR
sourceforge.ne...
My text document, which I created in Microsoft Excel and exported to .txt (tab-delimited)
drive.google.c...
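The training text above was made in Excel and exported as tab-delimited .txt. A rough Python sketch of producing a file of that general shape without Excel follows. The layout (rows of random digit groups) and the names make_training_lines, write_training_text, and seven_segment_training.txt are assumptions for illustration only; they are not the contents of the linked document.

```python
import random


def make_training_lines(rows=20, cols=8, digits_per_cell=4, seed=0):
    """Return tab-delimited rows of random digit groups (assumed layout)."""
    rng = random.Random(seed)  # fixed seed so the same text is produced each run
    lines = []
    for _ in range(rows):
        cells = [
            "".join(rng.choice("0123456789") for _ in range(digits_per_cell))
            for _ in range(cols)
        ]
        lines.append("\t".join(cells))  # tab between cells, like Excel's export
    return lines


def write_training_text(path, lines):
    """Write the rows to a plain UTF-8 text file, one row per line."""
    with open(path, "w", encoding="utf-8", newline="\n") as handle:
        handle.write("\n".join(lines) + "\n")


if __name__ == "__main__":
    # Preview a few rows; call write_training_text(...) to save them instead.
    print("\n".join(make_training_lines(rows=3)))
```

To save the text, call write_training_text("seven_segment_training.txt", make_training_lines()) and adjust the row and column counts to taste.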
Thanks for watching, guys...

Comments: 38
@fopix7667 3 years ago
You are the best, thank you very much !!!
@yeeteng2427 3 years ago
Hi, I get this error when trying to use VietOCR: "Exception has been thrown by the target of invocation." How should I solve this?
@username---------- 2 years ago
Is it possible to create a dataset from images alone? I would like it to be able to scan the covers of games, magazines, vinyl records, and similar objects. For example, I would train it to recognise a brand of Xbox game, like Call of Duty, from its logos and fonts, using thousands of photos of the games. Do you know if this is possible?
@jennifermesh5780 5 months ago
Did you find a solution? I'm also looking for the same thing.
@kshitijsoni9227 3 years ago
White spaces are added randomly between two characters. Can you suggest anything to resolve this issue?
@pujanbhatt9747 3 years ago
I want to create my own character set, the same as you did for numbers. Can you help me out?
@farazsoftinfo 3 years ago
Hi, thanks for making this video. Can you make a video on fine-tuning? Thanks.
@justinjagdeep 4 years ago
Try to use a better mic next time. Thanks.
@0xPanda1 3 years ago
What did you mean by fine-tuning?
@VonchkynProduction 3 years ago
For some reason, I can't find my font in jTessBoxEditor. Does anyone know what to do?
@dikzhead 4 years ago
Hey, I want to train data from my handwriting. Can I use your method to train data from an image file?
@automationcontrolhub 4 years ago
I have never tried it, but yes, it might work. You can convert your image file to .tiff format and train on it. Hope this helps.
@dikzhead 4 years ago
@@automationcontrolhub Yes, thank you. I just found a way to convert my handwriting into a font, so I can follow your steps. Thank you.
@automationcontrolhub 4 years ago
@@dikzhead You're welcome.
@dikzhead 4 years ago
@@automationcontrolhub Oh, you're Indonesian, haha. I'd like to ask: I have trained my data, but there is no normproto file in the training data folder, and that file is needed to combine the tessdata, so I can't combine it. Why is that?
@pujanbhatt9747 3 years ago
@@dikzhead Hey, I want to convert my handwriting into a font. Can you help me out?
@haiuuyen107 4 years ago
Thank you! Could you let me know how to add the Serak font to jTessBoxEditor? Even though the .tiff file ran successfully, the font does not appear in the options.
@antonioiesce3285 3 years ago
Yes, I have the same problem. Did you find a solution?
@Kerbargos 1 year ago
On some Windows machines there are two buttons for installing fonts: Install For Me and Install For All Users. Java only lists fonts installed for all users.
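A quick way to check which of the two installs a font received is to look in the folders Windows normally uses: %WINDIR%\Fonts for all users and %LOCALAPPDATA%\Microsoft\Windows\Fonts for the current user. The sketch below assumes those default locations; the function name font_install_scope and the file name MySevenSegment.ttf are hypothetical, and the script only inspects folders, it does not ask Java what it can see.

```python
import os
from pathlib import Path


def _contains(directory, filename):
    """True if the directory exists and holds the file (case-insensitive match)."""
    directory = Path(directory)
    if not directory.is_dir():
        return False
    wanted = filename.lower()
    return any(entry.name.lower() == wanted for entry in directory.iterdir())


def font_install_scope(font_filename, system_dir=None, user_dir=None):
    """Return 'all users', 'current user only', or 'not found' for a font file."""
    # Assumed default Windows font folders; pass explicit paths if yours differ.
    if system_dir is None:
        system_dir = Path(os.environ.get("WINDIR", r"C:\Windows")) / "Fonts"
    if user_dir is None:
        local_app_data = os.environ.get("LOCALAPPDATA", "")
        user_dir = Path(local_app_data) / "Microsoft" / "Windows" / "Fonts"

    if _contains(system_dir, font_filename):
        return "all users"
    if _contains(user_dir, font_filename):
        return "current user only"
    return "not found"


if __name__ == "__main__":
    # Hypothetical font file name; replace it with your own .ttf file.
    print(font_install_scope("MySevenSegment.ttf"))
```

If the result is "current user only", reinstalling the font with Install For All Users is the fix this comment describes.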
@Wansovi1z 1 month ago
@@Kerbargos Man, I don't know you, but I love you! Thanks so much, you saved me.
@matemaniaindonesia3635 4 years ago
like this
@about-technology5874 4 years ago
What about Tesseract 4.0?
@automationcontrolhub 4 years ago
Tesseract 4.0 takes a different approach. Most developers use JavaScript and Java, I think.
@ifeanyinnaemego 2 years ago
Can I train it to understand handwritten text?
@automationcontrolhub 1 year ago
I have not tested that so far.
@sabucyril-b3l 2 months ago
Error while training data.
@Barklo69 3 years ago
Maybe you could put more gain on the mic before uploading the video. Thanks.
@automationcontrolhub 3 years ago
I will re-upload with better audio. Thanks for the suggestion.
@xiaonan238 3 years ago
Audio too soft
@RoboGenesHimanshuVerma 3 years ago
This audio has been recorded with a potato 😂
@ytp-c3p 10 months ago
Bro, is something stuck in your mouth?
@mohamedshili8429 4 years ago
I hope this will work! If it does, I'll share and subscribe.
@harmindersinghnijjar 1 year ago
This is torture.
@IamTheGreatCornholioo 3 years ago
wtf, it's impossible to understand what you are doing, way too many information gaps
@hak14971 3 years ago
Try to speak clearly and in proper sentences next time. We are not mind readers. You are mumbling into a poor-quality mic.
@shitpost_xxx 3 years ago
What does 7-segment mean? Why don't you just use the characters a..z and 0..9?