Better results with lang 'OCRB'
See original GitHub issueI have played around a bit with this tool and got much better results with lang ‘OCRB’ which can be fetched from here: https://github.com/Exteris/tesseract-mrz/tree/master/lang
After adding it to tesseract data dir, I have added to the following line -l OCRB
: https://github.com/konstantint/PassportEye/blob/master/passporteye/util/ocr.py#L30
Just wanted to let you know - you probably want to include this by default into this tool or add a hint to the README.
Issue Analytics
- State:
- Created 6 years ago
- Reactions:1
- Comments:12 (4 by maintainers)
Top Results From Across the Web
How to get the most accurate results with Tesseract OCR
Now for my question, what can I do so that Tesseract produces much more accurate results? My 30 training samples consisted of photos...
Read more >Writing in OCR-B, size 1 - TeX - LaTeX Stack Exchange
The documentation suggests that opentype is preferable if possible, so I decided to try that and got rather better looking results.
Read more >How to improve accuracy for OCR? - Google Groups
The training process should be pretty straightforward and I'd expect good results since all I have to deal with is one font (OCR-B),...
Read more >OCR B - Adobe Fonts
Explore OCR B designed by Adrian Frutiger at Adobe Fonts. ... Subscribe to Creative Cloud to use more fonts ... Learn more about...
Read more >OCR-B - Wikipedia
OCR-B. Article Talk · Language · Watch · Edit. OCR-B is a monospace font developed in 1968 by Adrian Frutiger for Monotype by...
Read more >
Top Related Medium Post
No results found
Top Related StackOverflow Question
No results found
Troubleshoot Live Code
Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start Free
Top Related Reddit Thread
No results found
Top Related Hackernoon Post
No results found
Top Related Tweet
No results found
Top Related Dev.to Post
No results found
Top Related Hashnode Post
No results found
@krupeshxebia yes i commented the same thing the results get poor if i use OCRB , The data needs to be trained or tessdata needs to get updated by google to provide us better results.
Hi @kmanadkat
let’s say if you have downloaded OCRB pretrained data you just need to specify it while reading mrz file like
mrz = read_mrz('abu_2.jpg',extra_cmdline_params='-l OCRB') -##### you can change OCRB to your pretrained data,
Also Make sure you have the file in tesseract folder tessdata folder.