Better results with lang 'OCRB'

I have played around a bit with this tool and got much better results with lang ‘OCRB’ which can be fetched from here:

After adding it to the Tesseract data directory, I added -l OCRB to the following line:

Just wanted to let you know: you probably want to include this by default in this tool, or add a hint to the README.

Issue Analytics

  • State: open
  • Created 6 years ago
  • Reactions: 1
  • Comments: 12 (4 by maintainers)

Top GitHub Comments

hunaidkhan2000 commented, Aug 13, 2020

@krupeshxebia yes, I commented the same thing: the results get poor if I use OCRB. The data needs to be trained, or the tessdata needs to be updated by Google to give us better results.

hunaidkhan2000 commented, Aug 11, 2020

Hi @kmanadkat,
let's say you have downloaded the OCRB pretrained data; you just need to specify it when reading the MRZ file, like this:

mrz = read_mrz('abu_2.jpg', extra_cmdline_params='-l OCRB')  # you can change OCRB to your own pretrained data

Also make sure the file is in the tessdata folder of your Tesseract installation.
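The advice above can be sketched as a small wrapper that checks for the language file before calling read_mrz. This is only an illustration, not part of the tool: the helper names (find_traineddata, build_lang_params, read_mrz_with_lang) are hypothetical, and it assumes that tessdata_dir is the directory that directly contains the .traineddata files and that the passporteye package is installed.

```python
from pathlib import Path


def find_traineddata(lang, tessdata_dir):
    """Return the path to <lang>.traineddata inside tessdata_dir, or None if absent."""
    candidate = Path(tessdata_dir) / f"{lang}.traineddata"
    return candidate if candidate.is_file() else None


def build_lang_params(lang):
    """Build the extra Tesseract command-line parameters that select a language."""
    return f"-l {lang}"


def read_mrz_with_lang(image_path, lang="OCRB", tessdata_dir="tessdata"):
    """Read the MRZ from an image using a specific Tesseract language file.

    Hypothetical wrapper: fails early with a clear error when the language
    file is missing from the given tessdata directory.
    """
    if find_traineddata(lang, tessdata_dir) is None:
        raise FileNotFoundError(
            f"{lang}.traineddata not found in {tessdata_dir}"
        )
    # Imported lazily so the helpers above work without passporteye installed.
    from passporteye import read_mrz

    return read_mrz(image_path, extra_cmdline_params=build_lang_params(lang))
```

For example, read_mrz_with_lang('abu_2.jpg', lang='OCRB', tessdata_dir='/path/to/tessdata') makes the same call as the line above once the file check passes; the path shown is a placeholder.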
