Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

Better results with lang 'OCRB'

See original GitHub issue

I have played around a bit with this tool and got much better results with lang ‘OCRB’ which can be fetched from here: https://github.com/Exteris/tesseract-mrz/tree/master/lang

After adding it to tesseract data dir, I have added to the following line -l OCRB: https://github.com/konstantint/PassportEye/blob/master/passporteye/util/ocr.py#L30

Just wanted to let you know - you probably want to include this by default into this tool or add a hint to the README.

Issue Analytics

State:
Created 6 years ago
Reactions:1
Comments:12 (4 by maintainers)

Top GitHub Comments

1reaction

hunaidkhan2000commented, Aug 13, 2020

@krupeshxebia yes i commented the same thing the results get poor if i use OCRB , The data needs to be trained or tessdata needs to get updated by google to provide us better results.

1reaction

hunaidkhan2000commented, Aug 11, 2020

Hi @kmanadkat
let’s say if you have downloaded OCRB pretrained data you just need to specify it while reading mrz file like

mrz = read_mrz('abu_2.jpg',extra_cmdline_params='-l OCRB') -##### you can change OCRB to your pretrained data, Also Make sure you have the file in tesseract folder tessdata folder.