Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

Documentation: Should the OCR-D processor run on RGB or binarized images?

See original GitHub issue

Should the OCR-D processor run on a RGB or a binarized image input group?

I think it would be best if the README listed an example, e.g.:

ocrd-eynollah-segment -I <WHICH ONE?> -O SEG-LINE -P xyz abc

Issue Analytics

State:
Created 2 years ago
Comments:14 (14 by maintainers)

Top GitHub Comments

1reaction

cneudcommented, Mar 30, 2022

Ok, how about this then?

Use as OCR-D processor Eynollah ships with a CLI interface to be used as OCR-D processor. In this case, the source image file group with (preferably) RGB images should be used as input like this: ocrd-eynollah-segment -I OCR-D-IMG -O SEG-LINE -P models In fact, the image provided by @imageFilename in PAGE-XML is passed on directly to Eynollah as a processor, so that e.g. ocrd-eynollah-segment -I OCR-D-IMG-BIN -O SEG-LINE -P models will still use the original (RGB) image despite any binarization that may have occured in previous OCR-D processing steps

0reactions

cneudcommented, Mar 30, 2022

OK I’ve amended this accordingly in https://github.com/qurator-spk/eynollah/commit/441c8566dda5cc2b37fd92a39236dc595a547298 and will close here once the PR for the README update has been merged.

Top Results From Across the Web

support AlternativeImage input in OCR-D processor #32

Segmentation works on raw RGB images, but respects and retains AlternativeImages from binarization steps, so it's a good idea to do binarization first, ......

OCR-D Workflow Guide

In this processing step, a scanned colored /gray scale document image is taken as input and a black and white binarized image is...

Improve OCR Accuracy With Advanced Image Preprocessing

When setting up an OCR solution, utilizing advanced image preprocessing will increase accuracy levels of your OCR engine.

Pre-Processing in OCR!!!

For an OCR system, an Image is a multidimensional array (2D array if the image is grayscale (or) binary, 3D array if the...

image processing to improve tesseract OCR accuracy

The Tesseract documentation contains some good details on how to improve the OCR quality via image processing steps.