Documentation: Should the OCR-D processor run on RGB or binarized images?

Should the OCR-D processor run on an RGB or a binarized image input group?

I think it would be best if the README listed an example, e.g.:

ocrd-eynollah-segment -I <WHICH ONE?> -O SEG-LINE -P xyz abc

Issue Analytics

  • State: closed
  • Created 2 years ago
  • Comments: 14 (14 by maintainers)

Top GitHub Comments

cneud commented, Mar 30, 2022

Ok, how about this then?

Use as OCR-D processor

Eynollah ships with a CLI that can be used as an OCR-D processor. In this case, the source image file group with (preferably) RGB images should be used as input, like this:

ocrd-eynollah-segment -I OCR-D-IMG -O SEG-LINE -P models

In fact, the image referenced by @imageFilename in the PAGE-XML is passed directly to Eynollah as a processor, so that e.g.

ocrd-eynollah-segment -I OCR-D-IMG-BIN -O SEG-LINE -P models

will still use the original (RGB) image despite any binarization that may have occurred in previous OCR-D processing steps.
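To check which image a given PAGE-XML file points to through @imageFilename, something like the following may help. It is only a rough sketch: the PAGE-XML content, the file names, and the helper function `page_image_filename` are made up for illustration, and `sed` is used for a quick look rather than as a robust XML parser.

```shell
# Hypothetical minimal PAGE-XML file, written to a temp file for illustration.
# In a real workspace this would be a file from a group such as OCR-D-IMG-BIN.
page_xml="$(mktemp)"
cat > "$page_xml" <<'EOF'
<?xml version="1.0" encoding="UTF-8"?>
<PcGts xmlns="http://schema.primaresearch.org/PAGE/gts/pagecontent/2019-07-15">
  <Page imageFilename="OCR-D-IMG/page_0001.tif" imageWidth="2000" imageHeight="3000">
    <AlternativeImage filename="OCR-D-IMG-BIN/page_0001.png" comments="binarized"/>
  </Page>
</PcGts>
EOF

# Print the value of the imageFilename attribute of the Page element.
# (Hypothetical helper; assumes the attribute sits on the same line as <Page.)
page_image_filename() {
  sed -n 's/.*<Page[^>]* imageFilename="\([^"]*\)".*/\1/p' "$1"
}

original_image="$(page_image_filename "$page_xml")"
echo "$original_image"   # prints OCR-D-IMG/page_0001.tif
```

In this made-up file the binarized derivative is recorded as an AlternativeImage, while @imageFilename still names the original image, which is the one the comment above says is passed on to Eynollah.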

cneud commented, Mar 30, 2022

OK I’ve amended this accordingly in https://github.com/qurator-spk/eynollah/commit/441c8566dda5cc2b37fd92a39236dc595a547298 and will close here once the PR for the README update has been merged.

Top Results From Across the Web

  • support AlternativeImage input in OCR-D processor #32: Segmentation works on raw RGB images, but respects and retains AlternativeImages from binarization steps, so it's a good idea to do binarization first, ...
  • OCR-D Workflow Guide: In this processing step, a scanned colored/grayscale document image is taken as input and a black and white binarized image is...
  • Improve OCR Accuracy With Advanced Image Preprocessing: When setting up an OCR solution, utilizing advanced image preprocessing will increase accuracy levels of your OCR engine.
  • Pre-Processing in OCR!!!: For an OCR system, an Image is a multidimensional array (2D array if the image is grayscale (or) binary, 3D array if the...
  • image processing to improve tesseract OCR accuracy: The Tesseract documentation contains some good details on how to improve the OCR quality via image processing steps.
