Whole line (multiple words) recognition
Referring to issue https://github.com/emedvedev/attention-ocr/issues/59: are the modifications below sufficient to recognize multiple words (a whole line)?
- Include the space character in `CHARMAP` in `aocr/util/data_gen.py`.
- Increase `max-prediction` for the predicted phrase.
- Increase `max-width` for the input image.
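
A minimal sketch of the first change, assuming a charmap laid out as a list with a few reserved slots for special tokens followed by the characters the model can emit. The names `SPECIAL_SLOTS`, `BASE_CHARS`, `CHARMAP_WITH_SPACE`, and `encode_label` are illustrative only and are not taken from `aocr/util/data_gen.py`; the real file may be organized differently.

```python
import string

# Hypothetical layout: reserved slots for special tokens,
# then the characters the model is allowed to predict.
SPECIAL_SLOTS = ['', '', '']
BASE_CHARS = list(string.digits + string.ascii_uppercase)

# Appending ' ' gives the space character its own class id.
CHARMAP_WITH_SPACE = SPECIAL_SLOTS + BASE_CHARS + [' ']


def encode_label(text, charmap):
    """Map each character to its charmap index; fail loudly on unknown ones."""
    lookup = {ch: i for i, ch in enumerate(charmap) if ch != ''}
    missing = sorted(set(text) - set(lookup))
    if missing:
        raise ValueError('characters not in charmap: %r' % missing)
    return [lookup[ch] for ch in text]


# A two-word label now encodes without error.
ids = encode_label('HELLO WORLD', CHARMAP_WITH_SPACE)
```

Without the space entry, the same call raises `ValueError`, which is a quick way to check that every training label is covered before starting a run.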
Issue Analytics
- State:
- Created 5 years ago
- Comments:11 (5 by maintainers)
Top GitHub Comments
Hi @hiepph, sorry I didn’t see this message sooner. I’m using max-width 380, max-height 32, and max-prediction 75 with a few tens of thousands of images. You don’t want max-height to be too large, because every image gets stretched to that size, which destroys the aspect ratio. You should be able to see some results with only 1000 training images, but I believe you need a lot more to get really good results.
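
To make the aspect-ratio point concrete, here is a rough sketch of the arithmetic, assuming an image is scaled to the target height and then has to fit inside the width budget. `scaled_width` and `fits` are hypothetical helpers, not part of aocr, and the library's actual resizing logic may differ.

```python
def scaled_width(orig_w, orig_h, target_h):
    """Width that preserves the aspect ratio when the height becomes target_h."""
    return round(orig_w * target_h / orig_h)


def fits(orig_w, orig_h, target_h, max_w):
    """True if the scaled image fits within max_w without horizontal squeezing."""
    return scaled_width(orig_w, orig_h, target_h) <= max_w


# A hypothetical 570x48 text-line image.
w_at_32 = scaled_width(570, 48, 32)  # width needed at height 32
w_at_64 = scaled_width(570, 48, 64)  # width needed at height 64

ok_at_32 = fits(570, 48, 32, max_w=380)
ok_at_64 = fits(570, 48, 64, max_w=380)
```

Doubling the target height doubles the width needed to keep the proportions, so a line that fits a 380-pixel budget at height 32 no longer fits at height 64 and would have to be distorted.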
Yes, that’s why --full-ascii exists too, @hiepph. I’ve done this successfully.
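
For reference, printable ASCII runs from code point 32 (space) to 126 (`~`), so a character set built on that range covers spaces along with lowercase letters and punctuation. The sketch below only enumerates that range and checks labels against it; whether `--full-ascii` uses exactly this range is an assumption worth verifying against the aocr source, and `uncovered` is a hypothetical helper.

```python
# Printable ASCII: code points 32 (space) through 126 ('~').
PRINTABLE_ASCII = [chr(i) for i in range(32, 127)]


def uncovered(labels, charset):
    """Return the sorted characters in labels that charset does not contain."""
    allowed = set(charset)
    return sorted({ch for label in labels for ch in label} - allowed)


# Spaces and punctuation are covered; non-ASCII characters are not.
ascii_gaps = uncovered(['Hello, world!'], PRINTABLE_ASCII)
accent_gaps = uncovered(['café au lait'], PRINTABLE_ASCII)
```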