OCR Support on DepthAI (and megaAI)
See original GitHub issueStart with the why
:
Text detection combined with optical character recognition (OCR) is extremely useful in all sorts of automation and perception applications, but is also extremely resource heavy. So being able to offload this to DepthAI would be a very valuable feature.
Move to the how
:
The EAST text detector seems quite useful and has the capability return rotated boxes (example below) so it allows functioning where the camera’s horizon is not oriented with the text.
This region of interest can then be warped back to a straight/horizontal text, which using Gen2 Pipeline Builder (https://github.com/luxonis/depthai/issues/136), can be fed into a second-stage neural network (or otherwise) that does the OCR on the horizontal text.
Move to the what
:
Support an out-of-the-box capability to do text detection and OCR of the detected text.
Issue Analytics
- State:
- Created 3 years ago
- Reactions:2
- Comments:13
@schreiterjp Can you please try it again now?
Thanks for the insane fast support @Luxonis-Brandon and @vladvyskocil now the model was downloaded and is finally working. 🎉