
OCR Support on DepthAI (and megaAI)


Start with the `why`:

Text detection combined with optical character recognition (OCR) is extremely useful in all sorts of automation and perception applications, but it is also extremely resource-heavy. Being able to offload this work to DepthAI would therefore be a very valuable feature.

Move to the `how`:

The EAST text detector seems quite useful and can return rotated bounding boxes, so it still works when the text is not aligned with the camera's horizon.

Each rotated region of interest can then be warped back to straight, horizontal text. Using the Gen2 Pipeline Builder (https://github.com/luxonis/depthai/issues/136), that straightened crop can be fed into a second-stage neural network (or some other stage) that performs OCR on the horizontal text.
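The warping step can be illustrated with a minimal sketch. It assumes the detector reports each box as a centre, a width and height, and a rotation angle in degrees; that parameterisation, the function name `crop_rotated_box`, and the list-of-lists image are illustrative assumptions and not part of DepthAI or EAST. The sketch uses nearest-neighbour sampling in pure Python to keep it self-contained; a real pipeline would more likely use an image library's affine or perspective warp.

```python
import math


def crop_rotated_box(image, cx, cy, w, h, angle_deg):
    """Resample a rotated box from `image` into an upright h x w crop.

    `image` is a list of rows (image[y][x]). The box is centred at (cx, cy),
    has size w x h, and is rotated by `angle_deg` (hypothetical convention:
    positive angles rotate the box's x-axis toward the image's y-axis).
    Pixels that fall outside the image are filled with 0.
    """
    a = math.radians(angle_deg)
    cos_a, sin_a = math.cos(a), math.sin(a)
    rows, cols = len(image), len(image[0])

    out = []
    for v in range(h):
        row = []
        for u in range(w):
            # Offset of the output pixel from the crop centre.
            dx = u - (w - 1) / 2
            dy = v - (h - 1) / 2
            # Rotate the offset into image coordinates and shift to the box centre.
            x = cx + dx * cos_a - dy * sin_a
            y = cy + dx * sin_a + dy * cos_a
            xi, yi = int(round(x)), int(round(y))
            inside = 0 <= yi < rows and 0 <= xi < cols
            row.append(image[yi][xi] if inside else 0)
        out.append(row)
    return out


# Tiny 5x5 test image where each pixel encodes its own position as 10*y + x.
image = [[10 * y + x for x in range(5)] for y in range(5)]
upright = crop_rotated_box(image, cx=2, cy=2, w=3, h=1, angle_deg=0)
vertical = crop_rotated_box(image, cx=2, cy=2, w=3, h=1, angle_deg=90)
```

Here `upright` reads three pixels along the middle row, while `vertical` reads the same-sized box rotated by 90 degrees, that is, three pixels down the middle column laid out as a horizontal strip. That horizontal strip is what a second-stage OCR network would consume.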

Move to the `what`:

Support an out-of-the-box capability to do text detection and OCR of the detected text.

Issue Analytics

State:
Created 3 years ago
Reactions: 2
Comments: 13

Top GitHub Comments

3 reactions

Luxonis-Vlad commented, May 28, 2022

@schreiterjp Can you please try it again now?

2 reactions

schreiterjp commented, May 29, 2022

Thanks for the insanely fast support, @Luxonis-Brandon and @vladvyskocil. The model has now downloaded and is finally working. 🎉
