OCR Support on DepthAI (and megaAI)

Start with the why:

Text detection combined with optical character recognition (OCR) is extremely useful in all sorts of automation and perception applications, but it is also extremely resource-heavy. Being able to offload this work to DepthAI would therefore be a very valuable feature.

Move to the how:

The EAST text detector seems quite useful because it can return rotated boxes, so it keeps working when the camera's horizon is not aligned with the text.
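To make the idea of a rotated box concrete, here is a minimal sketch of how such a box could be turned into corner points. It assumes the box is already available as a center, a size, and an angle in degrees; that representation and the function name `rotated_box_corners` are illustrative assumptions, not the EAST output format or a DepthAI API.

```python
import math


def rotated_box_corners(cx, cy, width, height, angle_deg):
    """Return the four corners of a rotated box as (x, y) tuples.

    Hypothetical helper: the box is described by its center (cx, cy),
    its size, and a rotation angle in degrees. Image coordinates are
    assumed, with x to the right and y downward. Corners are listed in
    the box's own order: top-left, top-right, bottom-right, bottom-left.
    """
    theta = math.radians(angle_deg)
    cos_t, sin_t = math.cos(theta), math.sin(theta)
    half_w, half_h = width / 2.0, height / 2.0

    # Corner offsets in the box's own (unrotated) frame.
    local = [(-half_w, -half_h), (half_w, -half_h),
             (half_w, half_h), (-half_w, half_h)]

    # Rotate each offset by the box angle and shift it to the center.
    return [(cx + dx * cos_t - dy * sin_t,
             cy + dx * sin_t + dy * cos_t) for dx, dy in local]


if __name__ == "__main__":
    for corner in rotated_box_corners(10, 10, 4, 2, 30):
        print(corner)
```

With an angle of 0 the result is an ordinary axis-aligned rectangle; any other angle yields the tilted quadrilateral that a detector with rotated-box support would report for slanted text.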

This region of interest can then be warped back to straight, horizontal text. Using the Gen2 Pipeline Builder (https://github.com/luxonis/depthai/issues/136), the warped region can then be fed into a second-stage neural network (or another method) that performs OCR on the horizontal text.
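The warping step can be sketched in plain Python as follows. This is only an illustration under stated assumptions: the image is a grayscale list of rows, the box is given as center, size, and angle in degrees, and sampling is nearest-neighbor. The function name `warp_rotated_roi` is hypothetical and is not part of DepthAI; on a host, OpenCV's `cv2.getRotationMatrix2D` together with `cv2.warpAffine` would be the more usual way to do the same thing.

```python
import math


def warp_rotated_roi(image, cx, cy, width, height, angle_deg):
    """Crop a rotated box out of `image` and return it axis-aligned.

    Hypothetical helper. `image` is a list of rows of pixel values.
    The output has `height` rows and `width` columns. For every output
    pixel, the matching source position is found by rotating the pixel's
    offset from the box center by the box angle (inverse mapping).
    Source positions outside the image are filled with 0.
    """
    theta = math.radians(angle_deg)
    cos_t, sin_t = math.cos(theta), math.sin(theta)
    rows, cols = len(image), len(image[0])

    out = []
    for v in range(height):
        dy = v - (height - 1) / 2.0  # offset from the box center, box frame
        row = []
        for u in range(width):
            dx = u - (width - 1) / 2.0
            # Rotate the offset into image coordinates.
            sx = cx + dx * cos_t - dy * sin_t
            sy = cy + dx * sin_t + dy * cos_t
            ix, iy = int(round(sx)), int(round(sy))
            inside = 0 <= ix < cols and 0 <= iy < rows
            row.append(image[iy][ix] if inside else 0)
        out.append(row)
    return out


if __name__ == "__main__":
    # A 5x5 image with "text" running vertically down column 2.
    image = [[0] * 5 for _ in range(5)]
    image[1][2], image[2][2], image[3][2] = 1, 2, 3

    # A 3x1 box centered on the text and rotated by 90 degrees.
    print(warp_rotated_roi(image, 2, 2, 3, 1, 90))  # [[1, 2, 3]]
```

The straightened crop is what a second-stage OCR network would receive. Inverse mapping is used here so that every output pixel gets exactly one value, which avoids the holes that forward mapping can leave.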

Move to the what:

Support an out-of-the-box capability to do text detection and OCR of the detected text.

Issue Analytics

  • State: closed
  • Created: 3 years ago
  • Reactions: 2
  • Comments: 13

Top GitHub Comments

3 reactions
Luxonis-Vlad commented, May 28, 2022

@schreiterjp Can you please try it again now?

2 reactions
schreiterjp commented, May 29, 2022

Thanks for the insane fast support @Luxonis-Brandon and @vladvyskocil now the model was downloaded and is finally working. 🎉
