Consider newer SSD Mobilenet model versions
I see the current version only allows ssdMobilenetv1, but it seems like v2 is already available.
How hard would it be to use the newer version instead? Also, what is needed to make it work in this project as a new model?
Edit: I've seen some v3 (here), but I don't know how stable it is.
Issue Analytics
- State:
- Created 4 years ago
- Reactions: 3
- Comments: 13 (4 by maintainers)
Top GitHub Comments
Over the last couple of months I have been working hard on bringing better and faster face detectors to face-api.js. SSD is in fact no longer the best method for face detection; FPNs (feature pyramid networks) are achieving state-of-the-art results nowadays.
I am trying out different backbones for that purpose, and MobileNetV3 is one of them. I have not yet decided which backbone to use and still have to do some evaluation.
I do not want to commit to an ETA yet, but the latest models I have come up with are much more lightweight and already achieve higher accuracy with far fewer parameters than the SSDMobilenetV1 currently provided by face-api.js. I want to make sure that the new models are as close to state-of-the-art performance as possible while being as small and lightweight as possible.
So the new model (or models) I will be releasing will deprecate the currently provided face detection models. That's one of the reasons I do not want to rush things.
Still working on this. I finally have first versions of some models that I am quite satisfied with. A lot of work remains, though, since I want these models not only to detect faces but also to regress 5-point facial landmarks at the same time for face alignment.
Unfortunately, training such a model until convergence takes more than one to two weeks, which slows down the entire process.
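For readers unfamiliar with why 5-point landmarks help with face alignment, here is a minimal sketch of one common approach: rotating the face so that the line between the eyes is horizontal. This is not taken from face-api.js or from the models described above. The landmark order, the type names, and the functions `eyeAngle`, `rotate`, and `alignLandmarks` are all assumptions made for illustration.

```typescript
// A 2D point in image pixel coordinates (x to the right, y downward).
interface Point {
  x: number;
  y: number;
}

// Hypothetical 5-point layout (an assumption, not confirmed by the thread):
// [leftEye, rightEye, nose, leftMouthCorner, rightMouthCorner],
// where "left" means the side with the smaller x in the image.
type FivePointLandmarks = [Point, Point, Point, Point, Point];

// Angle in radians of the line joining the two eyes; 0 means the eyes are level.
function eyeAngle(lm: FivePointLandmarks): number {
  const [leftEye, rightEye] = lm;
  return Math.atan2(rightEye.y - leftEye.y, rightEye.x - leftEye.x);
}

// Rotate a point around a center by `angle` radians.
function rotate(p: Point, center: Point, angle: number): Point {
  const cos = Math.cos(angle);
  const sin = Math.sin(angle);
  const dx = p.x - center.x;
  const dy = p.y - center.y;
  return {
    x: center.x + dx * cos - dy * sin,
    y: center.y + dx * sin + dy * cos,
  };
}

// Rotate all landmarks about the midpoint between the eyes so that
// the eye line becomes horizontal.
function alignLandmarks(lm: FivePointLandmarks): FivePointLandmarks {
  const [leftEye, rightEye] = lm;
  const center: Point = {
    x: (leftEye.x + rightEye.x) / 2,
    y: (leftEye.y + rightEye.y) / 2,
  };
  const angle = -eyeAngle(lm); // undo the measured tilt
  return lm.map((p) => rotate(p, center, angle)) as FivePointLandmarks;
}
```

In practice the same rotation would be applied to the image crop before it is passed to a recognition or landmark-refinement step. A full similarity transform (rotation, scale, and translation to a reference template) is another common choice, which this sketch does not cover.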