
Train on datasets larger than memory

See original GitHub issue

I wish to train a Mask R-CNN model, but I can't fit all of the training-set annotations in memory as a list[dict] (as I understand it, this is required when using DatasetCatalog).

How can we train on really large datasets, where the annotations alone are vastly larger than available memory?
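For context, this is roughly what the registration path in question looks like: the function handed to DatasetCatalog.register is called once and is expected to return every annotation dict at the same time, which is where the memory pressure comes from. A minimal sketch (the dataset name, file path, and class list below are placeholders):

```python
# Minimal sketch of the standard Detectron2 registration path.
# "my_dataset", "annotations.json", and the class list are placeholders.
import json

from detectron2.data import DatasetCatalog, MetadataCatalog


def load_all_annotations():
    # Called once by Detectron2; must materialize every record at the
    # same time -- this is the memory bottleneck described above.
    with open("annotations.json") as f:
        return json.load(f)  # list[dict] in Detectron2's dataset format


DatasetCatalog.register("my_dataset", load_all_annotations)
MetadataCatalog.get("my_dataset").thing_classes = ["example_class"]
```

Detectron2 does serialize this list into a compact internal buffer, but the entire annotation set still has to fit in RAM at once, regardless of batch size.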

Issue Analytics

  • State: closed
  • Created: 2 years ago
  • Comments: 7 (2 by maintainers)

Top GitHub Comments

2 reactions
zhanghang1989 commented, Mar 25, 2022

> @zhanghang1989 Hey, does the D2Go cache option require training from the more limited D2Go Model Zoo? If so, is there any way to train on datasets whose annotations don't fit into memory and still be able to choose from any detectron2 model?
>
> (I have a very large dataset to train, but am not interested in a model optimized for mobile deployment.)

D2Go should support training for all Detectron2 model configs.

0 reactions
austinmw commented, Mar 24, 2022

@zhanghang1989 Hey, does the D2Go cache option require training from the more limited D2Go Model Zoo? If so, is there any way to train on datasets whose annotations don't fit into memory and still be able to choose from any detectron2 model?

(I have a very large dataset to train, but am not interested in a model optimized for mobile deployment.)
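If the D2Go route is not an option, one plain-Detectron2 workaround (not mentioned in the thread, and not the D2Go cache feature) is to bypass DatasetCatalog entirely: recent versions of build_detection_train_loader also accept a PyTorch IterableDataset, so annotations can be streamed from disk one record at a time. A minimal sketch, assuming the annotations are stored as one Detectron2-format JSON object per line; the file name, model config, and batch size are placeholders:

```python
# Hedged sketch: stream annotation dicts from a JSON-lines file instead
# of registering them through DatasetCatalog. Assumes a recent Detectron2
# in which build_detection_train_loader accepts an IterableDataset.
import json

import torch
from detectron2 import model_zoo
from detectron2.config import get_cfg
from detectron2.data import DatasetMapper, build_detection_train_loader


class JsonlAnnotations(torch.utils.data.IterableDataset):
    """Yields raw Detectron2-format dicts one at a time from disk."""

    def __init__(self, path):
        self.path = path

    def __iter__(self):
        with open(self.path) as f:
            for line in f:
                # Each record must carry the usual dataset-dict keys
                # ("file_name", "height", "width", "annotations", ...).
                yield json.loads(line)


cfg = get_cfg()
cfg.merge_from_file(model_zoo.get_config_file(
    "COCO-InstanceSegmentation/mask_rcnn_R_50_FPN_3x.yaml"))

train_loader = build_detection_train_loader(
    JsonlAnnotations("annotations.jsonl"),      # hypothetical file
    mapper=DatasetMapper(cfg, is_train=True),   # loads images, applies augmentations
    total_batch_size=2,
    num_workers=0,  # >0 would duplicate the stream unless sharded per worker
)
```

Note that an iterable dataset cannot be combined with Detectron2's samplers, so shuffling has to be handled separately, e.g. by pre-shuffling the annotation shards on disk.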

Read more comments on GitHub >

Top Results From Across the Web

Training on Large Datasets That Don't Fit In Memory in Keras
Training your Deep Learning algorithms on a huge dataset that is too large to fit in memory? If yes, this article will be...
Read more >
tensorflow - Training on datasets too big to fit in RAM
I am using TensorFlow to train on a very large dataset, which is too large to fit in RAM. Therefore, I have split...
Read more >
Performance tips | TensorFlow Datasets
Large datasets are sharded (split in multiple files) and typically do not fit in memory, so they should not be cached. Shuffle and...
Read more >
Training models when data doesn't fit in memory
The data is not huge and it actually fits in memory, but it's big enough so we can demonstrate memory usage gains with...
Read more >
Train Models on Large Datasets
Estimators implemented in Dask-ML work well with Dask Arrays and DataFrames. These can be much larger than a single machine's RAM. They can...
Read more >
