How should I configure the target classes when building my custom dataset (esp. not-present class)?
Hi,
I’m trying to build my own dataset that would work with DETR. I’m a bit confused on the data format that the matcher expects to perform the matching.
From the comment at https://github.com/facebookresearch/detr/blob/a54b77800eb8e64e3ad0d8237789fcbf2f8350c5/models/matcher.py#L44, it seems the targets should not include any 0 / null class padding for queries where no object is present. In other words, if I have a batch with 2 images, one with 3 objects and one with 4 objects, and `num_queries` is 100, then `targets[0]['labels']` has length 3 and `targets[1]['labels']` has length 4. Is that correct? I should not pass the matcher any null / padding class to indicate that something is not present?
If so, my confusion is: how is DETR able to avoid predicting duplicate boxes? Say DETR predicts 50 "present" boxes, and the ground truth I supply has 3 boxes. Then the matcher would pick the best-matching 3 of the 50 predictions and penalize only those 3, while the other 47 boxes are not penalized. They could well be nonsense, but "present", boxes that confuse the result.
I must be missing something here. Please help : )
Top GitHub Comments
Hi @LeoZDong
Your understanding of the matcher is correct. Note however that the matcher is not computing any loss, this is the job of the SetCriterion. Specifically, here is what happens for the label supervision: https://github.com/facebookresearch/detr/blob/a54b77800eb8e64e3ad0d8237789fcbf2f8350c5/models/detr.py#L116-L119
By default, the unmatched queries are supervised to predict the "no-object" class; the corresponding index for this class is `self.num_classes` in the snippet above. For the matched queries, this default label is replaced with the label of the matched ground-truth object.

The matching cost is a combination of a localization cost and a classification cost. There is no general answer to your question; it all depends on the specific situation. If a box has very good localization but very poor classification, then depending on the exact values as well as the coefficients for both costs, it's possible that the GT will be matched to a different box with slightly worse localization but better classification.
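To illustrate the label-target construction described above, here is a minimal plain-Python sketch (the real `SetCriterion.loss_labels` does the same thing with `torch.full` and tensor indexing; `build_label_targets`, the example indices, and the class numbers are all hypothetical):

```python
# Sketch: every query defaults to the "no-object" index (num_classes),
# and only the queries matched by the Hungarian matcher get a real label.
num_classes = 91    # e.g. COCO; the no-object index is therefore 91
num_queries = 100

def build_label_targets(matched, labels):
    """matched: list of (query_idx, gt_idx) pairs from the matcher for one image;
    labels: ground-truth class indices for that image."""
    target_classes = [num_classes] * num_queries  # default: no-object
    for query_idx, gt_idx in matched:
        target_classes[query_idx] = labels[gt_idx]
    return target_classes

# 3 GT objects matched to queries 4, 17, and 63:
t = build_label_targets([(4, 0), (17, 2), (63, 1)], [3, 7, 12])
# The 97 unmatched queries are all supervised toward class 91 (no-object),
# which is what suppresses duplicate "present" predictions at inference.
```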
I hope this helps, feel free to reach out if you have further questions.
Thank you so much for your feedback.
What I mean is: do I need to include images that don't have any airplanes in my dataset, and if so, how? When you annotate an image without creating any bounding boxes (because there is nothing in it), no entry is created in the JSON annotations. I have already trained a model and it performs very well on images that do contain objects, but on images with no objects it performs very badly.
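One way to handle this, assuming a COCO-style annotation layout (the file names, ids, and category here are made up), is to keep the background-only image in the `"images"` list even though it contributes zero entries to `"annotations"`. The dataset then yields an empty target for it, so every query on that image is supervised toward the no-object class:

```python
# Sketch of a COCO-style annotation dict with one negative (object-free) image.
coco = {
    "images": [
        {"id": 1, "file_name": "airplane.jpg", "width": 640, "height": 480},
        {"id": 2, "file_name": "empty_sky.jpg", "width": 640, "height": 480},  # no objects
    ],
    "annotations": [
        {"id": 10, "image_id": 1, "category_id": 1,
         "bbox": [100, 120, 200, 80], "area": 16000, "iscrowd": 0},
        # image_id 2 intentionally has no annotation entries
    ],
    "categories": [{"id": 1, "name": "airplane"}],
}

# Per-image lookup: image 2 maps to an empty list, which the dataset turns
# into empty "labels"/"boxes" tensors, i.e. a valid all-background target.
anns_per_image = {img["id"]: [] for img in coco["images"]}
for ann in coco["annotations"]:
    anns_per_image[ann["image_id"]].append(ann)
```

Note that some COCO loaders (e.g. `torchvision.datasets.CocoDetection`) will happily return an empty annotation list for such images, so the main thing to check is that your dataset wrapper does not crash on, or silently drop, images with zero boxes.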