Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

W and H computation in `crop` method of several target structures seems wrong

See original GitHub issue

🐛 Bug

It seems that the crop() methods present in many target structure classes, e.g. BoxList, BinaryMaskList, and Polygons, calculate the height and width wrongly.

`BoxList` class

In its crop(self, box) method, L173-L178:

        w, h = box[2] - box[0], box[3] - box[1]
        cropped_xmin = (xmin - box[0]).clamp(min=0, max=w)
        cropped_ymin = (ymin - box[1]).clamp(min=0, max=h)
        cropped_xmax = (xmax - box[0]).clamp(min=0, max=w)
        cropped_ymax = (ymax - box[1]).clamp(min=0, max=h)

Since box defines the corner coordinates of the desired crop region, its width and height should be box[2] - box[0] + 1 and box[3] - box[1] + 1 respectively. The correct clamp max bound in the following four lines of code should be w - 1 and h - 1.

Therefore, the above code block should be:

        w, h = box[2] - box[0] + 1, box[3] - box[1] + 1
        cropped_xmin = (xmin - box[0]).clamp(min=0, max=w - 1)
        cropped_ymin = (ymin - box[1]).clamp(min=0, max=h - 1)
        cropped_xmax = (xmax - box[0]).clamp(min=0, max=w - 1)
        cropped_ymax = (ymax - box[1]).clamp(min=0, max=h - 1)

While at first glance it seems that the wrong code cancels itself out, and doesn’t impact the resultant cropped_xs and cropped_ys, it does cause a wrong bounding box size later, in L187 of the same method:

       bbox = BoxList(cropped_box, (w, h), mode="xyxy")

The cropped BoxList would be smaller than the desired crop box, by 1px in both x and y direction.

`BinaryMaskList`, `PolygonInstance`, `PolygonList` class

Similarly, in the crop() method of these classes, the computed w and h are 1px smaller than the correct values.

BinaryMaskList.crop(self, box) L92-L111:

    def crop(self, box):
        ...
        width, height = xmax - xmin, ymax - ymin
        cropped_masks = self.masks[:, ymin:ymax, xmin:xmax]
        cropped_size = width, height
        return BinaryMaskList(cropped_masks, cropped_size)

PolygonInstance.crop(self, box) L242-L268:

    def crop(self, box):
        ...
        w, h = xmax - xmin, ymax - ymin

        cropped_polygons = []
        for poly in self.polygons:
            p = poly.clone()
            p[0::2] = p[0::2] - xmin  # .clamp(min=0, max=w)
            p[1::2] = p[1::2] - ymin  # .clamp(min=0, max=h)
            cropped_polygons.append(p)

        return PolygonInstance(cropped_polygons, size=(w, h))

PolygonList.crop(self, box) L381-L388:

    def crop(self, box):
        w, h = box[2] - box[0], box[3] - box[1]
        cropped_polygons = []
        for polygon in self.polygons:
            cropped_polygons.append(polygon.crop(box))

        cropped_size = w, h
        return PolygonList(cropped_polygons, cropped_size)

Impact

The impact is usually unfelt if one just runs the code base as is, since these mistakes seem quite self-consistent. However, besides the implementation being fundamentally incorrect, it also adds unnecessary annoyances to people developing new functionalities based on this code base.

Issue Analytics

State:
Created 4 years ago
Reactions:5
Comments:12 (12 by maintainers)

Top GitHub Comments

2reactions

ppwwyyxxcommented, Jun 11, 2019

So again it is just an annotation convention of your data format. If the right bottom point is excluded, there is no “+1”, but if the right bottom point is included, there should be “+1”.

It is a matter of convention, but there are good convention and bad convention.

“+1” is a bad convention because when x2 = x1 + w - 1, x2 is not scale-invariant, meaning that all code written to scale the boxes are not precise. Without the “+1”, a box that’s half of the image is simply (0.0, 0.0, 128.0, 128.0).

2reactions

ppwwyyxxcommented, Jun 10, 2019

why there should be “+1” when converting a box from “xyxy” to “xywh” if the coordinates of this box are 0-indexed absolute coordinates (floats)

You’re right and there should not be. This is just a historical issue and we’re slowly correcting it (e.g., https://github.com/pytorch/pytorch/commit/373e6a78bf6afaf66d2749740036b077b70312c8 )

Top Results From Across the Web

Latest Developed Strategies to Minimize the Off-Target Effects ...

Researchers have discovered in vivo/vitro biochemical assays and algorithm-based computational approaches to detect and quantify the off-target ...

Clarifier Calculations

Calculate the Detention Time in for a clarifier with a volume of 25,000 gallons that receives a flow of 310,000 gal/day. Hydraulic Loading....

Highly accurate protein structure prediction with AlphaFold

Here we provide the first computational method that can regularly predict protein structures with atomic accuracy even in cases in which no ...

Analysis of Series of Cultivar Trials with Perennial Grasses for ...

This paper presented different methods to analyze trials with perennial species adapted for the division of a target region into zones. Methods for...

Guide 6: Basic Business Operations for the Entrepreneur - Citi

information and many of the tools for you to make the operational decisions – ... partnership, means there is no distinction between business...