An annotation for an image (OD in the name for Object Detection)
A mask of regions defined by one or more polygons.
A mask of regions defined by one or more polygons. The masked object(s) should have the same label.
A mask of regions defined by RLE.
A mask of regions defined by RLE. The masked object(s) should have the same label. This class corresponds to "uncompressed RLE" of COCO dataset. RLE is a compact format for binary masks. Binary masks defines the region by assigning a boolean to every pixel of the image. RLE compresses the binary masks by instead recording the runs of trues and falses in the binary masks. RLE is an array of integer. The first element is the length of run of falses staring from the first pixel. The second element of RLE is the is the length of first run of trues. e.g. binary masks: 00001110000011 RLE: ---4--3----5-2 ====> 4,3,5,2
Also note that we don't use COCO's "compact" RLE string here because this RLE class has better time & space performance.
An annotation for an image (OD in the name for Object Detection)
the Id of the image
the Id of the category. Note that categoryId is not continuous from 0 to the number of categories. You can use COCODataset.cateId2Idx to convert an categoryId to a compact category index.
the segmentation data
area
the bounding box, (xmin, ymin, xmax, ymax)
if the annotation is a crowd. e.g. a crowd of people. If true, segmentation is an COCORLE object
the reference to the image