Use COCO Mask Parsing from pycocotools #8630

david-csnmedia · 2024-09-03T20:59:03Z

🚀 The feature

The CocoDetection v2 transform wrapper attempts to decode the mask itself, but pycocotools provides a high performance implementation already. We have had to copy from master, this _dataset_wrapper.py because of a bug related to the handling of these masks that was fixed in master but not installable using pip yet.

https://github.com/pytorch/vision/blob/main/torchvision/tv_tensors/_dataset_wrapper.py#L402

Seeing torchvision.datasets.CocoDetection has self.coco as a COCO() object, let's use it.

       coco_ann = dataset.coco.imgToAnns[image_id]

        if "masks" in target_keys:
            target["masks"] = tv_tensors.Mask(
                    torch.stack([
                        torch.from_numpy(dataset.coco.annToMask(ann))
                        for ann in coco_ann
                    ])
                )

Motivation, pitch

There have already been bugs related to this, and there's no need to reinvent the wheel. Instead, let's use the existing implementation.

Alternatives

No response

Additional context

No response

The text was updated successfully, but these errors were encountered:

NicolasHug · 2024-09-04T08:59:25Z

Thanks for opening the issue @david-csnmedia . I'm happy for you to open a PR and see if the tests are passing

venkatram-dev mentioned this issue Sep 10, 2024

use_pycocotools_for_segment_mask #8640

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Use COCO Mask Parsing from pycocotools #8630

Use COCO Mask Parsing from pycocotools #8630

david-csnmedia commented Sep 3, 2024

NicolasHug commented Sep 4, 2024

Use COCO Mask Parsing from pycocotools #8630

Use COCO Mask Parsing from pycocotools #8630

Comments

david-csnmedia commented Sep 3, 2024

🚀 The feature

Motivation, pitch

Alternatives

Additional context

NicolasHug commented Sep 4, 2024