Add tensor-based annotation storage to reduce DDP RAM usage with large COCO-format datasets #1885

Open
wants to merge 6 commits into base: master
Conversation

@spsancti (Contributor) commented Mar 5, 2024

Source of the bug: pytorch/pytorch#13246
Solution based on: facebookresearch/detectron2@0cd0e72
Fixes: #1214
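The linked PyTorch issue describes copy-on-write memory growth: a Python list of annotation dicts holds many small objects whose refcounts are written on every access, so fork-based DataLoader workers dirty the shared pages and RAM grows per worker. The detectron2-style fix serializes all annotations into one contiguous byte buffer plus a cumulative offset array. A minimal self-contained sketch of that idea (illustrative class and attribute names; the PR itself follows detectron2 and uses torch tensors, while this sketch uses numpy to stay standalone):

```python
import pickle
import numpy as np

class TensorBackedAnnotations:
    """Store per-sample annotations as one contiguous uint8 buffer.

    A flat byte array has no per-object refcounts, so fork-based workers
    keep the pages shared instead of copying them (pytorch/pytorch#13246).
    """

    def __init__(self, annotations):
        # Serialize each sample independently so samples stay addressable.
        blobs = [pickle.dumps(a, protocol=pickle.HIGHEST_PROTOCOL) for a in annotations]
        # Cumulative end offsets: sample i occupies [addr[i-1], addr[i]).
        self._addr = np.cumsum([len(b) for b in blobs]).astype(np.int64)
        self._annotations = np.frombuffer(b"".join(blobs), dtype=np.uint8)

    def __len__(self):
        return len(self._addr)

    def __getitem__(self, sample_id):
        start = 0 if sample_id == 0 else int(self._addr[sample_id - 1])
        end = int(self._addr[sample_id])
        return pickle.loads(self._annotations[start:end].tobytes())
```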

@spsancti spsancti changed the title Add tensor-based annotation storage to reduce DDP RAM usage with large datasets Add tensor-based annotation storage to reduce DDP RAM usage with large COCO-format datasets Mar 5, 2024
@NatanBagrov (Contributor) left a comment:

Thanks! See comments inline.

Comment on lines 150 to 152
start_addr = 0 if sample_id == 0 else self._addr[sample_id - 1].item()
end_addr = self._addr[sample_id].item()
annotation = pickle.loads(self._annotations[start_addr:end_addr].numpy().data)
@NatanBagrov (Contributor):
We saw a slowdown due to pickle load; do we want to make the serialize-parse fix optional?
I mean, eventually a memory leak is a memory leak, but for small datasets you get overhead, whereas without the fix "you'd be fine". @BloodAxe, thoughts?

Collaborator:
@NatanBagrov yes IMO
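The trade-off the reviewers agree on here (pickle overhead per `__getitem__` vs. lower multi-worker RAM) can be made opt-in with a flag. A hedged sketch of what that could look like; the function and flag names below are illustrative, not the PR's actual API, though the flag mirrors the `use_tensor_backed_storage` parameter discussed later in the review:

```python
import pickle
import numpy as np

def build_annotation_storage(annotations, use_tensor_backed_storage=False):
    """Return either the plain list (fastest access, small datasets) or a
    tensor-backed read-only view (lower RAM under fork-based workers)."""
    if not use_tensor_backed_storage:
        # Small datasets: skip the per-access pickle round-trip entirely.
        return annotations
    blobs = [pickle.dumps(a) for a in annotations]
    addr = np.cumsum([len(b) for b in blobs]).astype(np.int64)
    buf = np.frombuffer(b"".join(blobs), dtype=np.uint8)

    class _Reader:
        def __len__(self):
            return len(addr)

        def __getitem__(self, i):
            start = 0 if i == 0 else int(addr[i - 1])
            return pickle.loads(buf[start:int(addr[i])].tobytes())

    return _Reader()
```

Either storage yields the same samples, so the rest of the dataset code does not need to know which backend is active.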


@NatanBagrov NatanBagrov requested a review from shaydeci May 27, 2024 06:59
@shaydeci (Collaborator) left a comment:

To me this looks quite ready.
I do have one more request: please add a unit test that exercises this feature so we can see that nothing crashes. You can use our data in tests/data/coco2017 (see how we use it, for example, in tests/unit_tests/preprocessing_unit_test.py).
It can be a simple test that just iterates through the dataset with use_tensor_backed_storage set.
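A sketch of such a test. The real version would construct the COCO dataset from tests/data/coco2017 with use_tensor_backed_storage=True and iterate it; the stand-in below only exercises the storage round-trip so it stays self-contained (all names and the dummy samples are illustrative):

```python
import pickle
import numpy as np

def test_tensor_backed_storage_iteration():
    # Dummy annotations standing in for the COCO samples in tests/data/coco2017.
    samples = [{"image_id": i, "bbox": [i, i, i + 1, i + 1]} for i in range(5)]
    # Pack them the same way the tensor-backed storage does.
    blobs = [pickle.dumps(s) for s in samples]
    addr = np.cumsum([len(b) for b in blobs])
    buf = np.frombuffer(b"".join(blobs), dtype=np.uint8)
    # Iterating must reproduce every sample without crashing.
    for i, expected in enumerate(samples):
        start = 0 if i == 0 else int(addr[i - 1])
        assert pickle.loads(buf[start:int(addr[i])].tobytes()) == expected
    return True
```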

@@ -52,6 +56,7 @@ def __init__(
:param with_crowd: Add the crowd groundtruths to __getitem__
:param class_ids_to_ignore: List of class ids to ignore in the dataset. By default, doesnt ignore any class.
:param tight_box_rotation: This parameter is deprecated and will be removed in a SuperGradients 3.8.
:param use_tensor_backed_storage: Whether to use tensor-backed storage to mitigate the Python memory leak with large datasets ()
Collaborator:

Maybe give an estimate of what's considered "large" as a recommendation, from your experience.

Successfully merging this pull request may close these issues.

YOLO-NAS Training Halted with 'Killed' Error Message
4 participants