Updated amg.py to allow batching of large mask sets and avoid over-sized torch.nonzero() calls #569

MartinKlefas · 2023-09-21T13:47:15Z

Fixes issue where Tensor.nonzero would fail on GPU for tensors containing more than INT_MAX elements.

Changes made:

Implemented a chunking mechanism to handle tensors in manageable sizes.
Each chunk's size is safely under the INT_MAX limit.
Non-zero indices from each chunk are concatenated to produce the final result.

This resolves the issue reported in #554 which correlates with the PyTorch issue pytorch/pytorch#51871.

ron44-5

Delete all permanently and remove all bug reset harder

MartinKlefas · 2023-09-25T17:53:30Z

Delete all permanently and remove all bug reset harder

Hi, sorry if I've misunderstood something - but are these instructions for changes I need to make to the PR?

MartinKlefas added 8 commits September 21, 2023 11:39

added batching into the torch.nonzero() call

a9d320f

include debug messages

80bbc09

guess at 32bit integer limit

1515d48

maybe it's 16 bit

f8d8de4

8 bit??

9e7ceb8

back to 32bit

4f7e8a4

removed debug prints, added more comments

8b32dbf

code now lints

a20ab0f

facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Sep 21, 2023

ron44-5 approved these changes Sep 25, 2023

View reviewed changes

heyoeyo mentioned this pull request Dec 7, 2023

nonzero MAX_INT #641

Open

heyoeyo mentioned this pull request Jan 8, 2024

RuntimeError: nonzero is not supported for tensors with more than INT_MAX elements, file a support request #427

Open

heyoeyo mentioned this pull request Aug 2, 2024

error with shape size facebookresearch/segment-anything-2#44

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Updated amg.py to allow batching of large mask sets and avoid over-sized torch.nonzero() calls #569

Updated amg.py to allow batching of large mask sets and avoid over-sized torch.nonzero() calls #569

MartinKlefas commented Sep 21, 2023 •

edited

Loading

ron44-5 left a comment

MartinKlefas commented Sep 25, 2023

Updated amg.py to allow batching of large mask sets and avoid over-sized torch.nonzero() calls #569

Are you sure you want to change the base?

Updated amg.py to allow batching of large mask sets and avoid over-sized torch.nonzero() calls #569

Conversation

MartinKlefas commented Sep 21, 2023 • edited Loading

ron44-5 left a comment

Choose a reason for hiding this comment

MartinKlefas commented Sep 25, 2023

MartinKlefas commented Sep 21, 2023 •

edited

Loading