fixing image_encoder to work with cuda_graphs #393

HDCharles · 2023-05-24T06:00:29Z

Stack from ghstack (oldest at bottom):

-> fixing image_encoder to work with cuda_graphs #393

Summary: the combination of tensors on multiple devices in get_rel_pos
was preventing cuda graphs from correctly optimizing things

Test Plan:

Reviewers:

Subscribers:

Tasks:

Tags:

Summary: the combination of tensors on multiple devices in get_rel_pos was preventing cuda graphs from correctly optimizing things Test Plan: Reviewers: Subscribers: Tasks: Tags: [ghstack-poisoned]

Summary: the combination of tensors on multiple devices in get_rel_pos was preventing cuda graphs from correctly optimizing things Test Plan: Reviewers: Subscribers: Tasks: Tags: ghstack-source-id: 0fea0e19e5bf0ee44a19669fe33e7e16002a55af Pull Request resolved: #393

vkuzo · 2023-05-24T15:13:50Z

segment_anything/modeling/image_encoder.py

@@ -315,8 +315,8 @@ def get_rel_pos(q_size: int, k_size: int, rel_pos: torch.Tensor) -> torch.Tensor
        rel_pos_resized = rel_pos

    # Scale the coords with short length if shapes for q and k are different.
-    q_coords = torch.arange(q_size)[:, None] * max(k_size / q_size, 1.0)
-    k_coords = torch.arange(k_size)[None, :] * max(q_size / k_size, 1.0)
+    q_coords = (torch.arange(q_size).to(rel_pos.device)[:, None] * max(k_size / q_size, 1.0))


nit: torch.arange(q_size, device=rel_pos.device)

Summary: the combination of tensors on multiple devices in get_rel_pos was preventing cuda graphs from correctly optimizing things Test Plan: Reviewers: Subscribers: Tasks: Tags: [ghstack-poisoned]

Summary: the combination of tensors on multiple devices in get_rel_pos was preventing cuda graphs from correctly optimizing things Test Plan: Reviewers: Subscribers: Tasks: Tags: ghstack-source-id: 2256f130bb8249403710e1048ef69385ff71aed2 Pull Request resolved: #393

fixing image_encoder to work with cuda_graphs

9aacb82

Summary: the combination of tensors on multiple devices in get_rel_pos was preventing cuda graphs from correctly optimizing things Test Plan: Reviewers: Subscribers: Tasks: Tags: [ghstack-poisoned]

facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label May 24, 2023

HDCharles marked this pull request as draft May 24, 2023 06:01

vkuzo reviewed May 24, 2023

View reviewed changes

Update on "fixing image_encoder to work with cuda_graphs"

51bc7a2

Summary: the combination of tensors on multiple devices in get_rel_pos was preventing cuda graphs from correctly optimizing things Test Plan: Reviewers: Subscribers: Tasks: Tags: [ghstack-poisoned]

HDCharles requested a review from vkuzo May 30, 2023 20:37

HDCharles requested review from cpuhrsch and ericmintun May 30, 2023 20:38

HDCharles marked this pull request as ready for review May 30, 2023 20:39

HDCharles requested a review from HannaMao May 30, 2023 20:44

2lambda123 mentioned this pull request Dec 2, 2023

fixing image_encoder to work with cuda_graphs 2lambda123/segment-anything#1

Merged

7 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fixing image_encoder to work with cuda_graphs #393

fixing image_encoder to work with cuda_graphs #393

HDCharles commented May 24, 2023 •

edited

Loading

vkuzo May 24, 2023

HDCharles May 30, 2023

fixing image_encoder to work with cuda_graphs #393

Are you sure you want to change the base?

fixing image_encoder to work with cuda_graphs #393

Conversation

HDCharles commented May 24, 2023 • edited Loading

vkuzo May 24, 2023

Choose a reason for hiding this comment

HDCharles May 30, 2023

Choose a reason for hiding this comment

HDCharles commented May 24, 2023 •

edited

Loading