Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

PR #13603: NVTX: name threads, CUDA devices and CUDA streams #2337

Open
wants to merge 1 commit into
base: main
Choose a base branch
from

Conversation

copybara-service[bot]
Copy link

PR #13603: NVTX: name threads, CUDA devices and CUDA streams

Imported from GitHub PR openxla/xla#13603

This aims to improve the profiling experience. These names are shown in the Nsight Systems UI.

Device names:
Screenshot 2024-06-10 at 14 52 37

Stream names:
Screenshot 2024-06-10 at 14 53 25

Thread names:
Screenshot 2024-06-10 at 14 54 04

This also provides a missing link between replica IDs in the HLO and the physical devices in the profile.
Copybara import of the project:

--
12a02b67bd9db8b3f69ba1e0d00c7881f767f037 by Olli Lupton olupton@nvidia.com:

NVTX: name threads, CUDA devices and CUDA streams

--
bdf8dbf7700cbe7ce72070c25ce3d21e2dfeb54f by Olli Lupton olupton@nvidia.com:

Add missing header

--
98a80a40add79f108cb89987724c35f82cd727e4 by Olli Lupton olupton@nvidia.com:

add stubs

Merging this change closes #13603

FUTURE_COPYBARA_INTEGRATE_REVIEW=openxla/xla#13603 from olupton:name-devices-streams-and-threads 98a80a40add79f108cb89987724c35f82cd727e4

Imported from GitHub PR openxla/xla#13603

This aims to improve the profiling experience. These names are shown in the Nsight Systems UI.

Device names:
![Screenshot 2024-06-10 at 14 52 37](https://github.com/openxla/xla/assets/6459623/d889d37e-ca2e-4f5e-b5bd-240bbb625b4c)

Stream names:
![Screenshot 2024-06-10 at 14 53 25](https://github.com/openxla/xla/assets/6459623/4bfc4ffa-8fdf-4b93-b23e-95bf056799f3)

Thread names:
![Screenshot 2024-06-10 at 14 54 04](https://github.com/openxla/xla/assets/6459623/8852ca9e-f2f4-4a45-8334-a18f8ab5ce18)

This also provides a missing link between replica IDs in the HLO and the physical devices in the profile.
Copybara import of the project:

--
12a02b67bd9db8b3f69ba1e0d00c7881f767f037 by Olli Lupton <olupton@nvidia.com>:

NVTX: name threads, CUDA devices and CUDA streams

--
bdf8dbf7700cbe7ce72070c25ce3d21e2dfeb54f by Olli Lupton <olupton@nvidia.com>:

Add missing header

--
98a80a40add79f108cb89987724c35f82cd727e4 by Olli Lupton <olupton@nvidia.com>:

add stubs

Merging this change closes #13603

FUTURE_COPYBARA_INTEGRATE_REVIEW=openxla/xla#13603 from olupton:name-devices-streams-and-threads 98a80a40add79f108cb89987724c35f82cd727e4
PiperOrigin-RevId: 643290582
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
1 participant