You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Describe the Bug
I'm running a program with apex in my anaconda3 environment. But meet with the following error:
...
File ".../anaconda3/envs/valor/lib/python3.9/site-packages/apex/transformer/pipeline_parallel/schedules/common.py", line 14, in<module>
from apex.transformer.tensor_parallel.layers import (
File ".../anaconda3/envs/valor/lib/python3.9/site-packages/apex/transformer/tensor_parallel/__init__.py", line 21, in<module>
from apex.transformer.tensor_parallel.layers import (
File ".../anaconda3/envs/valor/lib/python3.9/site-packages/apex/transformer/tensor_parallel/layers.py", line 32, in<module>
from apex.transformer.tensor_parallel.mappings import (
File ".../anaconda3/envs/valor/lib/python3.9/site-packages/apex/transformer/tensor_parallel/mappings.py", line 29, in<module>
torch.distributed.reduce_scatter_tensor = torch.distributed._reduce_scatter_base
AttributeError: module 'torch.distributed' has no attribute '_reduce_scatter_base'
Minimal Steps/Code to Reproduce the Bug
I installed apex with the following steps:
Describe the Bug
I'm running a program with apex in my anaconda3 environment. But meet with the following error:
Minimal Steps/Code to Reproduce the Bug
I installed apex with the following steps:
git clone https://github.com/NVIDIA/apex.git cd apex pip install -v --disable-pip-version-check --no-build-isolation --no-cache-dir ./
I also tried with the following steps:
or
But the methods all don't work.
Environment
Here is my environment info:
The text was updated successfully, but these errors were encountered: