Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

RuntimeError: NCCL error in: /opt/conda/conda-bld/pytorch_1603729138878/work/torch/lib/c10d/ProcessGroupNCCL.cpp:31, unhandled cuda error, NCCL version 2.7.8 #18

Open
liutianyi00 opened this issue Jun 21, 2024 · 1 comment

Comments

@liutianyi00
Copy link

请问如何解决呢,我的cuda是12.4,其他包的版本都是按照您github上的版本安装的

@yuhongtian17
Copy link
Owner

12.4的nvcc cuda对mmrotate-0.3.3/0.3.4来说太高了。请先降低cuda版本,把rotated faster rcnn r50跑通。我们STD这个仓库代码README所述的环境配置是在2080Ti上进行的,可能与更先进的GPU存在兼容性问题。是一个在3090/4090/A100/H100 GPU服务器上均已验证可行的环境配置中文说明,供您参考。

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants