[CustomDevice] use CommContextManager to create xccl comm #57957

Merged
merged 1 commit into from
Oct 10, 2023

@ronny1996 ronny1996 commented Oct 9, 2023

PR types

Others

PR changes

Others

Description

Use CommContextManager to create and get the XCCL comm.
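
As a rough illustration of the "create if absent, then get" pattern this description refers to, here is a minimal sketch. `FakeCommContext`, `FakeCommRegistry`, and `GetOrCreateComm` are hypothetical stand-ins invented for this example; they are not Paddle's `CommContextManager` API, and the real signatures may differ.

```cpp
#include <cassert>
#include <memory>
#include <string>
#include <unordered_map>

// Hypothetical stand-in for a communication context; not a Paddle class.
struct FakeCommContext {
  int rank;
  int size;
};

// Hypothetical registry, loosely modeled on the Has/Create/Get usage
// visible in this PR. It is an assumption, not Paddle's implementation.
class FakeCommRegistry {
 public:
  static FakeCommRegistry& GetInstance() {
    static FakeCommRegistry instance;  // process-wide singleton
    return instance;
  }

  bool Has(const std::string& key) const { return contexts_.count(key) > 0; }

  // Registers a new context; does nothing if the key already exists.
  void Create(const std::string& key, int rank, int size) {
    if (Has(key)) return;
    contexts_.emplace(
        key, std::make_unique<FakeCommContext>(FakeCommContext{rank, size}));
  }

  // Returns the context for the key, or nullptr if none is registered.
  FakeCommContext* Get(const std::string& key) const {
    auto it = contexts_.find(key);
    return it == contexts_.end() ? nullptr : it->second.get();
  }

 private:
  FakeCommRegistry() = default;
  std::unordered_map<std::string, std::unique_ptr<FakeCommContext>> contexts_;
};

// Create-or-get, keyed by the ring id converted to a string.
FakeCommContext* GetOrCreateComm(int ring_id, int rank, int size) {
  auto& registry = FakeCommRegistry::GetInstance();
  const std::string key = std::to_string(ring_id);
  if (!registry.Has(key)) {
    registry.Create(key, rank, size);
  }
  return registry.Get(key);
}

// Small usage demo: a second call with the same ring id reuses the context.
void DemoCreateOrGet() {
  FakeCommContext* first = GetOrCreateComm(7, 0, 2);
  FakeCommContext* second = GetOrCreateComm(7, 1, 2);
  assert(first == second);
}
```

Keying the registry by ring id means repeated initialization calls for the same ring reuse one context instead of creating a new one each time.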

paddle-bot bot commented Oct 9, 2023

Your PR has been submitted. Thanks for your contribution!
Please wait for the CI results first. See the Paddle CI Manual for details.

@ronny1996 ronny1996 force-pushed the comm_init_op branch 4 times, most recently from a2dbd4e to fada7ff Compare October 10, 2023 03:02
@qili93 qili93 left a comment

LGTM

auto store = phi::distributed::CreateOrGetGlobalTCPStore();
if (!phi::distributed::CommContextManager::GetInstance().Has(
        std::to_string(rid))) {
  phi::distributed::CommContextManager::CreateXCCLCommContext(

Contributor

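Please check whether this needs to be compatible with both the new and old communication libraries. For NCCL, the flag FLAGS_dynamic_static_unified_comm is used to switch between the new and old communication libraries.

The check on lines 124-127 has already been updated on the develop branch. This PR does not seem to report a conflict, so you may want to update from develop manually to avoid overwriting that change.
(The current flag check is incorrect.)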
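
For readers unfamiliar with this kind of flag-based switch, here is a minimal sketch of the idea. `CommFlags`, `CommBackend`, `SelectCommBackend`, and `InitComm` are hypothetical names invented for this example; the boolean field only stands in for a runtime flag such as FLAGS_dynamic_static_unified_comm, and none of this is taken from Paddle's code.

```cpp
#include <cassert>
#include <string>

// Hypothetical stand-in for a runtime flag such as
// FLAGS_dynamic_static_unified_comm; not Paddle's flag machinery.
struct CommFlags {
  bool unified_comm;
};

enum class CommBackend { kLegacy, kUnified };

// Chooses the communication path from the flag value.
CommBackend SelectCommBackend(const CommFlags& flags) {
  return flags.unified_comm ? CommBackend::kUnified : CommBackend::kLegacy;
}

// Returns a label describing which path would initialize the given ring.
// A real implementation would create the communicator here instead.
std::string InitComm(const CommFlags& flags, int ring_id) {
  const std::string key = std::to_string(ring_id);
  switch (SelectCommBackend(flags)) {
    case CommBackend::kUnified:
      return "unified:" + key;
    case CommBackend::kLegacy:
      return "legacy:" + key;
  }
  return "unknown:" + key;  // unreachable; keeps compilers quiet
}

// Small usage demo of both branches.
void DemoFlagSwitch() {
  assert(InitComm(CommFlags{true}, 0) == "unified:0");
  assert(InitComm(CommFlags{false}, 0) == "legacy:0");
}
```

A backend that only supports the new library can skip the branch entirely and always take the unified path.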

Contributor Author

Custom devices no longer need to be compatible with the old communication library.

@ronny1996 ronny1996 merged commit 56d4476 into PaddlePaddle:develop Oct 10, 2023
27 checks passed
@ronny1996 ronny1996 deleted the comm_init_op branch October 10, 2023 09:54
jiahy0825 pushed a commit to jiahy0825/Paddle that referenced this pull request Oct 16, 2023
danleifeng pushed a commit to danleifeng/Paddle that referenced this pull request Nov 14, 2023
3 participants