[XPU] fix gm allocaion on XPUContext::Impl::Init #60260

dynamicheart · 2023-12-22T07:21:39Z

PR types

Bug fixes

PR changes

APIs

Description

This PR #54674 forces the option XPUAPI_DEFAULT_SIZE of xdnn::Context to 1 by default, regardless of whether we set the environment variable XPUAPI_DEFAULT_SIZE to a different value. It triggers a lot of xpu_wait calls.

This comment describes why XPUAPI_DEFAULT_SIZE is originally set to 1: #54674 (comment)

paddle-bot · 2023-12-22T07:21:45Z

你的PR提交成功，感谢你对开源项目的贡献!
请关注后续CI自动化测试结果，详情请参考Paddle-CI手册。
Your PR has been submitted. Thanks for your contribution!
Please wait for the result of CI firstly. See Paddle CI Manual for details.

houj04

LGTM && 建议戳原作者/原审批人也看一下。

houj04 · 2023-12-22T07:31:10Z

有个问题：这里引用到的PR是半年之前的，为啥最近发现了这个问题呢？

dynamicheart · 2023-12-22T07:33:34Z

LGTM && 建议戳原作者/原审批人也看一下。

@AlbertVan @zhupengyang 辛苦两位同学看看

runzhech

lgtm

dynamicheart · 2023-12-22T07:35:40Z

有个问题：这里引用到的PR是半年之前的，为啥最近发现了这个问题呢？

PyTorch的XpuContext实现参考了Paddle这边的实现，PyTorch那边先发现了这个问题，大概是2023年10月份发现的。

shentanyue

本质问题是多个xpu_context都会各自去申请一份XPUAPI_DEFAULT_SIZE。
训练侧后面可以再关注下。

[XPU] fix gm allocaion on XPUContext::Impl::Init

4ab5160

houj04 approved these changes Dec 22, 2023

View reviewed changes

runzhech approved these changes Dec 22, 2023

View reviewed changes

shentanyue approved these changes Dec 22, 2023

View reviewed changes

houj04 merged commit 39ddd5f into PaddlePaddle:develop Dec 22, 2023
29 checks passed

dynamicheart mentioned this pull request Dec 27, 2023

[XPU] avoid pre-allocating gm buffer #60387

Merged

dynamicheart mentioned this pull request Mar 15, 2024

xpu支持修改默认的l3/gm的buffer大小 #62729

Merged

houj04 added the XPU label Sep 4, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[XPU] fix gm allocaion on XPUContext::Impl::Init #60260

[XPU] fix gm allocaion on XPUContext::Impl::Init #60260

dynamicheart commented Dec 22, 2023 •

edited

Loading

paddle-bot bot commented Dec 22, 2023

houj04 left a comment

houj04 commented Dec 22, 2023

dynamicheart commented Dec 22, 2023

runzhech left a comment

dynamicheart commented Dec 22, 2023

shentanyue left a comment

[XPU] fix gm allocaion on XPUContext::Impl::Init #60260

[XPU] fix gm allocaion on XPUContext::Impl::Init #60260

Conversation

dynamicheart commented Dec 22, 2023 • edited Loading

PR types

PR changes

Description

paddle-bot bot commented Dec 22, 2023

houj04 left a comment

Choose a reason for hiding this comment

houj04 commented Dec 22, 2023

dynamicheart commented Dec 22, 2023

runzhech left a comment

Choose a reason for hiding this comment

dynamicheart commented Dec 22, 2023

shentanyue left a comment

Choose a reason for hiding this comment

dynamicheart commented Dec 22, 2023 •

edited

Loading