scaled_dot_product_attention api #55242

liuzhenhai93 · 2023-07-07T08:25:33Z

PR types

Others

PR changes

APIs

Description

scaled_dot_product_attention api
card-72806

paddle-bot · 2023-07-07T08:25:38Z

Your PR has been submitted. Thanks for your contribution to the open-source project!
Please wait for the CI results first. See the Paddle CI Manual for details.

paddle-bot · 2023-07-07T08:25:40Z

✅ This PR's description meets the template requirements!
Please wait for other CI results.

paddle-ci-bot · 2023-07-17T03:19:04Z

Sorry to inform you that the CI runs for d419bc7 passed more than 7 days ago. To prevent PR conflicts, you need to re-run all CIs manually.

jzhang533 · 2023-07-20T07:45:14Z

python/paddle/nn/functional/flash_attention.py

+
+    where : ``Q``, ``K``, and ``V`` represent the three input parameters of the attention module.
+    The dimensions of the three parameters are the same.
+    ``d`` represents the size of the last dimension of the three parameters.


Here, Q, K, and V denote the three input parameters of the attention module, all sharing identical dimensions. d represents the size of the last dimension of these three parameters.

In mathematical formulas, "where" is the conventional wording.

OK. I produced this rewording with ChatGPT, so treat it as a suggestion only.
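For readers following the docstring discussion, here is a minimal NumPy sketch of the standard scaled dot-product attention formula, softmax(Q K^T / sqrt(d)) V, which is what ``Q``, ``K``, ``V``, and ``d`` refer to. It is not the code from this PR: the function name `attention_reference` and the 2-D `(seq_len, d)` shapes are assumptions chosen to keep the example small, and it ignores dropout, masking, and the batch and head dimensions.

```python
import numpy as np


def attention_reference(q, k, v):
    """Compute softmax(q @ k.T / sqrt(d)) @ v for 2-D arrays of shape (seq_len, d).

    Hypothetical reference helper for illustration only.
    """
    d = q.shape[-1]  # size of the last dimension, as described in the docstring
    scores = q @ k.T / np.sqrt(d)
    # Subtract the row-wise maximum before exponentiating for numerical stability.
    scores = scores - scores.max(axis=-1, keepdims=True)
    weights = np.exp(scores)
    weights = weights / weights.sum(axis=-1, keepdims=True)
    return weights @ v, weights


rng = np.random.default_rng(0)
q = rng.standard_normal((4, 8))
k = rng.standard_normal((4, 8))
v = rng.standard_normal((4, 8))
out, weights = attention_reference(q, k, v)
print(out.shape)
```

Each row of `weights` sums to 1, so every output row is a convex combination of the rows of `v`.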

jzhang533 · 2023-07-20T07:47:16Z

python/paddle/nn/functional/flash_attention.py

+                    The dtype can be float16 or bfloat16.
+
+    Examples:
+        .. code-block:: python


The framework is adopting xdoctest. While you are at it, the example code could be converted to the format xdoctest supports, see #55295.

What does the xdoctest-supported format look like?
Is there a demo or a clear specification?

Please refer to the changes in the PR I linked.
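As a rough illustration of the question above, the sketch below shows a docstring whose examples use `>>>` prompts, the style that doctest-like runners expect. It is an assumption-laden demo rather than the format defined in #55295: the helper `softmax_scale` is hypothetical, and the examples are executed with Python's standard-library `doctest` module instead of xdoctest itself. The diff in this PR also uses xdoctest directive comments such as `# xdoctest: -SKIP`, which are not exercised here.

```python
import doctest
import math


def softmax_scale(head_dim):
    """Return the 1/sqrt(d) factor used by scaled dot-product attention.

    Hypothetical helper, used only to demonstrate the docstring style.

    Examples:
        >>> softmax_scale(64)
        0.125
        >>> softmax_scale(16)
        0.25
    """
    return 1.0 / math.sqrt(head_dim)


# Parse the docstring examples and run them with the stdlib doctest runner.
parser = doctest.DocTestParser()
test_case = parser.get_doctest(
    softmax_scale.__doc__,
    {"softmax_scale": softmax_scale},  # names visible to the examples
    "softmax_scale",
    None,
    0,
)
runner = doctest.DocTestRunner(verbose=False)
results = runner.run(test_case)
print(results.attempted, results.failed)
```

The point of the style is that each `>>>` line is executed and its printed result is compared with the line that follows, so the documentation examples double as tests.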

jzhang533

LGTM

jeff41404 · 2023-08-01T09:40:31Z

python/paddle/nn/functional/flash_attention.py

@@ -407,4 +407,57 @@ def flash_attn_unpadded(
    return out, softmax if return_softmax else None


-scaled_dot_product_attention = flash_attention
+def scaled_dot_product_attention(
+    query, key, value, attn_mask=None, dropout_p=0.0, is_causal=False


To be consistent with other APIs, the signature must end with a name=None parameter.
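A minimal sketch of what this request implies for the signature, assuming the other parameters stay as shown in the diff above. The function name `scaled_dot_product_attention_sketch` and its empty body are placeholders, not the merged implementation.

```python
import inspect


def scaled_dot_product_attention_sketch(
    query, key, value, attn_mask=None, dropout_p=0.0, is_causal=False, name=None
):
    """Hypothetical stub: only the parameter list matters here."""
    return None


# Inspect the signature to confirm that `name` comes last and defaults to None.
signature = inspect.signature(scaled_dot_product_attention_sketch)
param_names = list(signature.parameters)
print(param_names[-1], signature.parameters["name"].default)
```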

jeff41404 · 2023-08-01T09:41:35Z

python/paddle/nn/functional/flash_attention.py

+            >>> print(output)
+            >>> # xdoctest: -SKIP
+    """
+    assert attn_mask is None, "attn_mask is not supported yet"


If attn_mask is not currently supported, add a TODO comment indicating that it will be supported later.

Work to support attn_mask is already under way, and it depends on this PR being merged.
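One way the reviewer's suggestion could look, sketched under the assumption that the assert from the diff is kept as the guard. The wrapper name `attn_mask_guard` and the TODO wording are illustrative only.

```python
def attn_mask_guard(attn_mask=None):
    """Hypothetical helper isolating the guard from the diff."""
    # TODO: remove this guard once attn_mask support is merged.
    assert attn_mask is None, "attn_mask is not supported yet"
    return True


# Passing no mask succeeds; passing any mask trips the guard.
ok_without_mask = attn_mask_guard()
try:
    attn_mask_guard(attn_mask=[[0.0]])
    rejected = False
except AssertionError as err:
    rejected = True
    message = str(err)
print(ok_without_mask, rejected)
```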

scaled_dot_product_attention api

3f0c520

add test

3362289

liuzhenhai93 requested review from sneaxiy and kuizhiqing July 7, 2023 09:00

polish

d419bc7

sneaxiy previously approved these changes Jul 10, 2023

View reviewed changes

liuzhenhai93 added 3 commits July 18, 2023 14:14

Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into develop_scaleed_dot_product_attention_api

0b8f22a

test

d161b82

Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into develop_scaleed_dot_product_attention_api

3d838ac

liuzhenhai93 dismissed sneaxiy’s stale review via 3d838ac July 18, 2023 12:45

liuzhenhai93 requested a review from jzhang533 July 20, 2023 07:34

jzhang533 reviewed Jul 20, 2023

View reviewed changes

liuzhenhai93 added 4 commits July 20, 2023 11:45

polish doc

bb9aecd

Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into develop_scaleed_dot_product_attention_api

0be5b6c

Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into develop_scaleed_dot_product_attention_api

08af867

polish

bea4ee3

jzhang533 previously approved these changes Jul 24, 2023

View reviewed changes

liuzhenhai93 added 2 commits July 25, 2023 10:38

polish

1d2a94e

Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into develop_scaleed_dot_product_attention_api

871ff54

liuzhenhai93 dismissed jzhang533’s stale review via 871ff54 July 25, 2023 10:41

polish

e68519e

jzhang533 previously approved these changes Jul 25, 2023

View reviewed changes

liuzhenhai93 added 4 commits July 31, 2023 10:17

Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into develop_scaleed_dot_product_attention_api

5c2def7

skip xtest

29a118d

Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into develop_scaleed_dot_product_attention_api

ddcff7a

Merge branch 'develop_scaleed_dot_product_attention_api' of https://github.com/liuzhenhai93/Paddle into develop_scaleed_dot_product_attention_api

a9abd3c

liuzhenhai93 dismissed jzhang533’s stale review via a9abd3c August 1, 2023 02:35

tianshuo78520a approved these changes Aug 1, 2023

View reviewed changes

jzhang533 approved these changes Aug 1, 2023

View reviewed changes

liuzhenhai93 requested a review from jeff41404 August 1, 2023 09:25

jeff41404 reviewed Aug 1, 2023

View reviewed changes

jeff41404 approved these changes Aug 2, 2023

View reviewed changes

Xreki merged commit b19dfb8 into PaddlePaddle:develop Aug 2, 2023
27 checks passed

iosmers mentioned this pull request Aug 15, 2023

Add flash attention backward grad check #56249

Merged

liuzhenhai93 deleted the develop_scaleed_dot_product_attention_api branch October 7, 2023 03:09
