
Sequence parallel strategy support. #819

Merged: 12 commits into PaddlePaddle:develop on Oct 18, 2022

Conversation

@GhostScreaming (Contributor) commented Oct 9, 2022

Add sequence parallel strategy support for the GPT pipeline parallel model. (A minimal sketch of the sequence-parallel layout follows the checklist below.)

  1. Loss curve matches its peers (mp4_pp2 and mp8).
     [loss curve screenshots: loss (1), loss_all]
  2. Both forward and backward outputs have been aligned with the peer (mp2_pp2) in the first step.
  3. The function _is_valid_send_recv_partial() in paddle/distributed/fleet/meta_parallel/pp_utils/p2p_communication.py needs to be modified; a corresponding PR to the Paddle repo will be submitted.
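
Below is a minimal NumPy sketch of the sequence-parallel idea this PR adds; the shapes, the split/gather steps, and all names are illustrative assumptions, not Paddle's actual API.

```python
# Illustrative sequence-parallel layout (assumed shapes, not Paddle's API).
# Activations of shape [seq_len, batch, hidden] are split along the sequence
# dimension across model-parallel ranks, so per-rank activation memory for
# ops like LayerNorm and dropout drops to 1/mp_degree; an all-gather restores
# the full sequence where attention needs it.
import numpy as np

mp_degree = 4                       # assumed model-parallel world size
seq_len, batch, hidden = 16, 2, 8   # toy sizes; seq_len % mp_degree == 0

full = np.random.rand(seq_len, batch, hidden).astype(np.float32)

# "Scatter": each rank keeps only its sequence shard.
shards = np.split(full, mp_degree, axis=0)

# Each rank applies its local op to its shard (stand-in elementwise op).
partial = [shard * 2.0 for shard in shards]

# "All-gather": shards are concatenated back along the sequence dimension
# before the attention block, which needs the whole sequence.
gathered = np.concatenate(partial, axis=0)
assert gathered.shape == full.shape
```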

1. Add sequence parallel strategy for GPTModelHybrid
2. Output has been checked layer by layer in both the forward
   and backward passes, and the loss curve over the first
   5000 steps matches the peer
3. Performance improves by about 10% with the sequence_parallel
   strategy compared with pretrain_gpt_1.3B_mp8
1. Add sequence_parallel option for GPTModel
2. When mp=1, the sequence_parallel option should
   always be set to False (see the guard sketch below)
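
A minimal sketch of that constraint as a config guard; the function name and warning message are hypothetical, not code from this PR.

```python
# Hypothetical config guard for the rule above: with mp_degree == 1 there is
# no model-parallel group to shard the sequence across, so sequence_parallel
# must stay False. Names are illustrative only.
def resolve_sequence_parallel(mp_degree: int, sequence_parallel: bool) -> bool:
    if mp_degree == 1 and sequence_parallel:
        print("Warning: sequence_parallel requires mp_degree > 1; disabling.")
        return False
    return sequence_parallel

assert resolve_sequence_parallel(1, True) is False   # forced off when mp=1
assert resolve_sequence_parallel(8, True) is True    # honored when mp>1
```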
@ForFishes (Member) left a comment


LGTM

@ForFishes ForFishes merged commit fa4cd96 into PaddlePaddle:develop Oct 18, 2022