Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

gather/broadcast the max value of the packing efficiency automatically #463

Merged
merged 2 commits into from
Sep 17, 2023

Conversation

winglian
Copy link
Collaborator

No description provided.

@winglian winglian force-pushed the calc-packing-eff-across-all-ranks branch from aef1b32 to d45b5d3 Compare August 23, 2023 08:07
@winglian winglian force-pushed the calc-packing-eff-across-all-ranks branch from 235ec15 to 97ae665 Compare September 17, 2023 11:33
@casper-hansen
Copy link
Collaborator

casper-hansen commented Sep 17, 2023

This PR makes multi-GPU work on custom datasets. Previously, training got stuck at epoch 1.0 when micro_batch_size was 1. However, with this PR, that problem is resolved.

image

@winglian winglian merged commit b15b19e into main Sep 17, 2023
4 checks passed
@winglian winglian deleted the calc-packing-eff-across-all-ranks branch September 17, 2023 15:08
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

3 participants