gather/broadcast the max value of the packing efficiency automatically #463

winglian · 2023-08-23T07:41:06Z

No description provided.

src/axolotl/utils/distributed.py

casper-hansen · 2023-09-17T13:56:17Z

This PR makes multi-GPU training work on custom datasets. Previously, training got stuck at epoch 1.0 when micro_batch_size was 1; with this PR, that problem is resolved.
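The thread does not show the diff itself, so the following is only a minimal sketch of the idea named in the PR title: each rank sends its raw packing efficiency, the maximum is taken, and that value is sent back so every rank uses the same number. The collectives are simulated with plain lists, and every function name, the sample efficiencies, and the batch-count formula are hypothetical and not taken from axolotl. The assumption that ranks disagreeing on the number of steps is what caused the hang is also an inference, not something stated in the thread.

```python
import math


def gather_to_rank0(local_values):
    """Stand-in for a gather collective: rank 0 receives every rank's raw value."""
    return list(local_values)


def broadcast_from_rank0(value, world_size):
    """Stand-in for a broadcast collective: every rank receives rank 0's value."""
    return [value] * world_size


def sync_packing_efficiency(local_estimates):
    """Return the value each rank holds after gathering and broadcasting the max."""
    gathered = gather_to_rank0(local_estimates)  # raw per-rank values
    agreed = max(gathered)                       # reduce to a single value
    return broadcast_from_rank0(agreed, len(local_estimates))


def estimated_batches(total_tokens, efficiency, max_seq_len, micro_batch_size):
    """Hypothetical step estimate; NOT axolotl's actual formula."""
    return math.floor(total_tokens / (efficiency * max_seq_len * micro_batch_size))


if __name__ == "__main__":
    # Made-up per-rank estimates for a 4-GPU run.
    local = [0.91, 0.97, 0.94, 0.95]
    before = [estimated_batches(1_000_000, e, 2048, 1) for e in local]
    after = [estimated_batches(1_000_000, e, 2048, 1)
             for e in sync_packing_efficiency(local)]
    print("steps per rank before sync:", before)  # ranks disagree
    print("steps per rank after sync: ", after)   # ranks agree
```

In a real multi-GPU run the two stand-in helpers would be replaced by actual collectives, for example `torch.distributed.all_reduce` with `ReduceOp.MAX`, which combines the gather, max, and broadcast in one call.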

winglian force-pushed the calc-packing-eff-across-all-ranks branch from aef1b32 to d45b5d3 on August 23, 2023 08:07

winglian requested a review from NanoCode012 August 23, 2023 15:56

NanoCode012 reviewed Sep 17, 2023

src/axolotl/utils/distributed.py (outdated, resolved)

gather/broadcast the max value of the packing efficiency automatically

97ae665

winglian force-pushed the calc-packing-eff-across-all-ranks branch from 235ec15 to 97ae665 on September 17, 2023 11:33

send raw values for packing efficiency

435bdb2

casper-hansen mentioned this pull request Sep 17, 2023

axolotl hanging during training on custom dataset (ran for 30 minutes before timing out) #592

Closed

8 tasks

winglian merged commit b15b19e into main Sep 17, 2023
4 checks passed

winglian deleted the calc-packing-eff-across-all-ranks branch September 17, 2023 15:08

mkeoliya pushed a commit to mkeoliya/axolotl that referenced this pull request Dec 15, 2023

gather/broadcast the max value of the packing efficiency automatically (axolotl-ai-cloud#463)

899ef14
