Fix Deepspeed Zero3 Config (#791)
* Update zero3.json

Remove CPU offload by default (it slows training down badly; reducing the batch size is the better option) and change the LR scheduler to one that actually decays.

* Update zero3.json

fix something
teknium1 committed Oct 28, 2023
1 parent 2e71ff0 commit d3193be
Showing 1 changed file with 3 additions and 10 deletions.
13 changes: 3 additions & 10 deletions deepspeed/zero3.json
@@ -1,14 +1,6 @@
 {
   "zero_optimization": {
     "stage": 3,
-    "offload_optimizer": {
-      "device": "cpu",
-      "pin_memory": true
-    },
-    "offload_param": {
-      "device": "cpu",
-      "pin_memory": true
-    },
     "overlap_comm": true,
     "contiguous_gradients": true,
     "sub_group_size": 0,
@@ -41,12 +33,13 @@
     }
   },
   "scheduler": {
-    "type": "WarmupLR",
+    "type": "WarmupDecayLR",
     "params": {
       "warmup_min_lr": "auto",
       "warmup_max_lr": "auto",
       "warmup_num_steps": "auto",
-      "warmup_type": "linear"
+      "warmup_type": "linear",
+      "total_num_steps": "auto"
     }
   },
   "gradient_accumulation_steps": "auto",
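To illustrate why the scheduler change matters, here is a minimal Python sketch of the two schedule shapes. It is a simplified model based on the documented behavior of DeepSpeed's `WarmupLR` and `WarmupDecayLR` with `"warmup_type": "linear"`, not the library's own code. The function names and the numbers used below are hypothetical; in the real config the `"auto"` values are filled in by the launching framework at run time.

```python
def warmup_lr(step, min_lr, max_lr, warmup_steps):
    """WarmupLR-style schedule (sketch): linear ramp, then hold at max_lr."""
    if step < warmup_steps:
        return min_lr + (max_lr - min_lr) * step / warmup_steps
    # After warmup the rate never decreases.
    return max_lr


def warmup_decay_lr(step, min_lr, max_lr, warmup_steps, total_steps):
    """WarmupDecayLR-style schedule (sketch): linear ramp, then linear decay."""
    if step < warmup_steps:
        return min_lr + (max_lr - min_lr) * step / warmup_steps
    # Fraction of the post-warmup phase still remaining, clamped at 0
    # so the rate stays at min_lr once total_steps is passed.
    remaining = max(0.0, (total_steps - step) / max(1, total_steps - warmup_steps))
    return min_lr + (max_lr - min_lr) * remaining


if __name__ == "__main__":
    # Hypothetical values chosen only to show the shape of each curve.
    for step in (0, 50, 100, 550, 1000):
        flat = warmup_lr(step, 0.0, 1e-3, 100)
        decayed = warmup_decay_lr(step, 0.0, 1e-3, 100, 1000)
        print(f"step {step:>4}: WarmupLR={flat:.2e}  WarmupDecayLR={decayed:.2e}")
```

Under these assumptions both schedules agree during warmup, but only the second one needs `total_num_steps`, which is why the commit adds that key alongside the type change.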
