Rewrite state_dict in a more pytorch idiomatic way #60

blefaudeux · 2020-09-02T17:29:54Z

🚀 Feature

Change the param_groups handling in the state dict, in order to follow more closely the default PyTorch assumptions
https://pytorch.org/docs/stable/optim.html#torch.optim.Optimizer.state_dict

Some users may assume that the default pytorch optimizer interface with respect to the state dict is respected by fairscale/oss
People familiar with pytorch optimizers would have an easier learning curve when peeking into OSS

Rewrite the exposed state dict in order to return "state" and "param_groups" in accordance to pytorch expectations, without duplications

rely on the python/pytorch memory model to remove duplicates in memory and while serializing
add wrappers on the user side

blefaudeux · 2020-09-02T17:30:09Z

blefaudeux self-assigned this Sep 2, 2020

blefaudeux mentioned this issue Sep 2, 2020

[fix] OSS pytorch-compliant state dict #61

Merged

4 tasks

blefaudeux closed this as completed in #61 Sep 3, 2020

myleott pushed a commit that referenced this issue Feb 22, 2021

Fix state_dict bugs (#60)

f481877