
[Feature Request] Support Stochastic Weight Averaging (SWA) for improved stability #321

Open · 1 task done
pchalasani opened this issue Nov 27, 2022 · 2 comments · May be fixed by #320

@pchalasani (Contributor) commented Nov 27, 2022

🚀 Feature

Stochastic Weight Averaging (SWA) is a recently proposed technique that can potentially help improve training stability in DRL. There is now an implementation in torchcontrib. Quoting/paraphrasing from their page:

> a simple procedure that improves generalization in deep learning over Stochastic Gradient Descent (SGD) at no additional cost, and can be used as a drop-in replacement for any other optimizer in PyTorch. SWA has a wide range of applications and features, [...] including [...] improve the stability of training as well as the final average rewards of policy-gradient methods in deep reinforcement learning.

See the PyTorch SWA page for more.
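For concreteness, here is a minimal sketch of the drop-in usage, roughly following the torchcontrib README (the toy model, data, and hyperparameter values below are placeholders, not a tested recommendation):

```python
import torch
import torchcontrib

# toy setup; any model/optimizer pair can be wrapped the same way
model = torch.nn.Linear(10, 1)
loss_fn = torch.nn.MSELoss()
x, y = torch.randn(64, 10), torch.randn(64, 1)

# wrap the base optimizer: averaging starts at step swa_start, a snapshot
# is taken every swa_freq steps, at constant learning rate swa_lr
base_opt = torch.optim.SGD(model.parameters(), lr=0.1)
opt = torchcontrib.optim.SWA(base_opt, swa_start=10, swa_freq=5, swa_lr=0.05)

for _ in range(100):
    opt.zero_grad()
    loss_fn(model(x), y).backward()
    opt.step()

# copy the running SWA average into the model's weights at the end
opt.swap_swa_sgd()
```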

Motivation

SWA might help improve training stability as well as final reward in some DRL scenarios. It may also alleviate sensitivity to random-seed initialization.

Pitch

See above :)

Alternatives

No response

Additional context

See the PyTorch SWA page for more.

Checklist

  • I have checked that there is no similar issue in the repo
@pchalasani pchalasani added the enhancement New feature or request label Nov 27, 2022
@pchalasani pchalasani linked a pull request Nov 27, 2022 that will close this issue
@pchalasani pchalasani changed the title from "[Feature Request] request title" to "[Feature Request] Support Stochastic Weight Averaging (SWA) for improved stability" Nov 27, 2022
@araffin (Member) commented Nov 29, 2022

Hello,

> can potentially help improve training stability in DRL

Do you have experimental results to back this claim?

In the paper linked in the blog post, results are reported for A2C/DDPG only (which usually have weaker results compared to PPO/TD3/SAC), and they used only 3 random seeds, which is not enough to account for noise in the results.

torchcontrib is also now archived and hasn't received any updates for almost 3 years (https://github.com/pytorch/contrib).

EDIT: SWA seems to be directly in PyTorch now: https://pytorch.org/docs/stable/optim.html#stochastic-weight-averaging
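For reference, a minimal sketch of that native API, adapted from the torch.optim.swa_utils documentation (the toy model, the stand-in loader, and the swa_start/swa_lr values here are placeholders):

```python
import torch
from torch.optim.swa_utils import AveragedModel, SWALR, update_bn

# toy setup; loader stands in for a real DataLoader
model = torch.nn.Linear(10, 1)
loss_fn = torch.nn.MSELoss()
loader = [(torch.randn(32, 10), torch.randn(32, 1)) for _ in range(4)]

optimizer = torch.optim.SGD(model.parameters(), lr=0.1)
swa_model = AveragedModel(model)   # maintains the running average of weights
scheduler = torch.optim.lr_scheduler.CosineAnnealingLR(optimizer, T_max=100)
swa_start = 75                     # epoch at which averaging begins
swa_scheduler = SWALR(optimizer, swa_lr=0.05)

for epoch in range(100):
    for xb, yb in loader:
        optimizer.zero_grad()
        loss_fn(model(xb), yb).backward()
        optimizer.step()
    if epoch > swa_start:
        swa_model.update_parameters(model)
        swa_scheduler.step()
    else:
        scheduler.step()

# recompute BatchNorm statistics for the averaged model (a no-op for this
# toy model, but required when the network contains BatchNorm layers)
update_bn(loader, swa_model)
```

At evaluation time, predictions would come from swa_model rather than model.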

@pchalasani (Contributor, Author)

Thanks, I did not know SWA is in main PyTorch. I will look into it.
As for empirical evidence, I'll continue experimenting and report back.
