
Learning rate finder auto-suggest-LR algorithm is slightly too naive #1767

Closed
mstewart141 opened this issue May 9, 2020 · 6 comments · Fixed by #1801
Labels: help wanted (Open to be worked on), let's do it! (approved to implement), question (Further information is requested)

Comments

mstewart141 commented May 9, 2020

🐛 Bug

The learning rate finder's auto-suggest-LR algorithm picks the point of steepest loss descent, but it can be tricked by spikes early in the sweep. A short burn-in period at the beginning would resolve the issue.
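The burn-in fix could look roughly like this sketch. It assumes the finder records parallel `lrs`/`losses` arrays from the sweep; the `skip_begin`/`skip_end` parameter names are illustrative, not Lightning's actual API:

```python
import numpy as np

def suggest_lr(lrs, losses, skip_begin=10, skip_end=1):
    """Pick the LR at the point of steepest loss descent, ignoring a
    short burn-in window at the start of the sweep (where early spikes
    live) and the divergent tail at the end.

    Illustrative sketch only -- parameter names are hypothetical.
    """
    lrs = np.asarray(lrs, dtype=float)
    losses = np.asarray(losses, dtype=float)
    grads = np.gradient(losses)  # steepest descent = most negative gradient
    region = grads[skip_begin:len(grads) - skip_end]
    best = skip_begin + int(np.argmin(region))
    return lrs[best]
```

With `skip_begin=0` the function reproduces the naive behavior and latches onto an early spike; with the default burn-in it finds the genuine descent region.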

To Reproduce

[screenshot omitted: LR-finder loss curve with an early spike that fools the suggestion]

@mstewart141 mstewart141 added bug Something isn't working help wanted Open to be worked on labels May 9, 2020
Borda (Member) commented May 11, 2020

@SkafteNicki mind having a look? ^^

@Borda Borda added question Further information is requested and removed bug Something isn't working labels May 11, 2020
williamFalcon (Contributor) commented:

This makes sense... @mstewart141 mind submitting a PR? :)

@williamFalcon williamFalcon added the let's do it! approved to implement label May 12, 2020
@williamFalcon williamFalcon added this to the 0.8.0 milestone May 12, 2020
williamFalcon (Contributor) commented:

but wouldn't it also be tricked by spikes anywhere? we're talking about local mins here...

justusschock (Member) commented:

Maybe something like a patience would be a good idea... If it doesn't decrease further after 5 additional LR changes or something, use the LR with the minimum so far...
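The patience idea above could be sketched like this (pure illustration; the function and its `patience` argument are hypothetical, not Lightning's implementation): track the running-minimum loss, and stop scanning once the loss has failed to improve for `patience` consecutive steps.

```python
def suggest_lr_with_patience(lrs, losses, patience=5):
    """Return the LR at the lowest loss seen so far, stopping the scan
    once the loss fails to improve for `patience` consecutive steps.

    Hypothetical sketch of the patience heuristic suggested in this
    thread; not the library's actual API.
    """
    best_idx, best_loss, stale = 0, float("inf"), 0
    for i, loss in enumerate(losses):
        if loss < best_loss:
            best_idx, best_loss, stale = i, loss, 0
        else:
            stale += 1
            if stale >= patience:
                break  # loss has plateaued/diverged; stop scanning
    return lrs[best_idx]
```

A side effect of the early stop is that a spurious late minimum (e.g. after the loss has already diverged) is never reached, which also addresses the "spikes anywhere" concern.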

mstewart141 (Author) commented May 12, 2020

i'm happy to try and help out.

one simple fix would be to add a "minimum_lr_threshold" kwarg to the plot function referenced above, with a default value of, say, 1e-5. few models in practice want a max/initial lr below that figure (of course the default could be even lower as well). then, when plotting, draw the full curve as done now, but select the best suggestion only from the range above the minimum threshold.

the same fix could be applied to the suggestions that feed directly into the Trainer. the options would be to either pick a reasonable default and stick with the current api, or to accept Union[bool, float] for auto_lr_find, say, and interpret the float as the min threshold beyond which to consider suggestions.
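as a rough sketch of the threshold idea (the standalone function and `min_lr` name are hypothetical, not the actual Trainer API), sub-threshold LRs could simply be masked out before taking the steepest-descent point:

```python
import numpy as np

def suggest_lr_above(lrs, losses, min_lr=1e-5):
    """Steepest-descent LR suggestion, considering only LRs >= min_lr.

    Hypothetical sketch of the "minimum_lr_threshold" idea above.
    """
    lrs = np.asarray(lrs, dtype=float)
    losses = np.asarray(losses, dtype=float)
    grads = np.gradient(losses)
    grads[lrs < min_lr] = np.inf  # exclude the sub-threshold region from argmin
    return lrs[int(np.argmin(grads))]
```

the whole curve can still be plotted unchanged; only the argmin is restricted, so the early spike region stays visible but can no longer win.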

mstewart141 (Author) commented:

> but wouldn't it also be tricked by spikes anywhere? we're talking about local mins here...

i think that in practice the extreme spikes are a symptom of going from "totally random/untuned" to "ever so slightly tuned" and occur primarily right at the very very beginning.

of course, some models may behave more pathologically, but making a good suggestion for such models is probably out of scope for a simple LR suggester
