
Bug in MultiStepLR lr scheduler #31828

Closed
Steve-Tod opened this issue Jan 3, 2020 · 7 comments
Assignees
Labels
module: optimizer Related to torch.optim
triaged This issue has been looked at by a team member, and triaged and prioritized into an appropriate module

Comments

@Steve-Tod

Steve-Tod commented Jan 3, 2020

🐛 Bug

Passing the epoch argument to the step() function of MultiStepLR leads to an incorrect learning rate.

To Reproduce

from torch import nn
import torch
net = nn.Linear(30, 10)
optimizer = torch.optim.Adam(net.parameters(), lr=0.001)
s = torch.optim.lr_scheduler.MultiStepLR(optimizer, [10, 20, 30], gamma=0.1)
print(s.get_lr())
s.step(1)
print(s.get_lr())

Output

[0.001]
[1.0000000000000002e-06]

Expected behavior

[0.001]
[0.001]

Environment

PyTorch version: 1.4.0a0+d5bf51b
Is debug build: No
CUDA used to build PyTorch: 9.0

OS: Ubuntu 16.04.6 LTS
GCC version: (Ubuntu 5.4.0-6ubuntu1~16.04.11) 5.4.0 20160609
CMake version: version 3.14.0

Python version: 3.6
Is CUDA available: Yes
CUDA runtime version: 9.0.176
GPU models and configuration:
GPU 0: TITAN Xp
GPU 1: TITAN Xp
GPU 2: TITAN Xp
GPU 3: TITAN Xp

Nvidia driver version: 430.26
cuDNN version: Could not collect

Versions of relevant libraries:

[pip] numpy==1.17.3
[pip] torch==1.4.0a0+d5bf51b
[conda] blas 1.0 mkl
[conda] magma-cuda90 2.5.0 1 pytorch
[conda] mkl 2019.4 243
[conda] mkl-include 2019.4 243
[conda] mkl-service 2.3.0 py36he904b0f_0
[conda] mkl_fft 1.0.15 py36ha843d7b_0
[conda] mkl_random 1.1.0 py36hd6b4f25_0
[conda] torch 1.4.0a0+d5bf51b pypi_0 pypi

Additional context

A possible cause is that the milestones attribute of MultiStepLR is a Counter rather than a list, which leads to incorrect behavior of bisect in the get_lr function.
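
To make the hypothesis concrete, here is a minimal sketch of the suspected failure mode (an illustration only, not the actual PyTorch code): bisect_right indexes its first argument by position, while a collections.Counter is indexed by key and returns 0 for keys it has not seen, so bisecting a Counter of milestones with a small epoch can overshoot.

from bisect import bisect_right
from collections import Counter

milestones = [10, 20, 30]
counter = Counter(milestones)        # Counter({10: 1, 20: 1, 30: 1})

print(bisect_right(milestones, 1))   # 0 -> 0.001 * 0.1**0 = 0.001 (expected)
print(bisect_right(counter, 1))      # 3 -> 0.001 * 0.1**3 = 1e-06 (observed)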

cc @vincentqb

@ssnl
Collaborator

ssnl commented Jan 3, 2020

you should use get_last_lr.

@Steve-Tod
Author

you should use get_last_lr.

The result is the same.

[screenshot: the same incorrect learning rate is returned by get_last_lr()]

@Steve-Tod
Author

Steve-Tod commented Jan 5, 2020

Did anyone try this?

@jerryzh168 jerryzh168 added the module: optimizer and triaged labels Jan 8, 2020
@vincentqb
Contributor

If you are compiling from master, please make sure you have the latest version. Schedulers no longer take the epoch parameter in step(); a warning against passing it will be raised with #31125. The code should be the following:

from torch import nn
import torch
net = nn.Linear(30, 10)
optimizer = torch.optim.Adam(net.parameters(), lr=0.001)
s = torch.optim.lr_scheduler.MultiStepLR(optimizer, [10, 20, 30], gamma=0.1)

print(s.get_last_lr())
s.step()
print(s.get_last_lr())

@vincentqb vincentqb self-assigned this Jan 9, 2020
@Steve-Tod
Author

Thank you for your response!
But I'm curious why the epoch parameter was removed. It is convenient if I want to resume from a checkpoint and continue using the same milestones.

@vincentqb
Contributor

vincentqb commented Jan 10, 2020

Thank you for your response!
But I'm curious why the epoch parameter was removed.

Not all schedulers support that parameter in the first place. Moreover, we made schedulers chainable, and the epoch parameter doesn't extend nicely to chaining; see the sketch below. See also #26423.
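
To illustrate what "chainable" means here, a minimal sketch (not code from this thread; the particular scheduler pairing is only an example): two schedulers act on the same optimizer and each is stepped once per epoch, so a single epoch argument would not compose cleanly.

from torch import nn
import torch

net = nn.Linear(30, 10)
optimizer = torch.optim.SGD(net.parameters(), lr=0.1)
exp_sched = torch.optim.lr_scheduler.ExponentialLR(optimizer, gamma=0.9)
multi_sched = torch.optim.lr_scheduler.MultiStepLR(optimizer, [10, 20], gamma=0.1)

for epoch in range(30):
    # training step would go here
    optimizer.step()
    exp_sched.step()    # each scheduler multiplies the current lr in turn
    multi_sched.step()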

It is convenient if I want to resume from a checkpoint and continue using the same milestones.

For this, you can run the scheduler in a loop, or you can save and load the scheduler state; see the sketch below.
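
For example, a minimal sketch of both options, reusing the setup from the reproduction above (the checkpoint path and resume epoch are made up for illustration):

from torch import nn
import torch

net = nn.Linear(30, 10)
optimizer = torch.optim.Adam(net.parameters(), lr=0.001)
s = torch.optim.lr_scheduler.MultiStepLR(optimizer, [10, 20, 30], gamma=0.1)

# Option 1: replay the scheduler up to the epoch being resumed.
resume_epoch = 15  # hypothetical checkpoint epoch
for _ in range(resume_epoch):
    s.step()  # stepping without optimizer.step() only raises a warning here
print(s.get_last_lr())  # lr has dropped by gamma once (milestone at 10 passed)

# Option 2: save and reload the scheduler state alongside the checkpoint.
torch.save({'model': net.state_dict(),
            'optimizer': optimizer.state_dict(),
            'scheduler': s.state_dict()}, 'checkpoint.pth')

ckpt = torch.load('checkpoint.pth')
net.load_state_dict(ckpt['model'])
optimizer.load_state_dict(ckpt['optimizer'])
s.load_state_dict(ckpt['scheduler'])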

@AIRCAP

AIRCAP commented Feb 25, 2020

This bug is a duplicate of #33229.
