Allow setting per-layer learning rates #1612

Merged
shaydeci merged 20 commits into master from feature/SG-1209_introduce_optimizer_initializer on Nov 13, 2023

Conversation

shaydeci
Collaborator

@shaydeci commented Nov 6, 2023

  • Added support for passing initial_lr as a dictionary.
  • Removed the usages of update_param_groups and initialize_param_groups. Affected recipes had an equivalent initial_lr mapping added to them (tested).
  • For the edge case of an already-instantiated optimizer, names are assigned to the parameter groups and an initial_lr mapping is extracted from them so our schedulers can still be used (a plain-PyTorch sketch of this follows below).
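
To make the last point concrete, here is a minimal, self-contained sketch in plain PyTorch (not the library code from this PR; the backbone/neck/head split, the chosen learning rates, and the extra "name" key on each parameter group are illustrative assumptions):

```python
import torch
from torch import nn

# Illustrative toy model split into named parts so each can get its own LR.
model = nn.ModuleDict({
    "backbone": nn.Linear(8, 16),
    "neck": nn.Linear(16, 16),
    "head": nn.Linear(16, 4),
})

base_lr = 0.1
# Named parameter groups: PyTorch keeps extra keys such as "name" on the group dict.
optimizer = torch.optim.SGD([
    {"params": model["backbone"].parameters(), "lr": 0.0,           "name": "backbone"},
    {"params": model["neck"].parameters(),     "lr": 0.1 * base_lr, "name": "neck"},
    {"params": model["head"].parameters(),     "lr": base_lr,       "name": "head"},
])

# An initial_lr mapping can then be recovered from the named groups, which is the
# shape a scheduler needs in order to scale each group relative to its own initial LR.
initial_lr = {g["name"]: g["lr"] for g in optimizer.param_groups}
print(initial_lr)  # {'backbone': 0.0, 'neck': 0.01, 'head': 0.1}
```

With such a mapping in hand, a scheduler can update every group independently instead of assuming one global initial learning rate.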

BloodAxe
Collaborator

@BloodAxe left a comment

I went pretty brutal on this PR, sorry :)
I mean, it's good. It's written well, it introduces new features while keeping support for the existing logic, and it's clean. Just a few tricky things here and there.

@BloodAxe
Collaborator

BloodAxe commented Nov 8, 2023

And a cherry on top - let's rename the PR title to be "GitHub release notes"-friendly. Something like: "Allow setting per-layer learning rates"

@shaydeci changed the title from "Feature/sg 1209 introduce initial_lr as mapping" to "Allow setting per-layer learning rates" on Nov 9, 2023
@BloodAxe
Collaborator

BloodAxe commented Nov 9, 2023

One last remark from me - please add an integration test for any model of your choice where we train the model for one short epoch with backbone lr=0, neck=0.1xLR, head=LR to ensure it all works end-to-end.
And then - LGTM :)

@shaydeci
Collaborator Author

One last remark from me - please add an integration test for any model of your choice where we train the model for one short epoch with backbone lr=0, neck=0.1xLR, head=LR to ensure it all works end-to-end.
And then - LGTM :)

Added a unit test that trains with part of the net frozen.
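
Not the actual test that was added, but a rough, self-contained illustration of the kind of check such a test might perform, using the hypothetical backbone/neck/head LRs from the review comment above and toy data:

```python
import copy
import torch
from torch import nn

# Sketch only: train for one short "epoch" with the backbone LR set to 0,
# then assert the frozen part did not move while the full-LR head did.
model = nn.ModuleDict({
    "backbone": nn.Linear(8, 16),
    "neck": nn.Linear(16, 16),
    "head": nn.Linear(16, 4),
})
base_lr = 0.1
optimizer = torch.optim.SGD([
    {"params": model["backbone"].parameters(), "lr": 0.0},
    {"params": model["neck"].parameters(),     "lr": 0.1 * base_lr},
    {"params": model["head"].parameters(),     "lr": base_lr},
])

backbone_before = copy.deepcopy(model["backbone"].state_dict())
head_before = copy.deepcopy(model["head"].state_dict())

x, y = torch.randn(32, 8), torch.randint(0, 4, (32,))
for _ in range(5):  # stands in for "one short epoch"
    optimizer.zero_grad()
    logits = model["head"](model["neck"](model["backbone"](x)))
    nn.functional.cross_entropy(logits, y).backward()
    optimizer.step()

# The zero-LR group must be bitwise unchanged; the full-LR head must have updated.
for k, v in model["backbone"].state_dict().items():
    assert torch.equal(v, backbone_before[k])
assert any(
    not torch.equal(v, head_before[k]) for k, v in model["head"].state_dict().items()
)
```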

@shaydeci merged commit f8686cd into master on Nov 13, 2023
6 of 7 checks passed
@shaydeci deleted the feature/SG-1209_introduce_optimizer_initializer branch on November 13, 2023 at 08:14