
enable param group configuration in llm-foundry #760

Merged: 11 commits merged into mosaicml:main on Nov 29, 2023

Conversation

@vchiley (Contributor) commented Nov 22, 2023

This PR enables param group configuration in llm-foundry.

The optimizer_config defines the optimizer args.
This PR allows the optimizer_config to additionally include the key disable_grad, which is a string or list of strings. If a string matches a parameter name, that parameter is set to requires_grad=False. This is useful for freezing parameters.
The PR also allows the key param_groups, which is a list of dicts. In each dict, the key param_str_match defines a string; if a parameter name contains this string, the parameter is placed in that parameter group. This is useful for applying different optimizer settings to groups of parameters. Each dict can also contain any other key that is a valid optimizer arg, which then overrides the top-level setting for that group.
Note: to handle name-overlap conflicts, parameters are assigned to groups, and the groups are added to param_groups, in the order their param_str_match entries appear in param_groups.

Param name comparisons are done using RegEx search.
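As a rough illustration of how this matching could work (a minimal sketch in plain PyTorch, not the actual llm-foundry builder code; partition_params is a hypothetical helper), parameters might be partitioned like this:

    import re

    def partition_params(named_params, disable_grad, param_groups_cfg):
        """Hypothetical sketch of regex-based param handling (not llm-foundry code)."""
        named_params = list(named_params)

        # Freeze any parameter whose name matches a disable_grad pattern.
        for name, param in named_params:
            if any(re.search(pat, name) for pat in disable_grad):
                param.requires_grad = False

        # Assign params to groups in the order the group configs appear;
        # each param goes to the first group whose param_str_match matches.
        groups, claimed = [], set()
        for cfg in param_groups_cfg:
            pattern = cfg['param_str_match']
            overrides = {k: v for k, v in cfg.items() if k != 'param_str_match'}
            matched = [p for n, p in named_params
                       if n not in claimed and re.search(pattern, n)]
            claimed.update(n for n, _ in named_params if re.search(pattern, n))
            groups.append({'params': matched, **overrides})

        # Anything unclaimed falls back to the top-level optimizer settings.
        rest = [p for n, p in named_params if n not in claimed]
        return [{'params': rest}, *groups]

The returned list of dicts is in the shape a torch optimizer constructor accepts as its params argument.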

Usage
To disable gradients for all parameters whose names contain the string "norm" or "bias":

    optimizer_config: {
        "name": "decoupled_lionw",
        "lr": 1e-3,
        "weight_decay": 1e-2,
        "betas": [0.9, 0.999],
        "eps": 1e-8,
        "disable_grad": ["norm", "bias"]
    }

or in YAML form:

optimizer:
  name: decoupled_lionw
  lr: 1e-3
  weight_decay: 1e-2
  betas:
  - 0.9
  - 0.999
  eps: 1e-8
  disable_grad:
  - norm
  - bias
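As a toy illustration of the effect (a hypothetical model, not code from this PR), every parameter whose name matches "norm" or "bias" ends up frozen:

    import re

    import torch

    class TinyBlock(torch.nn.Module):
        """Toy module whose parameter names contain 'norm' and 'bias'."""

        def __init__(self):
            super().__init__()
            self.linear = torch.nn.Linear(8, 8)
            self.norm = torch.nn.LayerNorm(8)

    model = TinyBlock()
    for name, param in model.named_parameters():
        if any(re.search(pat, name) for pat in ['norm', 'bias']):
            param.requires_grad = False

    # Only 'linear.weight' remains trainable.
    print([n for n, p in model.named_parameters() if p.requires_grad])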

To set different optimizer parameters for all parameters that contain the string "norm":

    optimizer_config: {
        "name": "decoupled_lionw",
        "lr": 1e-3,
        "weight_decay": 1e-2,
        "betas": [0.9, 0.999],
        "eps": 1e-8,
        "param_groups": [
            {
                "param_str_match": "norm",
                "lr": 1e-4,
                "weight_decay": 0.0,
            },
        ],
    }

or in YAML form:

optimizer:
  name: decoupled_lionw
  lr: 1e-3
  weight_decay: 1e-2
  betas:
  - 0.9
  - 0.999
  eps: 1e-8
  param_groups:
  - param_str_match: norm
    lr: 1e-4
    weight_decay: 0
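In plain PyTorch terms, the resulting optimizer is built with two param groups, roughly like this (a sketch using torch.optim.AdamW as a stand-in for DecoupledLionW, with a toy model; not llm-foundry code):

    import re

    import torch

    class TinyBlock(torch.nn.Module):
        def __init__(self):
            super().__init__()
            self.linear = torch.nn.Linear(8, 8)
            self.norm = torch.nn.LayerNorm(8)

    model = TinyBlock()
    norm_params = [p for n, p in model.named_parameters() if re.search('norm', n)]
    other_params = [p for n, p in model.named_parameters() if not re.search('norm', n)]

    # 'norm' params get their own group with overridden lr / weight_decay;
    # everything else uses the top-level optimizer settings.
    optimizer = torch.optim.AdamW(
        [
            {'params': other_params},
            {'params': norm_params, 'lr': 1e-4, 'weight_decay': 0.0},
        ],
        lr=1e-3,
        weight_decay=1e-2,
        betas=(0.9, 0.999),
        eps=1e-8,
    )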

@vchiley vchiley requested review from dakinggg and removed request for dakinggg November 22, 2023 23:10
@vchiley (Contributor, Author) commented Nov 24, 2023

potential users: @sashaDoubov @samhavens @b-chu @bcui19 @ShashankMosaicML

@vchiley vchiley marked this pull request as ready for review November 24, 2023 04:41
@j316chuck (Contributor) left a comment

Nice ✅ . Left a comment but thanks for the clean implementation.

Review comments (resolved) on:
- llmfoundry/utils/builders.py
- llmfoundry/optim/lion8b.py
- tests/test_builders.py
@vchiley vchiley merged commit 5f21855 into mosaicml:main Nov 29, 2023
10 checks passed
3 participants