Metrics refactor pt1 #1411

ishanashastri · 2022-08-13T00:17:10Z

This PR implements part 1 of the Metrics Refactor, which entails removing metrics from the Evaluator class and storing raw deep-copied metrics in the State class. The Evaluator class now stores evaluation metric names instead of metric instances, which are matched against the model-defined model.val_metrics to indicate which metrics will be computed at eval time. Additionally, instead of storing all computed metrics as part of state.current_metrics, both the raw non-computed training and validation metrics are now stored separately as part of state.train_metrics and state.eval_metrics.

All tests updated and passing, regression tests will be conducted once PR 2 is ready.

Note: PR for part 2 will also touch a lot of the same code, so expect some refactors and clean up (mostly in the Trainer class) after this PR is merged in.

Closes CO-679

hanlint

Overall LGTM, a few questions and suggested code changes for style nits.

composer/core/evaluator.py

composer/core/state.py

composer/trainer/trainer.py

bandish-shah

LGTM, please see my nit about the TODO

composer/trainer/trainer_hparams.py

hanlint

One more quick change, and I think this is ready to merge!

composer/core/state.py

This PR implements the second half of the metrics refactor (#1411) that updates the training loop and all the Composer models. The training loop now uses the previously added state.train_metrics and state.eval_metrics from #1411 to perform all training and evaluation on the correct set of metrics. The main changes with respect to models is that the models now have split up validate() into eval_forward() and update_metrics() methods, which run an evaluation forward pass and update the metrics with the outputs of the evaluation forward pass respectively. This is mainly to get rid of the double forward pass (#467).

ishanashastri added 9 commits August 1, 2022 13:53

init changes

e772aad

added raw metric to state and propagated changes

1b69cc5

Merge branch 'dev' into metrics-refactor-pt1

b2d99ac

fixed imports

6c51178

added typing_ext and fixed some pyright

5574c00

Merge branch 'dev' into metrics-refactor-pt1

60a4da0

post merge docs updates

cd32b01

handle deepseed wrapping

3c66f70

removed MetricInterface

048441b

ishanashastri requested a review from hanlint August 16, 2022 00:36

ishanashastri marked this pull request as ready for review August 16, 2022 00:36

ishanashastri requested review from knighton and a team as code owners August 16, 2022 00:36

doctests

f21f4e0

hanlint approved these changes Aug 16, 2022

View reviewed changes

ishanashastri added 2 commits August 16, 2022 13:02

addressed comments

4cbfc38

removed old metricinterface deps

ad60f78

bandish-shah approved these changes Aug 16, 2022

View reviewed changes

composer/trainer/trainer_hparams.py Outdated Show resolved Hide resolved

removed todos

6135d7b

hanlint reviewed Aug 17, 2022

View reviewed changes

composer/core/state.py Outdated Show resolved Hide resolved

remove dep warning

3054496

ishanashastri merged commit b47ecd9 into mosaicml:dev Aug 17, 2022

This was referenced Aug 17, 2022

Metrics Refactor Part 2 #1418

Closed

Metrics Refactor Part 2 #1419

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Metrics refactor pt1 #1411

Metrics refactor pt1 #1411

ishanashastri commented Aug 13, 2022 •

edited

Loading

hanlint left a comment

bandish-shah left a comment

hanlint left a comment

Metrics refactor pt1 #1411

Metrics refactor pt1 #1411

Conversation

ishanashastri commented Aug 13, 2022 • edited Loading

hanlint left a comment

Choose a reason for hiding this comment

bandish-shah left a comment

Choose a reason for hiding this comment

hanlint left a comment

Choose a reason for hiding this comment

ishanashastri commented Aug 13, 2022 •

edited

Loading