
Allow train.py-like config for eval.py #1351

Merged: 14 commits into mosaicml:main on Jul 23, 2024

Conversation

@josejg (Contributor) commented Jul 12, 2024

Currently train.py requires a config like

model:
   <model_kwargs>
load_path: foobar
tokenizer:
   <tokenizer_kwargs>

whereas eval.py requires:

models:
  - model:
      <model_kwargs>
    tokenizer:
      <tokenizer_kwargs>
    load_path: foobar
    model_name: my_model

This PR allows the user to run eval.py using the train.py syntax, which is easier when editing YAMLs directly.

I tried to make the implementation fully backwards compatible and to cover all edge cases (both keys specified, missing top-level keys like tokenizer, etc.).

EDIT: Run debug-eval-llama-3x-8b-g16-d10-B8JDJd completed successfully, and the config used model and tokenizer directly.
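For illustration, the conversion might look roughly like the sketch below. This is not the merged code: the function name, the model_name fallback, and the use of plain dicts are all assumptions.

def convert_train_to_eval_config(cfg: dict) -> dict:
    """Wrap top-level model/tokenizer/load_path keys into a single-entry
    `models` list; leave eval.py-style configs untouched."""
    if 'models' in cfg or 'model' not in cfg:
        return cfg  # already in eval.py syntax (or nothing to convert)
    entry = {
        'model_name': cfg['model'].get('name', 'model'),  # assumed naming rule
        'model': cfg.pop('model'),
    }
    for key in ('tokenizer', 'load_path'):
        if key in cfg:  # tolerate missing top-level keys
            entry[key] = cfg.pop(key)
    cfg['models'] = [entry]
    return cfg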

@josejg requested a review from a team as a code owner July 12, 2024 00:56
@irenedea (Contributor) commented:

This looks reasonable to me. Want to see if Milo has any thoughts on how to make this nicer with the config_utils.py he added. (Pinged Milo because I can't add him as a reviewer for some reason.)

@milocress (Contributor) left a comment:

This looks good; I like that we don't have to modify the EvalConfig dataclass and that the transformation happens at a lower level. One question is whether the logged config should keep the original syntax of the file or not.

I lean towards yes, in which case you may want to package your transformation into a function that is passed to make_dataclass_and_log_config, so that it is applied only to the eval config and not to the logged config.

I think it's fairly important that the config that gets logged is exactly equal to the config specified in the file.

You can pass a function to make_dataclass_and_log_config, as specified here, which does your config transformation.

Here is a good example of a config transform; I think this PR can be structured the same way.
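A rough picture of that wiring is sketched below. The exact signature of make_dataclass_and_log_config (the transforms keyword, the set of field names, and the return values) is a guess based on this comment thread, not a quote of the real API.

from dataclasses import fields
from omegaconf import OmegaConf as om
from llmfoundry.utils.config_utils import EvalConfig, make_dataclass_and_log_config

cfg = om.to_container(om.load('eval.yaml'))  # the user's YAML, parsed
eval_config_keys = {f.name for f in fields(EvalConfig)}  # hypothetical

logged_cfg, eval_config = make_dataclass_and_log_config(
    cfg,
    EvalConfig,
    eval_config_keys,
    transforms=[convert_train_to_eval_config],  # applied to the parsed config,
)                                               # not to the logged one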

@dakinggg (Collaborator) left a comment:


LGTM, could go either way on Milo's suggestion.

@josejg (Contributor, Author) commented Jul 17, 2024

I like @milocress's suggestion; I will try to rework it as a config transform. Some clarifications:

  • Should I add it to the transform registry?
  • Would the logic be something like "if model is in the config, pass the transform to the make_dataclass_and_log_config fn"?

@dakinggg (Collaborator) commented Jul 17, 2024

The existing config transforms were added for train (and will all be applied for train), so I'd probably just make a function for now and not register it. We can revisit later if there is reason to.

And I'd suggest not conditionally applying the transform, but rather having the transform itself handle the "if model in config" check.
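In other words, something like this pattern (reusing the hypothetical helper sketched in the PR description above):

def eval_config_transform(cfg: dict) -> dict:
    # Applied unconditionally; the guard lives inside the transform.
    if 'model' not in cfg:
        return cfg  # already eval.py syntax: a no-op
    return convert_train_to_eval_config(cfg)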

@dakinggg (Collaborator) left a comment:


LGTM, please add a manual test run in the PR description showing the functionality works.

[Resolved review thread on scripts/eval/eval.py (outdated)]
@josejg (Contributor, Author) commented Jul 17, 2024

Updated the PR with a run that completed successfully with this transformation

@josejg enabled auto-merge (squash) July 22, 2024 23:42
@josejg merged commit eb41a6e into mosaicml:main Jul 23, 2024
9 checks passed
@josejg mentioned this pull request Jul 29, 2024