log eval dataset misconfiguration #1179

milocress · 2024-05-07T00:59:11Z

Make Eval Dataset Misconfiguration Errors Visible through Mosaic Logger

Wraps eval dataset creation in a try/except block that logs the exception to the Mosaic logger and then re-raises it.
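A minimal sketch of the pattern described above, not the actual diff. `StubLogger`, `build_with_logging`, and `broken_eval_builder` are hypothetical names introduced for illustration; only the `context` attribute and the `log_exception` call are taken from the traces in this description.

```python
from typing import Any, Callable, Optional


class StubLogger:
    """Hypothetical stand-in for the Mosaic logger; only log_exception is modeled."""

    def __init__(self) -> None:
        self.logged: list = []

    def log_exception(self, exc: Exception) -> None:
        # Record the exception so the failure is visible outside the traceback.
        self.logged.append(exc)


def build_with_logging(
    build_fn: Callable[[], Any],
    context: str,
    mosaicml_logger: Optional[StubLogger] = None,
) -> Any:
    """Run build_fn; on failure, tag the exception, log it, and re-raise."""
    try:
        return build_fn()
    except Exception as e:
        if mosaicml_logger is not None:
            # Attribute name as it appears in the traces ('context').
            e.context = context
            mosaicml_logger.log_exception(e)
        # Re-raise so the run still fails with the original error.
        raise


def broken_eval_builder() -> None:
    # Hypothetical misconfiguration: a required key is missing.
    raise KeyError('dataset')
```

The sketch uses a bare `raise` where the traces show `raise e`; both re-raise the caught exception. A later commit in this PR renames `context` to `location`, so the attribute name in the merged code may differ from the one used here.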

In Train context

Run name: mpt-125m-chinchilla-regression-6OSoWo (the log trace shows that the error is caught and the context attribute is set)

[rank7]: │   623 │   │   if mosaicml_logger is not None:                                │
[rank7]: │   624 │   │   │   e.context = 'TrainContext'                                 │
[rank7]: │   625 │   │   │   mosaicml_logger.log_exception(e)                           │
[rank7]: │ ❱ 626 │   │   raise e

In Eval context

Run name: mpt-125m-chinchilla-regression-6OSoWo (the log trace shows that the error is caught and the context attribute is set)

[rank0]: │   656 │   │   │   if mosaicml_logger is not None:                            │
[rank0]: │   657 │   │   │   │   e.context = 'EvalContext'                              │
[rank0]: │   658 │   │   │   │   mosaicml_logger.log_exception(e)                       │
[rank0]: │ ❱ 659 │   │   │   raise e

dakinggg

Can you add a run name for a manual test?

jjanezhang

Looking great, had a comment on renaming so we can use this for many things in the future.

Flagging that the dataset type won't be added in the convert_delta_to_json.py and convert_text_to_mds.py. I think we would have to parse the datapath to get the train split to wrap the error if you'd like to take a stab! Also happy to do it :) thanks so much Milo!

llmfoundry/utils/exceptions.py

milocress · 2024-05-13T16:48:09Z

> Flagging that the dataset type won't be added in the convert_delta_to_json.py and convert_text_to_mds.py. I think we would have to parse the datapath to get the train split to wrap the error if you'd like to take a stab! Also happy to do it :) thanks so much Milo!

I am happy to do this, but let's split it into a separate PR so this one stays small.

jjanezhang

small nit, otherwise lgtm

scripts/train/train.py

log eval dataset misconfiguration

956b2bd

milocress requested a review from dakinggg May 7, 2024 00:59

dakinggg reviewed May 7, 2024

milocress added 4 commits May 6, 2024 21:50

Merge branch 'main' into milo/log-eval-dataset-misconfiguration

8c944b8

use context

bb5a1e0

Merge branch 'milo/log-eval-dataset-misconfiguration' of github.com:milocress/llm-foundry into milo/log-eval-dataset-misconfiguration

9ea2f39

literally

aabf3e2

milocress requested a review from dakinggg May 7, 2024 21:34

milocress added 3 commits May 7, 2024 21:37

BaseException -> Exception

7ea0ea1

use my archaeological skills to find the right python syntax for 3.9

1f36602

Merge branch 'main' into milo/log-eval-dataset-misconfiguration

970dd35

jjanezhang reviewed May 9, 2024

llmfoundry/utils/exceptions.py (outdated, resolved)

milocress added 5 commits May 9, 2024 13:29

Merge branch 'main' into milo/log-eval-dataset-misconfiguration

85e31b7

refactor names for more general use

b7af364

refactor for generality

88ee43f

oops

0af1c18

Merge branch 'main' into milo/log-eval-dataset-misconfiguration

7063674

milocress added 2 commits May 13, 2024 16:58

oops II

234ea4a

merged

fa4e851

milocress requested a review from jjanezhang May 13, 2024 17:11

context -> location

7fb4e56

jjanezhang approved these changes May 14, 2024

scripts/train/train.py (outdated, resolved)

milocress added 5 commits May 14, 2024 18:21

use variables instead of strings

d0dd63d

Merge branch 'main' into milo/log-eval-dataset-misconfiguration

a5f3fe0

Merge branch 'main' into milo/log-eval-dataset-misconfiguration

7cfe995

Update exceptions.py

1f4d9e6

delete Mapping

91f2287

milocress enabled auto-merge (squash) May 15, 2024 22:01

Merge branch 'main' into milo/log-eval-dataset-misconfiguration

fe0a1ab

milocress merged commit cfee4e4 into mosaicml:main May 15, 2024
9 checks passed
