Multi eval dataset logging #603

snarayan21 · 2023-09-16T18:00:06Z

Previously, only one eval dataloader was supported in llm-foundry (although multiple ICL eval tasks and the gauntlet were supported). Now users can add custom eval datasets by turning eval_dataloader into a list of dataloaders in their yaml. It would look like this:

Before:

eval_loader:
  name: text
  dataset: ...
  drop_last: false
  num_workers: 8

After:

eval_loader:
- label: first
  name: text
  dataset: ...
  drop_last: false
  num_workers: 8
- label: second
  name: text
  dataset: ...
  drop_last: false
  num_workers: 8

Users must specify a label for each eval dataloader so that these metrics are logged separately. This functionality was tested on wandb, mlflow, and tensorboard, shown below.

wandb:

mlflow:

tensorboard:

…parately

dakinggg

Implementation looks good, thanks! Please remove the accidentally committed files, and add a simple unit test.

scripts/train/train.py

irenedea

LGTM with some minor comments

scripts/train/train.py

tests/test_train_inputs.py

tests/test_training.py

irenedea

Can we add a unit test that tests two eval loaders with two different datasets?

…ulti-eval-dataset-logging merging

tests/test_training.py

tests/test_data_prep_scripts.py

irenedea

LGTM! Just a minor comment about using tmp_path instead

snarayan21 added 2 commits September 16, 2023 10:35

added support for multiple eval datasets and logging their metrics se…

3824d6c

…parately

added support for multiple eval datasets and logging their metrics se…

0140b4e

…parately

dakinggg reviewed Sep 17, 2023

View reviewed changes

scripts/train/train.py Outdated Show resolved Hide resolved

scripts/train/train.py Outdated Show resolved Hide resolved

scripts/train/train.py Outdated Show resolved Hide resolved

snarayan21 added 2 commits September 17, 2023 11:19

fixed comments, deleted accidentally added files

92906e5

added tests

3b33f79

dakinggg requested a review from irenedea September 21, 2023 00:40

irenedea approved these changes Sep 21, 2023

View reviewed changes

scripts/train/train.py Outdated Show resolved Hide resolved

scripts/train/train.py Outdated Show resolved Hide resolved

scripts/train/train.py Show resolved Hide resolved

tests/test_train_inputs.py Outdated Show resolved Hide resolved

irenedea reviewed Sep 21, 2023

View reviewed changes

tests/test_training.py Outdated Show resolved Hide resolved

irenedea requested changes Sep 21, 2023

View reviewed changes

snarayan21 added 2 commits September 25, 2023 12:58

Merge branch 'main' of https://github.com/mosaicml/llm-foundry into m…

783e365

…ulti-eval-dataset-logging merging

added multi-dataset tests, linting

87a92bf

snarayan21 force-pushed the multi-eval-dataset-logging branch from ebb8a30 to 87a92bf Compare September 25, 2023 22:48

snarayan21 requested a review from irenedea September 25, 2023 23:18

irenedea reviewed Sep 26, 2023

View reviewed changes

tests/test_training.py Outdated Show resolved Hide resolved

tests/test_data_prep_scripts.py Show resolved Hide resolved

irenedea approved these changes Sep 27, 2023

View reviewed changes

modified to use tmp_path

3687be2

snarayan21 force-pushed the multi-eval-dataset-logging branch from 9416cfb to 3687be2 Compare September 27, 2023 17:35

snarayan21 added 3 commits September 27, 2023 10:36

Merge branch 'main' into multi-eval-dataset-logging

ca729a5

modified to use tmp_path

7ad4c8d

merged main

5d9d824

snarayan21 enabled auto-merge (squash) September 27, 2023 20:52

snarayan21 merged commit 3d4fa0f into main Sep 27, 2023
9 checks passed

dakinggg deleted the multi-eval-dataset-logging branch October 11, 2023 21:30

germanjke mentioned this pull request Nov 28, 2023

eval drops when i have multi_eval dataset mosaicml/composer#2743

Closed

dakinggg mentioned this pull request Apr 20, 2024

Mlflow datasets #1119

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Multi eval dataset logging #603

Multi eval dataset logging #603

snarayan21 commented Sep 16, 2023 •

edited

Loading

dakinggg left a comment

irenedea left a comment

irenedea left a comment

irenedea left a comment

Multi eval dataset logging #603

Multi eval dataset logging #603

Conversation

snarayan21 commented Sep 16, 2023 • edited Loading

dakinggg left a comment

Choose a reason for hiding this comment

irenedea left a comment

Choose a reason for hiding this comment

irenedea left a comment

Choose a reason for hiding this comment

irenedea left a comment

Choose a reason for hiding this comment

snarayan21 commented Sep 16, 2023 •

edited

Loading