log details to metadata for run analytics #992

angel-ruiz7 · 2024-02-23T00:21:27Z

This will log information via the MosaicMLLogger to place the following keys in a run's metadata for analytics purposes. The data to log includes

model_name: string
script: 'Training', 'Eval'
train_task_type: PRETRAIN, CONTINUED_PRETRAIN, INSTRUCTION_FINETUNE
train_loader_name: string
train_dataset_hf_name: string
eval_loader_name: string
eval_dataset_hf_name: string
tokenizer_name: string
n_heads: number
d_model: int
callbacks: string[]
train_loader_workers: int
eval_loader_workers: int
gauntlet_configured: boolean
icl_configured: boolean

Screenshots

Using the `Quickstart` example

Using the `gpt2-small` example

angel-ruiz7 · 2024-02-23T00:27:12Z

still need to add the subtype. what's the best approach for this? currently, we classify the run by looking through the command to find the name of specific yaml or python files. what would be the best approach to do this inside of train.py?

cc @aspfohl @irenedea

aspfohl

More of an ask for foundry team: Is there anything else that would be useful for analytics?

Should we make it configurable to turn this on or off? Or is presence of MosaicMLLogger enough (users could always turn it off via MOSAICML_PLATFORM env var)

scripts/eval/eval.py

…d conver to lowercase

…l/llm-foundry into angel/log-data-for-run-analytics

…-data-for-run-analytics

This reverts commit 43be314.

llmfoundry/utils/mosaicmllogger_utils.py

llmfoundry/utils/logging_utils.py

…ser instead

…l/llm-foundry into angel/log-data-for-run-analytics

irenedea

Can you include evidence of this working in the PR description? Some manual tests and screenshots would be good.

irenedea

🚀 LGTM! Just one super super tiny formatting comment :) Thanks Angel! Will be great to have more logging and data 😄

llmfoundry/utils/__init__.py

…l/llm-foundry into angel/log-data-for-run-analytics

angel-ruiz7 added 2 commits February 22, 2024 16:04

add uses_llmfoundry, model_name, and llmfoundry_run_type

c0ac767

Merge branch 'main' into angel/log-data-for-run-analytics

fccabd9

angel-ruiz7 requested review from aspfohl, dakinggg and irenedea February 23, 2024 00:28

aspfohl reviewed Feb 26, 2024

View reviewed changes

scripts/eval/eval.py Outdated Show resolved Hide resolved

scripts/eval/eval.py Outdated Show resolved Hide resolved

scripts/eval/eval.py Outdated Show resolved Hide resolved

scripts/eval/eval.py Outdated Show resolved Hide resolved

angel-ruiz7 added 23 commits February 26, 2024 11:17

remove uses_llmfoundry flag, prefix with mosaicml/llmfoundry/, an…

fe8948b

…d conver to lowercase

Merge branch 'angel/log-data-for-run-analytics' of github.com:mosaicm…

faede2c

…l/llm-foundry into angel/log-data-for-run-analytics

get model_name from pretrained_model_name_or_path

1e28dec

Merge branch 'main' of github.com:mosaicml/llm-foundry into angel/log…

a935c42

…-data-for-run-analytics

fix quotes

e8bab05

add TODO comments and remove redundant flushing from eval.py

f49d6b3

get llmfoundry_run_subtype for training runs

f726fe2

check for mosaicml_logger before logging

40e3b83

fix reportUnboundVariable linting error

599807c

add tokenizer and train/eval loader names

a46cc8b

use brackets to get name from model_config

ed2dead

add cloud provider from load / save paths

4165d59

add num_workers for eval_loader and train_loader

598ccb1

Merge branch 'main' into angel/log-data-for-run-analytics

7cb9e0f

try to fix key error

43be314

Revert "try to fix key error"

09dcd56

This reverts commit 43be314.

add d_model, callbacks, and vocab_size

7aa5f34

format, add n_heads

8afa7ad

Merge branch 'main' into angel/log-data-for-run-analytics

e296693

format + support ListConfig and DictConfig for eval_loader_config

336d697

fix access issues with loader_config

850e587

use get() instead of brackets

d098c6c

use get instead of brackets

dcaf7a7

move helpers into mosaicmllogger_utils.py

a95b2fd

irenedea reviewed Mar 12, 2024

View reviewed changes

llmfoundry/utils/mosaicmllogger_utils.py Show resolved Hide resolved

llmfoundry/utils/logging_utils.py Outdated Show resolved Hide resolved

angel-ruiz7 added 5 commits March 12, 2024 11:30

give a description that makes pydocstyle happy

265d508

log cloud_provider_data and cloud_provider_checkpoints from compo…

6aecab8

…ser instead

run formatters

ca0df5e

Merge branch 'main' into angel/log-data-for-run-analytics

84720dc

Merge branch 'main' into angel/log-data-for-run-analytics

50b284c

angel-ruiz7 requested a review from irenedea March 15, 2024 20:35

angel-ruiz7 added 2 commits March 15, 2024 13:38

remove TODOs

7f57a8a

Merge branch 'angel/log-data-for-run-analytics' of github.com:mosaicm…

31d3b79

…l/llm-foundry into angel/log-data-for-run-analytics

irenedea reviewed Mar 15, 2024

View reviewed changes

angel-ruiz7 added 5 commits March 18, 2024 11:19

fix import

564fc41

Revert "fix import"

ccfb4fa

combine both import methodss for mosiacmllogger_utils

3e5f9a4

only import from utils

5806493

format files

b5ed2f3

angel-ruiz7 requested a review from irenedea March 21, 2024 21:20

build loggers outside of evaluate_model

07abaa7

irenedea approved these changes Mar 22, 2024

View reviewed changes

llmfoundry/utils/__init__.py Outdated Show resolved Hide resolved

angel-ruiz7 added 9 commits March 21, 2024 17:52

merge and resolve conflicts

309f683

run formatter on __init__

0252283

create MosaicMLLogger if it doesn't exist in eval.py

a59fdc9

Merge branch 'main' into angel/log-data-for-run-analytics

830683e

docstring fixes

54b78a9

Merge branch 'angel/log-data-for-run-analytics' of github.com:mosaicm…

be2294a

…l/llm-foundry into angel/log-data-for-run-analytics

oops loggers is definitely supposed to be a List

78925c4

don't add mosaicml_logger if it's None

dd792cc

do the same thing for train.py

e6b43a8

angel-ruiz7 merged commit 31e4879 into main Mar 23, 2024
10 checks passed

KuuCi pushed a commit that referenced this pull request Apr 18, 2024

log details to metadata for run analytics (#992)

8245472

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

log details to metadata for run analytics #992

log details to metadata for run analytics #992

angel-ruiz7 commented Feb 23, 2024 •

edited

Loading

angel-ruiz7 commented Feb 23, 2024 •

edited

Loading

aspfohl left a comment

irenedea left a comment

irenedea left a comment

log details to metadata for run analytics #992

log details to metadata for run analytics #992

Conversation

angel-ruiz7 commented Feb 23, 2024 • edited Loading

Screenshots

Using the Quickstart example

Using the gpt2-small example

angel-ruiz7 commented Feb 23, 2024 • edited Loading

aspfohl left a comment

Choose a reason for hiding this comment

irenedea left a comment

Choose a reason for hiding this comment

irenedea left a comment

Choose a reason for hiding this comment

angel-ruiz7 commented Feb 23, 2024 •

edited

Loading

Using the `Quickstart` example

Using the `gpt2-small` example

angel-ruiz7 commented Feb 23, 2024 •

edited

Loading