Add: mlflow for experiment tracking #1059

JohanWork · 2024-01-07T16:27:23Z

Adding MLFOW to Axolotl for experiment tracking, looked into how Weight and Bias has been setup and tried to follow the same pattern. Have tested the changes and everything looks good to me.

Happy for any feedback or comments.

adding mlflow

Imports for mlflow

winglian

Thanks for the PR!

winglian · 2024-01-07T16:58:54Z

README.md

@@ -694,6 +694,10 @@ wandb_name: # Set the name of your wandb run
 wandb_run_id: # Set the ID of your wandb run
 wandb_log_model: # "checkpoint" to log model to wandb Artifacts every `save_steps` or "end" to log only at the end of training

+# mlflow configuration if you're using it
+# Make sure your `MLFLOW_TRACKING_URI` is set.
+mlflow_experiment_name: # Your experiment name


It might be worth logging a warning when this is set but the uri env isnt.

Wouldn't it be better to make a config mlflow_tracking_uri: ? The parsing code should be able to set that if needed.

I agree with what you propose @NanoCode012 . Have update the code.

Update mlflow_tracking_uri

NanoCode012 · 2024-01-08T12:46:36Z

Is there any need to pass this config to the HF Trainer to enable log? I think some validation test would also be needed to make sure both aren't active at same time.

In addition, would you be able to provide an example run output + image on mlflow?

winglian · 2024-01-08T13:59:34Z

I disagree, isn't the tracking uri equivalent to a secret token? It can get inadvertently exposed if you share your YAML (say integration with wandb also)

JohanWork · 2024-01-08T15:39:40Z

I disagree, isn't the tracking uri equivalent to a secret token? It can get inadvertently exposed if you share your YAML (say integration with wandb also)

To my understanding it is only an url actually. In Mlflow auth seams to be handled with username and password. But I might miss something. For reference https://mlflow.org/docs/latest/auth/index.html

I have no strong opinion if it should be or not be in the config file.

NanoCode012 · 2024-01-08T15:54:39Z

Oh, I wasn't aware it was a secret. In this case, I would not recommend it being in the yaml anymore due to reasons wing mentioned.

update trainer building

winglian · 2024-01-08T17:03:54Z

looks like the URI isn't considered a secret, but could be sensitive if someone starts a server and doesn't enable auth on it.
MLFLOW_TRACKING_USERNAME and MLFLOW_TRACKING_PASSWORD are the respective credentials

src/axolotl/core/trainer_builder.py

JohanWork and others added 7 commits January 4, 2024 09:42

Update requirements.txt

95ff2b9

adding mlflow

Update __init__.py

8161937

Imports for mlflow

Update README.md

8ce2780

Create mlflow_.py (#1)

bea9a47

Update README.md

63e41ea

fix precommits

f2fdc3a

Merge branch 'OpenAccess-AI-Collective:main' into adding-mlflow

f04da39

JohanWork changed the title ~~Adding mlflow for experiment tracking~~ Add: mlflow for experiment tracking Jan 7, 2024

winglian approved these changes Jan 7, 2024

View reviewed changes

Update README.md

9fe5420

Update mlflow_tracking_uri

Update trainer_builder.py

f4c35d8

update trainer building

NanoCode012 reviewed Jan 8, 2024

View reviewed changes

src/axolotl/core/trainer_builder.py Outdated Show resolved Hide resolved

chore: lint

e7120e5

winglian reviewed Jan 8, 2024

View reviewed changes

src/axolotl/core/trainer_builder.py Outdated Show resolved Hide resolved

make ternary a bit more readable

1086683

winglian merged commit 090c24d into axolotl-ai-cloud:main Jan 9, 2024
6 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add: mlflow for experiment tracking #1059

Add: mlflow for experiment tracking #1059

JohanWork commented Jan 7, 2024

winglian left a comment

winglian Jan 7, 2024

NanoCode012 Jan 8, 2024

JohanWork Jan 8, 2024

NanoCode012 commented Jan 8, 2024

winglian commented Jan 8, 2024

JohanWork commented Jan 8, 2024

NanoCode012 commented Jan 8, 2024

winglian commented Jan 8, 2024

Add: mlflow for experiment tracking #1059

Add: mlflow for experiment tracking #1059

Conversation

JohanWork commented Jan 7, 2024

winglian left a comment

Choose a reason for hiding this comment

winglian Jan 7, 2024

Choose a reason for hiding this comment

NanoCode012 Jan 8, 2024

Choose a reason for hiding this comment

JohanWork Jan 8, 2024

Choose a reason for hiding this comment

NanoCode012 commented Jan 8, 2024

winglian commented Jan 8, 2024

JohanWork commented Jan 8, 2024

NanoCode012 commented Jan 8, 2024

winglian commented Jan 8, 2024