Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Make mpt7b finetuning more obvious #101

Merged
merged 10 commits into from
May 13, 2023
Merged

Make mpt7b finetuning more obvious #101

merged 10 commits into from
May 13, 2023

Conversation

samhavens
Copy link
Contributor

@samhavens samhavens commented May 10, 2023

Resolves RESEARCH-710

Also change the folder structure of the YAMLs to make it more clear which MPTs are pretrained from the HF Hub and which are an architecture for pretraining.

WARNING: This will not work until #90 is merged!

@vchiley
Copy link
Contributor

vchiley commented May 11, 2023

Note: #90 is merged

Copy link
Contributor

@alextrott16 alextrott16 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

A couple minor suggestions to correct some issues with model configs but otherwise looks great!

BTW t5-small_dolly_sft.yaml also has the device instead of init_device issue, but I couldn't suggest a change on it because it was just a rename. Please apply the same fix there.

scripts/train/yamls/finetune/mpt-7b_dolly_sft.yaml Outdated Show resolved Hide resolved
scripts/train/yamls/finetune/mpt-7b_dolly_sft.yaml Outdated Show resolved Hide resolved
@alextrott16 alextrott16 merged commit 918a22a into main May 13, 2023
@alextrott16 alextrott16 deleted the mpt7b-ft-yaml branch May 13, 2023 00:41
bmosaicml pushed a commit that referenced this pull request Jun 6, 2023
bmosaicml pushed a commit that referenced this pull request Jun 6, 2023
* make mpt7b finetuning more obvious

* change yaml structure and references to paths

* needed to intall pre-commit

* fix merge issue

* Local dataset rework

* Apply suggestions from code review

* YAML touch ups

---------

Co-authored-by: Vitaliy Chiley <6439018+vchiley@users.noreply.github.com>
Co-authored-by: Alex Trott <alex@mosaicml.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

5 participants