-
Notifications
You must be signed in to change notification settings - Fork 72
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
DocOwl1.5 training code? #51
Comments
Hi, @coder4nlp , the training code is scheduled for release at the end of this month. If you are urgent to finetune our model, you can refer to the training code of mPLUG-Owl2 and make some revisions to adjust to our model. Some hyper-parameters can refer to our paper. |
almost there! |
training codes with DeepSpeed is under debugging and testing 。゚・ (>﹏<) ・゚。 |
@HAWLYQ So sad......。゚・ (>﹏<) ・゚。 |
@HAWLYQ can you test for deepspeed stage 3 integrations, specifically for deadlock issues while training/fine-tuning? |
where are the schedules? |
Within this week~ |
Hi, @coder4nlp @whalefa1I @AR-javis We have released training codes for finetuning docowl1.5 in https://github.com/X-PLUG/mPLUG-DocOwl/tree/main/DocOwl1.5. It's temporarily supported by DeepSpeed zero2. We meet deadlock issues with zero3, if you have any suggestions to share with us, we will appreciate very much~ |
@HAWLYQ Thank you very much! |
hello, how about the venv requirements? I've not seen the requirements.txt. |
Hi, @Coobiw , our environment is the same as mPLUG-Owl2, you can follow instructions at https://github.com/X-PLUG/mPLUG-Owl/tree/main/mPLUG-Owl2 to prepare environments. |
When will the training code be released?Thx.
The text was updated successfully, but these errors were encountered: