-
Notifications
You must be signed in to change notification settings - Fork 72
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
issue while finetuning DocOwl1.5-Omni on dataset #78
Comments
Hi, @AkshataABhat, could you provide more details, such as the GPU device and training script? |
@HAWLYQ Training script is:
|
Hi, @AkshataABhat, the training script seems ok~ I have tested the script with A100-80G and am not sure whether it works well on A100-40G~ We will try whether it works on V100-32G, but due to the work schedule and limited machine resources, this won't be soon, sry for that~ |
@HAWLYQ
also, in train_docowl.py, the code is getting executed until the below line:
about 35 GB is occupied until this step.
pls guide whether this is a gpu issue? or would the script work if checkpoints were locally available. |
The training does not start..my memory is completely occupied but GPU is at 0%.
Screenshot attached below. Pls help .
The text was updated successfully, but these errors were encountered: