-
Notifications
You must be signed in to change notification settings - Fork 87
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
problem when finetuning DocOwl1.5-Omni #82
Comments
Hi, @lmydian1014 this error is similar with #65. This may be because the number of image placeholders in the text input is not equal to the number of image features. There may be multiple input images and only 1 <|image|> in your query. |
Thanks!
What you mean is that I need to have 15 images in ./images folder and also need to change image name in ./images folder and in .json file as "81ouNxgyqBL_1.jpg", ''81ouNxgyqBL_2.jpg'', ''81ouNxgyqBL_3.jpg''? |
Hi, @lmydian1014 , this JSON file is ok. Please check whether all samples in your JSON file raise the IndexError or just partial samples. If only partial samples, could you provide some examples? (ps: the bath_idx=0 is because the per_device_train_batch_size=1 ) |
Hi, I encountered this problem again after around 30 training steps, I tried to print some variables
The output is as follows,
I have double checked that the number of images is equal to the number of <|image|> in the .json file. I am quite confused what is the problem here. Could you please help me regarding how to identify which samples raise this index error? it seems printing those three variables doesn't help much in debugging this issue. Many thanks! |
I have the following error when finetune the DocOwl1.5-Omni. It always raises error when index is 10. Please help!!!
Below is my script.
I have tried to print the batch idx, it's weird that it always equal to 0.
screenshot is provided below
The text was updated successfully, but these errors were encountered: