-
Notifications
You must be signed in to change notification settings - Fork 88
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
The model cannot converge when training #11
Comments
Since there are ~320,000 subimages, about 320,000 / 14 = 23,000 iterations is 1 epoch. Don't pray loss falling down before 1 epoch.
|
Thanks! u are right. After 40,000 interations, the loss tends to be 4. |
Maybe too high, maybe not comparable. Final AP should be close to YOLOv2. |
Hi,
I just followed the instruction to train the SSD model, but the loss can't fall.
At the beginning, the base_lr= 0.001 but the loss=nan
Then, I set a lower base_lr = 0.0001 , the loss drops from 40+ to ~10 ,and don't have any change.
Next, I kill the training and set the base_lr=0.001 and resume to train, the loss = nan again.
So, maybe the 0.01 is too big for the model, I lower learning rate which base_lr= 0.0004, but the loss is aways ~8.
how much the loss in the SSD model will finally be? and can you give me some advice to training the data?
The text was updated successfully, but these errors were encountered: