I cannot reproduce the best result of Dss v2 #8

Open
Sucran opened this issue Oct 20, 2018 · 4 comments

Comments

@Sucran

Sucran commented Oct 20, 2018

I copied your model, loss, and optimizer definitions and kept all other settings the same, except that I set the learning rate to 1e-4 and the number of epochs to 100. By epoch 100 the training curve appears to have converged, but when I test the model it gives an MAE of 0.069 and a MaxFb of 0.880. Did you change any default settings in your training?

@AceCoooool
Owner

I just use 1e-6 as the learning rate.

  1. After 100 epochs, the loss changes slowly. (You can adjust your learning rate; 1e-4 may be too large. For example, use a dynamic learning rate.)
  2. You can plot log(loss) to amplify your loss curve and see whether it has converged (this is unfriendly in visdom).
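
The two suggestions above can be sketched in plain Python. This is only an illustration, not the code used in this repository: the helper names `step_decay_lr` and `log_losses`, the step size of 100 epochs, and the decay factor of 0.1 are all assumptions. In PyTorch, `torch.optim.lr_scheduler.StepLR` provides a comparable step schedule.

```python
import math


def step_decay_lr(base_lr, epoch, step_size=100, gamma=0.1):
    """Hypothetical helper: multiply base_lr by gamma once every step_size epochs."""
    return base_lr * (gamma ** (epoch // step_size))


def log_losses(losses, eps=1e-12):
    """Hypothetical helper: natural log of each loss value.

    eps guards against log(0); small changes in a nearly flat
    loss curve are easier to see on this scale.
    """
    return [math.log(max(loss, eps)) for loss in losses]


if __name__ == "__main__":
    # Learning rate at a few epochs, starting from 1e-4 (assumed values).
    for epoch in (0, 99, 100, 250):
        print(epoch, step_decay_lr(1e-4, epoch))
    # A slowly decreasing loss curve, viewed on a log scale.
    print(log_losses([0.5, 0.05, 0.049, 0.0485]))
```

Plotting the output of `log_losses` instead of the raw loss makes it easier to tell a true plateau from a slow but steady decrease.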

@Sucran
Author

Sucran commented Oct 22, 2018

@AceCoooool Thank you for your reply.
I am training again with a learning rate of 1e-6 for 700 epochs.
I log 1) the training loss for each batch and 2) the average training and validation loss for each epoch. In addition, after each epoch I plot 3) the MAE and 4) the max F-beta score, just as you do in your code. Thanks for your work; the visualization code was easy for me to use.
By "converge" I meant that the MAE, F-beta, and loss all change very slowly. Maybe a dynamic learning rate would help. Thank you again.
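
For readers unfamiliar with the two metrics tracked above, here is a minimal per-image sketch in plain Python. It is not the evaluation code from this repository: the function names are hypothetical, `beta_sq=0.3` is the value commonly used in salient object detection and is assumed here, and the 255-step threshold sweep is likewise an assumption. Dataset-level evaluation usually aggregates precision and recall over all images before computing the score, which this sketch does not do.

```python
def mae(pred, gt):
    """Mean absolute error between predicted saliency values in [0, 1]
    and binary ground-truth labels, both given as flat lists."""
    return sum(abs(p - g) for p, g in zip(pred, gt)) / len(pred)


def max_f_beta(pred, gt, beta_sq=0.3, num_thresholds=255):
    """Maximum F-beta score over a sweep of binarization thresholds.

    beta_sq and num_thresholds are assumed defaults, not values
    confirmed by this thread.
    """
    n_gt = sum(gt)
    best = 0.0
    for i in range(num_thresholds):
        t = i / num_thresholds
        n_pred = sum(1 for p in pred if p > t)
        tp = sum(1 for p, g in zip(pred, gt) if p > t and g == 1)
        if tp == 0:
            continue  # precision and recall are both zero here
        precision = tp / n_pred
        recall = tp / n_gt
        f = (1 + beta_sq) * precision * recall / (beta_sq * precision + recall)
        best = max(best, f)
    return best


if __name__ == "__main__":
    pred = [0.9, 0.8, 0.1, 0.2]  # toy flattened saliency map
    gt = [1, 1, 0, 0]            # toy flattened binary mask
    print(mae(pred, gt), max_f_beta(pred, gt))
```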

@holyhao

holyhao commented Oct 25, 2018

Thanks for the work.
So, did you get better results than those reported by AceCoooool? I find that the reproduced results are lower than those in the paper. I wonder whether something is different. Should we try some data augmentation, increase the input size, or something else?

@tuhinkm

tuhinkm commented Feb 25, 2019

I can't reproduce the best results from the paper. Has anyone done it? Kindly share how...
