
what does the training loss curve look like #27

Open
ghost opened this issue Nov 19, 2019 · 5 comments

Comments

@ghost

ghost commented Nov 19, 2019

I'm trying to train SSN via train_ssn.py, but after running ~40,000 iterations the training loss mostly jitters without any meaningful decrease. I know from reading previous issues that convergence takes roughly 500,000 iterations, but with my computing resources it would take a few days to reach convergence.

So I was wondering whether the authors could kindly tell me / show me what the training loss curve looks like as a function of iteration number, starting from iteration 0 all the way to convergence.

Thank you in advance.

@varunjampani
Contributor

Sorry, I don't have any saved logs or plots for this, and I may not find time soon to re-train and produce them.

@ghost
Author

ghost commented Nov 20, 2019

Would it be OK if I asked the following two questions instead?

  1. Is it unusual that the training loss sometimes increases during the early iterations? Can the training loss still decrease after increasing early on?

  2. How many iterations does it take for the training loss to start decreasing in a meaningful way?

@varunjampani
Contributor

If I remember correctly, Adam optimization produces a training curve with ups and downs, but I do not remember how strongly the loss fluctuates.
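As a rough illustration (not from the SSN code), one way to tell whether a curve with Adam's ups and downs is actually trending downward is to smooth the logged loss values with an exponential moving average. The loss values below are synthetic and purely for demonstration:

```python
import math
import random

def ema_smooth(values, alpha=0.98):
    """Exponential moving average; alpha close to 1 means heavier smoothing."""
    smoothed = []
    running = values[0]
    for v in values:
        running = alpha * running + (1 - alpha) * v
        smoothed.append(running)
    return smoothed

# Synthetic noisy-but-decreasing loss curve, standing in for a real training log.
random.seed(0)
raw = [math.exp(-i / 2000.0) + random.gauss(0, 0.1) for i in range(5000)]

trend = ema_smooth(raw)
print(f"raw last value: {raw[-1]:.3f}, smoothed last value: {trend[-1]:.3f}")
```

With enough smoothing, a slow downward trend becomes visible even when individual iterations jitter heavily.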

@CYang0515

@coarsesand In my experiments, the position loss increases with iterations, while the reconstruction loss decreases gradually.

@ghost
Author

ghost commented Jan 19, 2020

@CYang0515 does the decrease in reconstruction loss eventually overpower the increase in position loss, so that the overall combined loss decreases as the number of iterations grows?
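One way to answer this empirically is to log the two loss components separately alongside their sum. This is a hypothetical sketch, not code from train_ssn.py; the names, weights, and synthetic loss values are illustrative assumptions only:

```python
# Track position loss, reconstruction loss, and their weighted sum per
# iteration, so you can see which component drives the combined loss.
history = {"position": [], "reconstruction": [], "combined": []}

def log_losses(pos_loss, recon_loss, pos_weight=1.0):
    """Record both components and the combined loss; pos_weight is assumed."""
    combined = pos_weight * pos_loss + recon_loss
    history["position"].append(pos_loss)
    history["reconstruction"].append(recon_loss)
    history["combined"].append(combined)
    return combined

# Synthetic example matching the behavior described above: position loss
# creeps upward while reconstruction loss falls faster.
for i in range(1000):
    pos = 0.1 + 0.0001 * i          # slowly increasing
    recon = 1.0 / (1.0 + 0.01 * i)  # decreasing faster
    log_losses(pos, recon)

print(f"combined: first={history['combined'][0]:.3f}, "
      f"last={history['combined'][-1]:.3f}")
```

If the reconstruction loss falls faster than the position loss rises (as in this synthetic run), the combined curve still trends downward overall.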
