Why can't my training loss decrease with ArcFace? It drops to about 21 and stops falling. #3

LHH20000923 opened this issue Jan 8, 2022 · 3 comments

Comments

@LHH20000923

No description provided.

fdbtrs (Owner) commented Jan 14, 2022

Which dataset do you use for training, and what loss function, batch size, etc.?

@LHH20000923 (Author)

I use CASIA-WebFace; the optimizer is SGD with lr=0.1 and batch size 256/512, and I didn't use knowledge distillation.
The loss curve is shown below. Is it a problem with the dataset?
[loss curve screenshot]

fdbtrs (Owner) commented Jan 26, 2022

For the CASIA dataset, you may need to train the model for at least 50 epochs with learning-rate steps at epochs 20, 30, and 40. The face recognition results reported in the paper are based on MS1MV2.
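
For reference, a minimal sketch of that schedule, assuming a standard PyTorch training setup; the function name `build_optimizer_and_scheduler`, the stand-in model, and the momentum/weight-decay values are illustrative assumptions, not values taken from this repo or thread:

```python
from torch import nn, optim
from torch.optim.lr_scheduler import MultiStepLR

def build_optimizer_and_scheduler(model: nn.Module):
    # SGD with lr=0.1 as in the question; momentum and weight decay are
    # common face-recognition defaults (assumptions, not from this thread).
    optimizer = optim.SGD(model.parameters(), lr=0.1,
                          momentum=0.9, weight_decay=5e-4)
    # Decay the learning rate by 10x at epochs 20, 30, and 40,
    # matching the suggested "lr step of 20, 30, 40" over ~50 epochs.
    scheduler = MultiStepLR(optimizer, milestones=[20, 30, 40], gamma=0.1)
    return optimizer, scheduler

# Toy usage: print the learning rate per epoch to check the decay points.
model = nn.Linear(512, 10575)  # stand-in; CASIA-WebFace has ~10,575 identities
optimizer, scheduler = build_optimizer_and_scheduler(model)
for epoch in range(50):
    # ... one training epoch over CASIA-WebFace would run here ...
    print(epoch, scheduler.get_last_lr()[0])
    scheduler.step()
```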
