Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Questions with batch normalization #14

Open
LuChengTHU opened this issue Mar 27, 2019 · 1 comment
Open

Questions with batch normalization #14

LuChengTHU opened this issue Mar 27, 2019 · 1 comment

Comments

@LuChengTHU
Copy link

LuChengTHU commented Mar 27, 2019

WechatIMG57

Hello, I just added BN in resblock and run 'python train.py configs/celebA-HQ', then it cannot converge at all.(see the picture below)

00006000

I noticed the moving average in your training, but I think its influence to BN is quite small. I cannot understand the reason why adding BN cannot converge.
Looking forward to your reply!

@LMescheder
Copy link
Owner

The moving average has a significant influence with batch normalization, as it simply ignores the learned weights from the BN when copying the model. Can you try to deactivate the moving average in the code and try again? Also note that our theoretical results are local in nature and we therefore cannot guarantee global convergence for every possible architecture you come up with :-) . However, from your image it appears that the algorithm is doing something sensible.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants