
acgan: Use Generator/Discriminator more closely resembling the ones from the referenced paper, and don't train discriminator to produce class labels for generated images #8482

Merged: 7 commits, Nov 25, 2017

Conversation

ozabluda
Contributor

@ozabluda ozabluda commented Nov 14, 2017

This depends on PR #8452, and should be merged after it.

Generator:

  1. Remove extra embedding layer.
  2. Remove extra 1x1 convolutions.
  3. Use transposed convolutions instead of NN upsample + conv (i.e., convolutions with fractional strides). Note that this corresponds to the paper, but runs contrary to the authors' own later advice in Deconvolution and Checkerboard Artifacts. Still, I found that the paper's network works better on MNIST in this example.

Discriminator:

  1. Use a LeakyReLU slope of 0.2 instead of the Keras default of 0.3 (a sketch of both changes follows below).
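
Purely as illustration, here is a minimal Keras sketch of the two architectural points above: transposed-convolution upsampling in the generator and a 0.2 LeakyReLU slope in the discriminator. It assumes the Keras 2 API; the layer sizes and depths are illustrative, not the exact values from this PR:

```python
from keras.layers import (Conv2D, Conv2DTranspose, Dense, Flatten,
                          LeakyReLU, Reshape)
from keras.models import Sequential

latent_size = 100

# Generator upsampling stage: transposed convolutions (convolutions
# with fractional strides, as in the AC-GAN paper) instead of
# UpSampling2D followed by Conv2D.
generator = Sequential([
    Dense(7 * 7 * 128, input_dim=latent_size, activation='relu'),
    Reshape((7, 7, 128)),
    Conv2DTranspose(64, 5, strides=2, padding='same',
                    activation='relu'),                 # 7x7 -> 14x14
    Conv2DTranspose(1, 5, strides=2, padding='same',
                    activation='tanh'),                 # 14x14 -> 28x28
])

# Discriminator front end: LeakyReLU with slope 0.2 instead of the
# Keras default of 0.3.
discriminator = Sequential([
    Conv2D(32, 3, strides=2, padding='same', input_shape=(28, 28, 1)),
    LeakyReLU(0.2),
    Flatten(),
    Dense(1, activation='sigmoid'),
])
```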

These changes:

  1. Made the generated images more diverse (for example, thick and thin digits appear in the same epoch; compare acgan: Use Generator/Discriminator more closely resembling the ones from the referenced paper, and don't train discriminator to produce class labels for generated images #8482 (comment) to acgan: Use same latent vector for all classes in a row #8409 (comment)).
  2. Tolerate harder soft labels (all the way to 1.0 works without generator collapse, though the results are not as good as with 0.9; see the sketch after this list).
  3. Reduced the number of trainable parameters in the generator by 65%, from 8,698,356 to 3,182,580.
  4. Reduced training time per epoch by 50%, from 90 sec to 45 sec on a Titan X (Maxwell).
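
For reference on point 2, a minimal sketch of the soft (one-sided smoothed) real/fake labels; the technique comes from PR #8383, and the specific values here are illustrative:

```python
import numpy as np

batch_size = 100
soft_one = 0.9  # "harder" labels push this toward 1.0

# Real/fake targets for a combined batch of [real, generated] images:
# real samples get the smoothed label, generated samples stay at 0.
y = np.array([soft_one] * batch_size + [0.0] * batch_size)
```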

@ozabluda
Contributor Author

ozabluda commented Nov 14, 2017

Below, the left image is generated and the right image is from the actual MNIST dataset (generated side by side with PR #8483). In the generated images, the latent vector is the same within each row (see PR #8409).

Epoch 32:
plot_epoch_032_generated

Epoch 45:
plot_epoch_045_generated

Epoch 48:
plot_epoch_048_generated

@ozabluda ozabluda changed the title Use Generator/Discriminator more closely resembling the ones from the referenced paper. acgan: Use Generator/Discriminator more closely resembling the ones from the referenced paper Nov 14, 2017
@fchollet
Member

@lukedeo if your time allows, could you check out this PR as well?

@lukedeo
Contributor

lukedeo commented Nov 20, 2017

@fchollet Overall, this looks good and is a welcome change.

@ozabluda you mention that you remove the extra embedding, but I don't see it removed here... the embedding wasn't mentioned in the paper, but I found the Hadamard-y interaction to be useful. Not sure what the intention was here.
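
For context, a minimal sketch of the embedding and Hadamard-style interaction under discussion, assuming the example's pattern of multiplying the Embedding output element-wise with the latent vector (names are illustrative):

```python
from keras.layers import Embedding, Flatten, Input, multiply

latent_size = 100
num_classes = 10

latent = Input(shape=(latent_size,))
image_class = Input(shape=(1,), dtype='int32')

# Map the class label to a dense vector the same size as the latent
# code...
cls = Flatten()(Embedding(num_classes, latent_size)(image_class))

# ...then combine it with the latent vector by element-wise
# (Hadamard) product.
h = multiply([latent, cls])
```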

@ozabluda
Contributor Author

@lukedeo
Contributor

lukedeo commented Nov 20, 2017

Ah, OK, I see. Sure, that extra hidden rep is extraneous, and removing it makes this more in line with the paper.

Re: #8452, how noticeable is the improvement in quality when the discriminator receives zero weight for the auxiliary task on generated samples? If we're going to go full paper-reproduction mode, my thought is to go all the way. I could be fairly easily convinced, though, as having the discriminator minimize classification accuracy on generated samples can lead to the generator not examining sections of the support where the likelihood ratio approaches a flat posterior.

@ozabluda
Contributor Author

A brief history:

Recently, the acgan example was broken, generating all-black images. The minimal first fix was to introduce soft real/fake labels (PR "acgan: Fix generator producing pure black images" #8383; see the generated examples there). This fix was either mandatory for the later fixes (to prevent all-black images) or produced better results. So far, all further fixes have made the soft labels harder and harder, which is evidence that they are good fixes.

The second fix was PR "acgan: don't train discriminator to produce class labels for generated images" #8452; see the generated examples there for how much more diverse they are than before. See #8452 (comment) for alternatives, and #8452 (comment) for the history of how it ended up in the paper and the justification for its removal from the example (initially I was also hesitant to remove it; see the second sentence here: #8452 (comment)). It's better to keep the discussion of that PR in that PR.

The third fix is this PR. It depends on the first and second.

Use same latent vector for all classes in a row
don't train discriminator to produce class labels for generated images
@ozabluda ozabluda changed the title acgan: Use Generator/Discriminator more closely resembling the ones from the referenced paper acgan: Use Generator/Discriminator more closely resembling the ones from the referenced paper, and don't train discriminator to produce class labels for generated images Nov 24, 2017
@ozabluda
Contributor Author

ozabluda commented Nov 24, 2017

The following PRs were merged into this PR:

acgan: Use same latent vector for all classes in a row #8409
acgan: don't train discriminator to produce class labels for generated images #8452

See #8409 (comment)

@@ -185,6 +182,13 @@ def build_discriminator():
num_batches = int(x_train.shape[0] / batch_size)
progress_bar = Progbar(target=num_batches)

# don't train discriminator to produce class labels for generated
# images. To preserve total weight of the auxilary classifier,
# take real image samples with weight 2.
Member

The meaning of this comment is not clear, please rephrase/clarify

# don't train discriminator to produce class labels for generated
# images. To preserve total weight of the auxilary classifier,
# take real image samples with weight 2.
disc_sample_weight = [np.ones(2 * batch_size, dtype=np.float32),
Member

This uses float32 for one but not the other; it's not really necessary for either (memory is not an issue, and they get cast anyway). In any case, it's better to be consistent.
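
For concreteness, a minimal sketch of the sample-weight construction under review, with a consistent (default float64) dtype for both arrays; it assumes the example's [real, generated] batch layout:

```python
import numpy as np

batch_size = 100

# Per-output sample weights for the discriminator:
# - real/fake output: every sample has weight 1;
# - auxiliary class output: generated samples get weight 0 (the
#   discriminator is not trained to classify them) and real samples
#   get weight 2, preserving the total weight of the classifier.
disc_sample_weight = [np.ones(2 * batch_size),
                      np.concatenate((2 * np.ones(batch_size),
                                      np.zeros(batch_size)))]
```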

@ozabluda
Contributor Author

Review comments addressed.

@fchollet fchollet (Member) left a comment

LGTM, thanks
