CGAN convergence time #31

oliverzhang42 · 2018-09-12T05:36:38Z

I've noticed an inefficiency in the CGAN code. When the one-hot encoded labels are appended to the image, they have a large influence on the training gradients. In my runs, scaling the one-hot encoded labels down by a factor of 0.01 or even 0.001 helped the CGAN converge around twice as fast.

That would mean changing the conv_cond_concat function in opts.py. My hack was to change

    return concat([x, y*tf.ones([x_shapes[0], x_shapes[1], x_shapes[2], y_shapes[3]])], 3)

to

    return concat([x, 0.001*y*tf.ones([x_shapes[0], x_shapes[1], x_shapes[2], y_shapes[3]])], 3)

and that worked well for me. I'm not sure how well it holds in general, though. Perhaps try adding batch norm?
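For anyone who wants to try the scaling without hard-coding the constant, here is a minimal sketch of the same idea. It uses NumPy rather than TensorFlow so it runs on its own, and the `label_scale` parameter is my own hypothetical addition, not something in the repo. It assumes `x` is laid out as (batch, height, width, channels) and `y` as (batch, 1, 1, num_classes), matching the shapes used in the line above.

```python
import numpy as np


def conv_cond_concat(x, y, label_scale=0.001):
    """Concatenate scaled one-hot labels onto the channel axis of x.

    x: feature maps, shape (batch, height, width, channels)
    y: one-hot labels, shape (batch, 1, 1, num_classes)
    label_scale: hypothetical knob; 1.0 gives the unscaled behaviour
    """
    batch, height, width, _ = x.shape
    num_classes = y.shape[3]
    # Broadcast each label vector over every spatial position, then scale it
    # so the label channels are small relative to the image channels.
    y_maps = label_scale * y * np.ones(
        (batch, height, width, num_classes), dtype=x.dtype
    )
    return np.concatenate([x, y_maps], axis=3)


# Two fake 4x4 RGB images with class labels 3 and 7 out of 10 classes.
x = np.random.rand(2, 4, 4, 3).astype(np.float32)
labels = np.eye(10, dtype=np.float32)[[3, 7]].reshape(2, 1, 1, 10)
out = conv_cond_concat(x, labels)
print(out.shape)
```

Passing `label_scale=1.0` gives the unscaled concatenation, which makes it easy to compare convergence between the two settings on the same data.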
