
Chapter 4 DCGAN with tf.keras could not produce the same results #20

Open
Nevermetyou65 opened this issue Oct 22, 2021 · 3 comments

@Nevermetyou65

Hi, I am reading chapter 4 of this book and there seems to be a problem.
The code in this book uses standalone Keras, but I prefer to use tf.keras, which should not behave differently.
I implemented the chapter 4 code with tf.keras and got strange results: the discriminator and generator losses approached 0, the accuracy approached 1, and the image grid was just noise. But when I removed the BatchNormalization layers from both the generator and the discriminator, I got fine fake digit images. Any idea why?

this is the colab containing the code
https://colab.research.google.com/drive/1TF-nkPPkj0HAzKceb3UL_AzSdb-0DjKD?usp=sharing

@mjzalewski

I also had the same problem. When I removed all the BatchNormalization layers from both the discriminator and the generator, the problem got even worse: the generator loss was high and the images were just blobs.

I had success when I removed the BatchNormalization from the discriminator only.

I suspect the discriminator is training too quickly relative to the generator. Removing BatchNormalization from the discriminator slows its training down, bringing it closer to the generator's training rate.
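Another way to slow the discriminator relative to the generator, without touching the architecture, is to give the two models different learning rates (the learning rates below are hypothetical, not the book's values):

```python
from tensorflow.keras.optimizers import Adam

# Hypothetical two-timescale setup: the discriminator learns at a
# lower rate than the generator
d_opt = Adam(learning_rate=1e-4, beta_1=0.5)
g_opt = Adam(learning_rate=4e-4, beta_1=0.5)

# These would then be passed to the respective compile() calls:
# discriminator.compile(loss='binary_crossentropy', optimizer=d_opt, ...)
# gan.compile(loss='binary_crossentropy', optimizer=g_opt)
```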

@bladebump

I also had the same problem. I changed the model to use max pooling and removed BatchNormalization. It works, but the results are not good.

@marckolak

I had the same problem.
I removed BatchNormalization and the tanh activation layer, and I added a Dropout layer in the discriminator to avoid overfitting.
Here are the modified models for reference:

from tensorflow.keras.models import Sequential
from tensorflow.keras.layers import (Input, Dense, Reshape, Conv2D,
                                     Conv2DTranspose, LeakyReLU, Dropout,
                                     Flatten)

def build_generator(img_shape, z_dim):
    model = Sequential()
    model.add(Input(shape=(z_dim,)))  # shape must be a tuple, not an int

    # Project the noise vector and reshape it into a 7x7x256 tensor
    model.add(Dense(256 * 7 * 7))
    model.add(Reshape((7, 7, 256)))

    # Upsample: 7x7 -> 14x14
    model.add(Conv2DTranspose(128, kernel_size=3, strides=2, padding='same'))
    model.add(LeakyReLU(alpha=0.01))

    # 14x14 -> 14x14
    model.add(Conv2DTranspose(64, kernel_size=3, strides=1, padding='same'))
    model.add(LeakyReLU(alpha=0.01))

    # Upsample to the output image: 14x14 -> 28x28 (tanh removed)
    model.add(Conv2DTranspose(1, kernel_size=3, strides=2, padding='same'))

    return model

def build_discriminator(img_shape):
    model = Sequential()
    model.add(Input(shape=img_shape))

    # Downsample: 28x28 -> 14x14
    model.add(Conv2D(32, kernel_size=3, strides=2, padding='same'))
    model.add(LeakyReLU(alpha=0.01))

    # 14x14 -> 7x7
    model.add(Conv2D(64, kernel_size=3, strides=2, padding='same'))
    model.add(LeakyReLU(alpha=0.01))

    # 7x7 -> 4x4, with Dropout against overfitting
    model.add(Conv2D(128, kernel_size=3, strides=2, padding='same'))
    model.add(LeakyReLU(alpha=0.01))
    model.add(Dropout(0.4))

    model.add(Flatten())
    model.add(Dense(1, activation='sigmoid'))

    return model

The results are much better.
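For a quick sanity check, the two models can be wired together and run on random noise (self-contained copies of the models above, assuming z_dim = 100 and 28x28x1 MNIST-shaped images):

```python
import numpy as np
from tensorflow.keras.models import Sequential
from tensorflow.keras.layers import (Input, Dense, Reshape, Conv2D,
                                     Conv2DTranspose, LeakyReLU, Dropout,
                                     Flatten)

def build_generator(img_shape, z_dim):
    model = Sequential()
    model.add(Input(shape=(z_dim,)))
    model.add(Dense(256 * 7 * 7))
    model.add(Reshape((7, 7, 256)))
    model.add(Conv2DTranspose(128, kernel_size=3, strides=2, padding='same'))
    model.add(LeakyReLU(alpha=0.01))
    model.add(Conv2DTranspose(64, kernel_size=3, strides=1, padding='same'))
    model.add(LeakyReLU(alpha=0.01))
    model.add(Conv2DTranspose(1, kernel_size=3, strides=2, padding='same'))
    return model

def build_discriminator(img_shape):
    model = Sequential()
    model.add(Input(shape=img_shape))
    model.add(Conv2D(32, kernel_size=3, strides=2, padding='same'))
    model.add(LeakyReLU(alpha=0.01))
    model.add(Conv2D(64, kernel_size=3, strides=2, padding='same'))
    model.add(LeakyReLU(alpha=0.01))
    model.add(Conv2D(128, kernel_size=3, strides=2, padding='same'))
    model.add(LeakyReLU(alpha=0.01))
    model.add(Dropout(0.4))
    model.add(Flatten())
    model.add(Dense(1, activation='sigmoid'))
    return model

generator = build_generator((28, 28, 1), 100)
discriminator = build_discriminator((28, 28, 1))

# Generate a batch of fake images from noise and score them
z = np.random.normal(size=(16, 100)).astype('float32')
fake = generator.predict(z, verbose=0)
scores = discriminator.predict(fake, verbose=0)
print(fake.shape, scores.shape)  # (16, 28, 28, 1) (16, 1)
```

The transposed-convolution strides (2, 1, 2) take the 7x7 projection up to the 28x28 image size, and the sigmoid output stays in [0, 1].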
