
KLD calculation #1

Open
dksifoua opened this issue Apr 13, 2020 · 3 comments
dksifoua commented Apr 13, 2020

Hi,

I think there's an error in your KLD calculation.

This is what you wrote:

# see Appendix B from VAE paper:
# Kingma and Welling. Auto-Encoding Variational Bayes. ICLR, 2014
# https://arxiv.org/abs/1312.6114
# 0.5 * sum(1 + log(sigma^2) - mu^2 - sigma^2)
KLD = -0.5 * torch.mean(torch.mean(1 + logvar - mu.pow(2) - logvar.exp(), 1))

Instead of what I think it should be:

KLD = -0.5 * torch.mean(torch.sum(1 + logvar - mu.pow(2) - logvar.exp(), 1))

Let me know if I'm right.
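
For reference, the two reductions differ only by a constant factor equal to the latent dimension; a quick sketch with made-up shapes (batch size 8, latent dimension 20 are assumptions, not values from the repo):

import torch

# hypothetical batch of latent statistics: batch_size=8, latent_dim=20
mu = torch.randn(8, 20)
logvar = torch.randn(8, 20)
inner = 1 + logvar - mu.pow(2) - logvar.exp()  # shape (8, 20)

# original: mean over latent dims, then mean over the batch
kld_mean = -0.5 * torch.mean(torch.mean(inner, 1))
# proposed: sum over latent dims (per-sample KL), then mean over the batch
kld_sum = -0.5 * torch.mean(torch.sum(inner, 1))

print(kld_sum / kld_mean)  # ~20.0, i.e. exactly the latent dimension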

Also, could you explain why you multiply the KLD by 0.1?
Is that the same as multiplying the BCE by a big number, say 1000 for example?


botkevin commented Dec 2, 2020

Pretty sure he multiplies the KLD by 0.1 because that is his KLD weight hyperparameter.
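
A weighted loss of that form might look like the following sketch; the names (vae_loss, kld_weight) are illustrative, not taken from the repo:

import torch
import torch.nn.functional as F

def vae_loss(recon_x, x, mu, logvar, kld_weight=0.1):
    # reconstruction term (BCE), averaged over the batch
    bce = F.binary_cross_entropy(recon_x, x, reduction='mean')
    # KL divergence to N(0, I): sum over latent dims, mean over the batch
    kld = -0.5 * torch.mean(torch.sum(1 + logvar - mu.pow(2) - logvar.exp(), 1))
    # kld_weight < 1 down-weights the KL term relative to the reconstruction
    return bce + kld_weight * kld

Note that multiplying the BCE by 10 instead would just scale this whole loss by 10 with the same relative weighting of the two terms; multiplying it by 1000 would weight reconstruction much more heavily than 0.1 on the KLD does.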


botkevin commented Dec 2, 2020

Also, while working on a VAE that I wrote based on this: if I change the offending mean to a sum, my reconstruction loss is much higher (about 2x), while my kl_loss starts out similar but decreases faster with torch.mean, and all my reconstructed images are basically the same blurry image. I have no idea why this would change the reconstruction so much... I will have to do some investigation.


botkevin commented Dec 3, 2020

Mean is equivalent to sum up to a scalar factor. Normally, using Adam, if we don't have a composite loss, this scale doesn't matter, so changing the sum to a mean should backpropagate the same. I changed the mean to a sum and decreased the KLD weight, which fixed my problem. Basically, when I changed the mean to a sum, I put too much weight on the KL term and caused the latent distributions to be bound too strongly to the standard Gaussian.
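
A minimal sketch of that rebalancing (the latent dimension here is an assumed value, not from the repo):

# assuming a 20-dimensional latent space
latent_dim = 20

# the sum-reduced KLD is latent_dim times larger than the mean-reduced one,
# so a weight tuned for the mean needs to shrink by the same factor
kld_weight_mean = 0.1
kld_weight_sum = kld_weight_mean / latent_dim  # ~0.005, keeps the same balance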
