Create vae_mnist_new_architecture.jl #487

Open: NikonPic wants to merge 3 commits into master
Conversation

NikonPic

Proposal for using the VAE MNIST example with the newer API from Knet. Type definitions allow increased performance (~20% faster due to lower GC time) and better readability of the network architecture.
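For context, the "newer API" refers to Knet's callable-struct style of defining models. A minimal sketch of that style is shown below; it follows Knet's own tutorial layers, and the names Dense and Chain are illustrative rather than taken from the PR's file:

```julia
using Knet

# Illustrative sketch of Knet's callable-struct ("newer API") style.
# A layer is a struct holding its parameters; calling the struct runs it.
struct Dense; w; b; f; end
Dense(i::Int, o::Int, f=relu) = Dense(param(o, i), param0(o), f)
(d::Dense)(x) = d.f.(d.w * mat(x) .+ d.b)

# A Chain simply applies its layers in sequence.
struct Chain; layers; Chain(layers...) = new(layers); end
(c::Chain)(x) = (for l in c.layers; x = l(x); end; x)
```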
```julia
function train(ae, dtrn, iters)
    img = convert(Atype, reshape(dtrn.x[:, 1], (28, 28, 1, 1)))
    for epoch = 1:iters
        @time adam!(ae, dtrn)
```
Collaborator
If I'm not wrong, this is not the correct way to iterate over epochs: each call to adam! creates a new Adam struct, so information from previous epochs (e.g. the accumulated moments) is lost.
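For reference, a minimal sketch of a loop that keeps a single optimizer state across all epochs, using Knet's iterator-style adam/progress! API; this is an illustration of the point above, not the fix that was actually committed:

```julia
using Knet

# Sketch only: the Adam iterator is constructed once over all epochs,
# so the accumulated first/second moments persist between epochs.
function train(ae, dtrn, iters)
    progress!(adam(ae, repeat(dtrn, iters)))
end
```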

Author

Oh yes, completely correct! Thanks for the advice, I have adapted my example proposal accordingly.

Added the proper training setup, a more detailed callback, and improved type definitions.

```julia
BCE = F(0)

for s = 1:samples
```
Collaborator

This is probably not efficient; you can run all samples at once with an additional "sample" batching dimension. First reshape μ to (nz, B, 1), then sample from randn with size (nz, B, Nsample) and broadcast μ onto it. Then change binary_cross_entropy to handle (nz, B, Nsample) input.
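A minimal sketch of that batched reparameterization, assuming μ and logσ of size (nz, B) coming from the encoder and the Atype array type used elsewhere in the example; the names Nsample and logσ are illustrative:

```julia
nz, B   = size(μ)
Nsample = 10                                    # illustrative sample count
μ3 = reshape(μ, nz, B, 1)                       # (nz, B, 1)
σ3 = reshape(exp.(logσ), nz, B, 1)              # (nz, B, 1)
ε  = convert(Atype, randn(Float32, nz, B, Nsample))
z  = μ3 .+ σ3 .* ε                              # broadcasts to (nz, B, Nsample)
# binary_cross_entropy (and the decoder) would then have to accept the extra
# sample dimension, e.g. by reshaping z to (nz, B * Nsample) before decoding
# and averaging the loss over the sample dimension afterwards.
```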

Author

Correct, the form was not efficient for sampling multiple times within one batch. The suggestion to broadcast is more efficient and much faster. However, I was not able to broadcast this efficiently through the decoder network. As the sampling doesn't increase performance as far as I can tell, and the majority of implementations I found do not use it, I have abandoned it for this example.

Multiple sampling has been removed as it is also not used in the original VAE approach.