Understanding the deconvolution in FCN-32. #4952

warmspringwinds · 2016-11-05T20:07:43Z

Hello,

I am trying to understand the design of the FCN-32 model and especially the parameters of the
deconvolutional layer (convolution transposed).

Specifically, why the stride was chosen to be 32 and the kernel size 64.

So, for example if the input image is of the size 768 by 1024.
After the input is processed by all pooling layers we get 24 by 32 subsampled predictions.

Then the goal is basically to go from those subsampled predictions back to input image size.
Using the equation from the chapter No zero padding, non-unit strides, transposed from here and using stride 32 and kernel 64, I get output of size 800 by
1056. Is it how it is actually done in the current implementation?
I understand that after that we can just crop those to original input size.

My main question is: how did the authors come up with stride 32 and kernel 64 parameters?
I know that after all pooling layers the input gets downsampled by 32 but why the kernel size is 64?
Is it due to the fact that in the paper they initialize the filters to bilinear interpolation filter and wanted the kernel to capture the 4 closest points?

Sorry for posting it here. I just couldn't find the answer for the question in the paper or somewhere else.

The text was updated successfully, but these errors were encountered:

williford · 2016-11-08T15:03:54Z

Questions like these should be asked on the Caffe user mailing list or on another website, such as http://stackoverflow.com/ or http://stats.stackexchange.com/.

shelhamer · 2016-11-09T20:56:27Z

From https://github.com/BVLC/caffe/blob/master/CONTRIBUTING.md:

Please do not post usage, installation, or modeling questions, or other requests for help to Issues.
Use the caffe-users list instead. This helps developers maintain a clear, uncluttered, and efficient view of the state of Caffe.

warmspringwinds · 2016-11-24T01:26:32Z

If someone still confused with this, I found an answer:
http://warmspringwinds.github.io/tensorflow/tf-slim/2016/11/22/upsampling-and-image-segmentation-with-tensorflow-and-tf-slim/

dongzhuoyao · 2016-12-21T02:54:09Z

@warmspringwinds thank you very much!

shelhamer closed this as completed Nov 9, 2016

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Understanding the deconvolution in FCN-32. #4952

Understanding the deconvolution in FCN-32. #4952

warmspringwinds commented Nov 5, 2016

williford commented Nov 8, 2016

shelhamer commented Nov 9, 2016

warmspringwinds commented Nov 24, 2016

dongzhuoyao commented Dec 21, 2016

Understanding the deconvolution in FCN-32. #4952

Understanding the deconvolution in FCN-32. #4952

Comments

warmspringwinds commented Nov 5, 2016

williford commented Nov 8, 2016

shelhamer commented Nov 9, 2016

warmspringwinds commented Nov 24, 2016

dongzhuoyao commented Dec 21, 2016