tensorflow-fcn

This is an implementation of Fully Convolutional Networks (FCN) in TensorFlow. The network can be applied directly or finetuned using TensorFlow training code.

Deconvolution layers are initialized as bilinear upsampling. Conv and FCN layer weights are initialized using VGG weights. The VGG weights are read with numpy load, so neither Caffe nor Caffe-Tensorflow is required to run this. The .npy file for VGG16, however, needs to be downloaded before using this network.
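To illustrate what "initialized as bilinear upsampling" means, here is a minimal numpy sketch of a bilinear kernel and of deconvolution weights built from it. The function names, the [height, width, out_channels, in_channels] weight layout, and the kernel size of 64 are assumptions made for this illustration; the code in this repository may differ in its details.

```python
import numpy as np


def bilinear_kernel(size):
    """Return a (size, size) bilinear interpolation kernel."""
    factor = (size + 1) // 2
    # The kernel centre lies on a pixel for odd sizes and between pixels for even sizes.
    center = factor - 1 if size % 2 == 1 else factor - 0.5
    rows, cols = np.ogrid[:size, :size]
    return (1 - abs(rows - center) / factor) * (1 - abs(cols - center) / factor)


def bilinear_deconv_weights(size, num_classes):
    """Build weights of shape [size, size, num_classes, num_classes] (assumed layout).

    Each class channel is upsampled independently, so only the filters that
    map channel i to channel i are non-zero.
    """
    weights = np.zeros((size, size, num_classes, num_classes), dtype=np.float32)
    kernel = bilinear_kernel(size)
    for i in range(num_classes):
        weights[:, :, i, i] = kernel
    return weights


kernel = bilinear_kernel(4)  # a 4x4 kernel, as used for 2x upsampling
weights = bilinear_deconv_weights(64, num_classes=20)  # hypothetical sizes
```

With such weights a transposed convolution starts out as plain bilinear interpolation of the score map, and finetuning can adjust it from there.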

No Pascal VOC finetuning was applied to the weights. The model is meant to be finetuned on your own data. The model can be applied to an image directly (see test_fcn32_vgg.py) but the result will be rather coarse.

Usage

Run python test_fcn32_vgg.py to test the implementation.

Use this to build the VGG object for finetuning:

vgg = vgg16.Vgg16()
vgg.build(images, train=True, num_classes=num_classes, random_init_fc8=True)

images is a tensor with shape [None, h, w, 3], where h and w can have arbitrary size.

Trick: the tensor can be a placeholder, a variable or even a constant.
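As a small illustration of the expected [None, h, w, 3] layout, the following numpy sketch turns a single image into a batch of size one. The random array is a hypothetical stand-in for a decoded RGB image; the value range and any preprocessing the model expects are not covered here.

```python
import numpy as np

# Hypothetical stand-in for a decoded RGB image of arbitrary size (h x w x 3, uint8).
image = np.random.randint(0, 256, size=(375, 500, 3), dtype=np.uint8)

# Convert to float32 and add the leading batch dimension: shape becomes [1, h, w, 3].
batch = np.expand_dims(image.astype(np.float32), axis=0)
```

An array like this could be wrapped in a constant or fed to a placeholder.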

Be aware that num_classes influences the way score_fr (the original fc8 layer) is initialized. For finetuning I recommend using the option random_init_fc8=True.

Finetuning and training

For training, build the graph using vgg.build(images, train=True, num_classes=num_classes), where images is a queue yielding image batches. Use a softmax_cross_entropy loss function on top of the output of vgg.up. An implementation of the loss function can be found in loss.py.
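For readers unfamiliar with per-pixel softmax cross-entropy, here is a minimal numpy sketch of the idea. It is not the code from loss.py; the function name and the one-hot label format are assumptions made for this illustration.

```python
import numpy as np


def softmax_cross_entropy(logits, labels):
    """Mean per-pixel softmax cross-entropy.

    logits: float array of shape [batch, h, w, num_classes]
    labels: one-hot array of the same shape (assumed format)
    """
    num_classes = logits.shape[-1]
    # Treat every pixel as an independent classification example.
    logits = logits.reshape(-1, num_classes)
    labels = labels.reshape(-1, num_classes)
    # Subtract the row maximum for numerical stability before exponentiating.
    shifted = logits - logits.max(axis=1, keepdims=True)
    log_softmax = shifted - np.log(np.exp(shifted).sum(axis=1, keepdims=True))
    return float(-(labels * log_softmax).sum(axis=1).mean())


# Toy example: a 2x2 image, 3 classes, uniform (all-zero) logits.
logits = np.zeros((1, 2, 2, 3))
labels = np.eye(3)[np.array([[[0, 1], [2, 0]]])]  # one-hot, shape (1, 2, 2, 3)
loss = softmax_cross_entropy(logits, labels)  # equals log(3) for uniform predictions
```

The key step is flattening the spatial dimensions so that the loss is averaged over all pixels in the batch.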

To train the graph you need an input producer and a training script. Have a look at TensorVision to see how to build those.

I had success finetuning the network using the Adam optimizer with a learning rate of 1e-6.
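To give a feeling for how small a learning rate of 1e-6 is, here is a minimal numpy sketch of a single Adam update. The function name and the toy parameters are hypothetical, and the beta and epsilon values are the commonly used Adam defaults rather than settings taken from this repository.

```python
import numpy as np


def adam_step(param, grad, m, v, t, lr=1e-6, beta1=0.9, beta2=0.999, eps=1e-8):
    """Apply one Adam update and return the new parameter and moment estimates."""
    m = beta1 * m + (1 - beta1) * grad       # first moment (running mean of gradients)
    v = beta2 * v + (1 - beta2) * grad ** 2  # second moment (running mean of squares)
    m_hat = m / (1 - beta1 ** t)             # bias correction for step t (1-based)
    v_hat = v / (1 - beta2 ** t)
    param = param - lr * m_hat / (np.sqrt(v_hat) + eps)
    return param, m, v


param = np.array([0.5, -0.3])
grad = np.array([10.0, -0.01])  # very different gradient magnitudes
new_param, m, v = adam_step(param, grad, np.zeros(2), np.zeros(2), t=1)
step = new_param - param
```

On the first step each parameter moves by roughly the learning rate regardless of the gradient magnitude, so the pretrained VGG weights change only very gradually.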

Content

Currently the following models are provided:

FCN32
FCN16
FCN8

Remark

The deconv layer of TensorFlow allows the output shape to be specified. The crop layer of the original implementation is therefore not needed.
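The following sketch shows why a crop would otherwise be needed. It assumes that all convolutions and the five 2x2 pooling layers use 'SAME' padding, so that each pooling halves the size and rounds up; the input size and the helper name are hypothetical.

```python
import math


def downsampled_size(size, num_pools=5):
    """Spatial size after num_pools stride-2 poolings with 'SAME' padding (assumed)."""
    for _ in range(num_pools):
        size = math.ceil(size / 2)
    return size


h, w = 375, 500  # hypothetical input size
score_shape = (downsampled_size(h), downsampled_size(w))

# A plain 32x upsampling of the score map overshoots the input size ...
naive_upsampled = (score_shape[0] * 32, score_shape[1] * 32)

# ... whereas the requested output shape is simply the input size itself.
requested_output = (h, w)
```

Here the score map is 12 x 16, a plain 32x upsampling would give 384 x 512, and requesting 375 x 500 directly (for example through the output_shape argument of tf.nn.conv2d_transpose) makes a separate crop unnecessary.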

I have slightly altered the naming of the upscore layer.

Field of View

The receptive field (also known as field of view) of the provided model is:

( ( ( ( ( 7 ) * 2 + 6 ) * 2 + 6 ) * 2 + 6 ) * 2 + 4 ) * 2 + 4 = 404
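The same number can be reproduced programmatically. The sketch below assumes the standard VGG16 layout (3x3 convolutions with stride 1, 2x2 poolings with stride 2, and fc6 applied as a 7x7 convolution) and walks backwards through the layers using the recurrence rf_in = (rf_out - 1) * stride + kernel. The layer list and function name are written for this illustration.

```python
CONV3 = (3, 1)  # (kernel, stride) of a 3x3 convolution
POOL = (2, 2)   # (kernel, stride) of a 2x2 max pooling
FC6 = (7, 1)    # fc6 applied as a 7x7 convolution

# Assumed VGG16 layout: 2, 2, 3, 3, 3 convolutions, each block followed by a pooling.
layers = []
for num_convs in (2, 2, 3, 3, 3):
    layers += [CONV3] * num_convs + [POOL]
layers.append(FC6)
# fc7 and score_fr are 1x1 convolutions and do not change the receptive field.


def receptive_field(layers):
    """Receptive field of one output unit, computed from the last layer backwards."""
    rf = 1
    for kernel, stride in reversed(layers):
        rf = (rf - 1) * stride + kernel
    return rf


print(receptive_field(layers))  # prints 404
```

Each pooling doubles the receptive field and each 3x3 convolution adds 2, which gives the +6 and +4 terms in the formula above.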

Predecessors

Weights were generated using Caffe to Tensorflow. The VGG implementation is based on tensorflow-vgg16 and numpy loading is based on tensorflow-vgg. You do not need any of the above-cited code to run the model, nor do you need Caffe.

Install

Installing matplotlib from pip requires the following packages to be installed: libpng-dev, libjpeg8-dev, libfreetype6-dev and pkg-config. On Debian, Linux Mint and Ubuntu systems, type:

sudo apt-get install libpng-dev libjpeg8-dev libfreetype6-dev pkg-config
pip install -r requirements.txt

TODO

Provide finetuned FCN weights.
Provide general training code.
