Image translation by CycleGAN and pix2pix in Tensorflow

This is my ongoing tensorflow implementation for unpaired image-to-image translation (Zhu et al., 2017).

Latest results can be found here, comparing paired and unpaired image-to-image translation.

Image-to-image translation learns a mapping from input images to output images, like these examples from the original papers:

CycleGAN: [Project] [Paper] [Torch]

Pix2pix: [Project] [Paper] [Torch]

Prerequisites

Linux or OSX.
Python 2 or Python 3.
CPU or NVIDIA GPU + CUDA CuDNN.

Requirements

Tensorflow 1.0

Preferred

Anaconda Python distribution
PyCharm

Getting Started

Clone this repository

git clone https://github.com/tbullmann/imagetranslation-tensorflow.git
cd imagetranslation-tensorflow

Install Tensorflow, e.g. with Anaconda

Create directories or symlink

mkdir datasets  # or symlink; for datasets
mkdir temp  # or symlink; for checkpoints, test results

Download the CMP Facades dataset (generated from http://cmp.felk.cvut.cz/~tylecr1/facade/)

python tools/download-dataset.py facades datasets

Train the model (this may take 1-8 hours depending on GPU, on CPU you will be waiting for a bit)

python translate.py \
  --model pix2pix \
  --mode train \
  --output_dir temp/facades_train \
  --max_epochs 200 \
  --input_dir datasets/facades/train \
  --which_direction BtoA

Test the model

python translate.py \
  --model pix2pix \
  --mode test \
  --output_dir temp/facades_test \
  --input_dir datasets/facades/val \
  --checkpoint temp/facades_train

The test run will output an HTML file at temp/facades_test/index.html that shows input/output/target image sets.

For training of the CycleGAN use --model CycleGAN instead of --model pix2pix. Both models use u-net as generator by default but can use faststyle-net when specified by --generator faststyle.

You can look at the loss and computation graph for pix2pix and CycleGAN using tensorboard:

tensorboard --logdir=temp/facades_train

If you wish to write in-progress pictures as the network is training, use --display_freq 50. This will update temp/facades_train/index.html every 50 steps with the current training inputs and outputs.

TODO

Finish CycleGAN implementation according to publication Hu et al., 2017

Major issues

test u-net declaration with decoder using encoder dimensions (fix crash when height and width other than powers of 2)
test other datasets, show results on README.md

Minor issues

add image buffer that stores the previous image (to update discriminators using a history of 50 generated images)
add instance normalization (Ulyanov D et al., 2016)
flexible learning rate for the Adams solver
add one-direction test mode for CycleGAN
add identity loss

Done

test CycleGAN with u-net generator and log loss and compare with pix2pix: OK
test CycleGAN with faststyle-net generator and log loss: OK
square loss and several options for loss function for generator (maximising discriminator loss, ...)
refactor summary and export of images to work for all models: Pix2Pix, CycleGAN, Pix2Pix2
two batches delivering unpaired images for CycleGAN
import of images from different subdirectories
different (classic) loss function
recursive implementations for u-net
res-net, highway-net, dense-net implementation with endcoder/decoder as in faststyle res-net
tested transfer of generators from paired to unpaired

Acknowledgement

This repository is based on this Tensorflow implementation of the paired image-to-image translation (Isola et al., 2016) Highway and dense net were adapted from the implementation exemplified in this blog entry.

Citation

If you use this code for your research, please cite the papers this code is based on:

@article{pix2pix2016,
  title={Image-to-Image Translation with Conditional Adversarial Networks},
  author={Isola, Phillip and Zhu, Jun-Yan and Zhou, Tinghui and Efros, Alexei A},
  journal={arXiv preprint arXiv:1611.07004v1},
  year={2016}
}

@article{CycleGAN2017,
  title={Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networkss},
  author={Zhu, Jun-Yan and Park, Taesung and Isola, Phillip and Efros, Alexei A},
  journal={arXiv preprint arXiv:1703.10593},
  year={2017}
}

@inproceedings{johnson2016perceptual,
  title={Perceptual losses for real-time style transfer and super-resolution},
  author={Johnson, Justin and Alahi, Alexandre and Fei-Fei, Li},
  booktitle={European Conference on Computer Vision},
  pages={694--711},
  year={2016},
  organization={Springer}
}

@article{He2016identity,
  title={Identity Mappings in Deep Residual Networks},
  author={Kaiming He and Xiangyu Zhang and Shaoqing Ren and Jian Sun},
  journal={arXiv preprint arXiv:1603.05027},
  year={2016}}

@article{Srivastava2015highway,
  title={Highway Networks},
  author={Rupesh Kumar Srivastava and Klaus Greff and J{\"{u}}rgen Schmidhuber},
  journal={arXiv preprint arXiv:1505.00387},
  year={2015}
}

@article{Huang2016dense,
  title={Densely Connected Convolutional Networks},
  author={Gao Huang and Zhuang Liu and Kilian Q. Weinberger},
  journal={arXiv preprint arXiv:1608.06993},
  year={2016}
}

Name		Name	Last commit message	Last commit date
Latest commit History 50 Commits
docs		docs
tools		tools
.gitignore		.gitignore
LICENSE.txt		LICENSE.txt
README.md		README.md
__init__.py		__init__.py
translate.py		translate.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

docs

docs

tools

tools

.gitignore

.gitignore

LICENSE.txt

LICENSE.txt

README.md

README.md

init.py

init.py

translate.py

translate.py

Repository files navigation

Image translation by CycleGAN and pix2pix in Tensorflow

CycleGAN: [Project] [Paper] [Torch]

Pix2pix: [Project] [Paper] [Torch]

Prerequisites

Requirements

Preferred

Getting Started

TODO

Finish CycleGAN implementation according to publication Hu et al., 2017

Done

Acknowledgement

Citation

About

Releases

Packages

Contributors 4

Languages

License

tbullmann/imagetranslation-tensorflow

Folders and files

Latest commit

History

Repository files navigation

Image translation by CycleGAN and pix2pix in Tensorflow

CycleGAN: [Project] [Paper] [Torch]

Pix2pix: [Project] [Paper] [Torch]

Prerequisites

Requirements

Preferred

Getting Started

TODO

Finish CycleGAN implementation according to publication Hu et al., 2017

Done

Acknowledgement

Citation

About

Resources

License

Stars

Watchers

Forks

Languages