
(Image: Elvis)

Photos colorization

Python 2.7 / 3.6 · Spyder 4.1.3 · NumPy 1.12.1 · Matplotlib 2.1.2 · Keras 2.3.1 · TensorFlow 2.5.1 · scikit-image 0.16.2

The presented algorithm is able to colorize black-and-white photographs. The graph above shows the model architecture. The code is implemented with the Keras API on a TensorFlow backend. The resources that helped to establish this code are listed below; the main one was the deep colorization paper [1].
The main change to the structure of the model was swapping the feature-extractor model from Inception-ResNet-v2 to Xception. Training was done on the Places dataset (http://places.csail.mit.edu/). The images selected for testing were partly chosen at random and partly taken from the Flickr image dataset (https://www.flickr.com/photos/tags/dataset/). The network was trained and tested on a GPU.
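For illustration, a minimal sketch of how a pretrained Xception feature extractor can be created with keras.applications (the exact configuration used in main.py may differ):

from keras.applications.xception import Xception

# Pretrained Xception used as a fixed feature extractor; its prediction for a
# grayscale image replicated to three channels is fused with the encoder output.
transfer_learning_model = Xception(weights='imagenet', include_top=True)
transfer_learning_model.trainable = False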

Motivation

To practice deep learning in the Keras environment, transfer learning, and image processing, and to test the capabilities of modern algorithms on the demanding task of image colorization.

Installation

Python is a requirement (Python 3.3 or greater, or Python 2.7). The recommended environment is the Anaconda distribution, which installs Python and Spyder (https://www.anaconda.com/download/).

Installing dependencies
Dependencies can be installed with the pip command from the command line:

pip install -r requirements.txt

Installing Python libraries
Example commands for installing individual Python libraries:

pip install numpy
pip install matplotlib
pip install keras
pip install tensorflow
pip install scikit-image

Running code
Everything is executed from the file main.py. Go to the directory where the code was downloaded and run the command:

py main.py

An additional requirement is TensorFlow GPU support. The process of configuring it is described here.
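As a quick sanity check (not part of the repository code), TensorFlow can report whether it sees a GPU:

import tensorflow as tf

# An empty list means the CUDA/cuDNN setup was not picked up and training
# will fall back to the CPU.
print(tf.config.list_physical_devices('GPU'))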

Code examples

import numpy as np
from keras import backend
from keras.layers import Conv2D, Lambda
from skimage.color import rgb2lab, gray2rgb

def image_a_b_gen(generator, transfer_learning_generator, transfer_learning_model):
    # Yields ([Xception features, luminance], ab components) batches for training.
    for rgb_image, rgb_tl_image in zip(generator, transfer_learning_generator):
        lab_image = rgb2lab(rgb_image[0])
        luminance = lab_image[:, :, :, [0]]
        ab_components = lab_image[:, :, :, 1:] / 128  # scale ab channels to [-1, 1]
        tl_model_features = []
        lab_image_tl = rgb2lab(rgb_tl_image[0])
        luminance_tl = lab_image_tl[:, :, :, [0]]

        # Replicate each grayscale sample to three channels and extract Xception features.
        for sample in luminance_tl:
            sample = gray2rgb(sample)
            sample = sample.reshape((1, 331, 331, 3))
            embedding = transfer_learning_model.predict(sample)
            tl_model_features.append(embedding)

        # Collapse the per-sample predictions into a 2D (batch, features) array.
        tl_model_features = np.array(tl_model_features)
        tl_model_features_shape_2d = backend.int_shape(Lambda(lambda x: x[:, 0, :], dtype='float32')(tl_model_features))
        tl_model_features = tl_model_features.reshape(tl_model_features_shape_2d)
        yield ([tl_model_features, luminance], ab_components)

def build_encoder(encoder_input):
    # Encoder: stacked 3x3 convolutions; the strided layers halve the spatial
    # resolution three times (e.g. 256x256 -> 32x32) while increasing depth.
    encoder_output = Conv2D(64, (3, 3), activation='relu', padding='same', strides=2)(encoder_input)
    encoder_output = Conv2D(128, (3, 3), activation='relu', padding='same')(encoder_output)
    encoder_output = Conv2D(128, (3, 3), activation='relu', padding='same', strides=2)(encoder_output)
    encoder_output = Conv2D(256, (3, 3), activation='relu', padding='same')(encoder_output)
    encoder_output = Conv2D(256, (3, 3), activation='relu', padding='same', strides=2)(encoder_output)
    encoder_output = Conv2D(512, (3, 3), activation='relu', padding='same')(encoder_output)
    encoder_output = Conv2D(512, (3, 3), activation='relu', padding='same')(encoder_output)
    encoder_output = Conv2D(256, (3, 3), activation='relu', padding='same')(encoder_output)
    return encoder_output
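For context, a rough sketch of how the generator could be wired into training; the generator objects, the model, and the step counts below are placeholders rather than the exact values used in main.py:

# Hypothetical training call; `rgb_generator`, `tl_generator` and
# `colorization_model` stand in for the objects built in main.py.
colorization_model.compile(optimizer='adam', loss='mse')
colorization_model.fit_generator(
    image_a_b_gen(rgb_generator, tl_generator, transfer_learning_model),
    steps_per_epoch=200, epochs=30)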

Key Concepts

CIELab

Deep Learning

CNNs

Transfer Learning

Xception (https://arxiv.org/pdf/1610.02357.pdf)

cuDNN

Model architecture

The image beneath shows the implemented model architecture. It is based on an encoder-decoder network combined with a transfer-learning model, which is used as a feature extractor. The encoder-decoder structure is called an autoencoder. For images it is built from convolutions followed by further convolutions with upscaling. The convolutions that reduce the size of the image learn a latent representation of the grayscale image. The subsequent upscaling, done by the decoder, learns to reconstruct the color version of the photograph. In this case the transfer-learning model is Xception. The name Xception comes from an "extreme" version of Inception. In Inception, 1x1 convolutions were used to project the original input into several separate, smaller input spaces, and a different type of filter transformed each of those smaller 3D blocks of data. Xception goes further: instead of partitioning the input data into several compressed chunks, it maps the spatial correlations for each output channel separately, and then performs a 1x1 pointwise convolution to capture cross-channel correlations. This operation is known as a depthwise separable convolution [8].
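As a small illustration (not code from this repository), Keras exposes this building block directly as SeparableConv2D; compared with a regular Conv2D it needs far fewer parameters for the same output shape:

from keras.layers import Input, Conv2D, SeparableConv2D
from keras.models import Model

inputs = Input(shape=(64, 64, 3))
regular = Conv2D(32, (3, 3), padding='same')(inputs)             # 3*3*3*32 + 32 = 896 parameters
separable = SeparableConv2D(32, (3, 3), padding='same')(inputs)  # 3*3*3 + 3*32 + 32 = 155 parameters
Model(inputs, [regular, separable]).summary()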
The output of the Xception model is a 1000-element feature vector, which is replicated and merged with the output of the encoder. This fusion is followed by the decoder, which restores the input image size.
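A minimal sketch of this fusion step and decoder, assuming a 256x256 luminance input (so the encoder output is 32x32x256); the layer choices below are illustrative rather than copied from main.py:

from keras.layers import Input, Conv2D, UpSampling2D, RepeatVector, Reshape, concatenate
from keras.models import Model

embed_input = Input(shape=(1000,))          # Xception feature vector
encoder_input = Input(shape=(256, 256, 1))  # luminance channel

encoder_output = build_encoder(encoder_input)  # 32x32x256 for a 256x256 input

# Fusion: replicate the 1000-d embedding at every spatial position of the
# encoder output and concatenate it along the channel axis.
fusion = RepeatVector(32 * 32)(embed_input)
fusion = Reshape((32, 32, 1000))(fusion)
fusion = concatenate([encoder_output, fusion], axis=3)
fusion = Conv2D(256, (1, 1), activation='relu', padding='same')(fusion)

# Decoder: convolutions with upsampling restore the original resolution and
# predict the two ab chrominance channels (tanh matches the /128 scaling).
decoder = Conv2D(128, (3, 3), activation='relu', padding='same')(fusion)
decoder = UpSampling2D((2, 2))(decoder)
decoder = Conv2D(64, (3, 3), activation='relu', padding='same')(decoder)
decoder = UpSampling2D((2, 2))(decoder)
decoder = Conv2D(32, (3, 3), activation='relu', padding='same')(decoder)
decoder = Conv2D(2, (3, 3), activation='tanh', padding='same')(decoder)
decoder = UpSampling2D((2, 2))(decoder)

model = Model(inputs=[embed_input, encoder_input], outputs=decoder)

The input order [embed_input, encoder_input] matches the ([tl_model_features, luminance], ab_components) batches yielded by image_a_b_gen above.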

(Image: model architecture)

Results

Below, the images are presented in order from left to right:

  • the original image, which was the input to the algorithm;
  • the grayscale image, i.e. only the luminance component, one of the outcomes of the CIELab transformation (a short reconstruction sketch follows this list);
  • the result image, which is the output of the presented model.
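The grayscale and colorized columns can be reproduced with scikit-image; a minimal sketch, where the file names and the stand-in prediction are placeholders:

import numpy as np
from skimage.color import rgb2lab, lab2rgb
from skimage.io import imread, imsave

rgb = imread('input.jpg') / 255.0          # placeholder file name
lab = rgb2lab(rgb)

luminance = lab[:, :, 0]                   # grayscale image shown in the middle column

# Recombine the luminance with predicted ab components (model output, scaled
# back from [-1, 1] to the Lab range) to obtain the colorized result.
predicted_ab = np.zeros(lab[:, :, 1:].shape)   # stand-in for the model prediction
colorized = lab2rgb(np.dstack((luminance, predicted_ab * 128)))
imsave('result.jpg', (colorized * 255).astype(np.uint8))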

Resulting images for originally color photographs: Image6 Image12 Image13 Image16
The model applied to black-and-white photographs: Image17
Image26
The algorithm applied to a historical photo: Image21
A few photographs that show the drawbacks of the model: only parts of the image are colorized,
leaving large areas black and white, and there are mistakes in color selection for some elements.
The resulting images are also often faded: Image1
Image9
Image10

Summary

The created model works correctly, and surprisingly well given that it was trained on a computer with only one graphics processing unit. Analyzing the image results, one can conclude that the model has a tendency to leave images with faded colors. Another problem is that small objects are left uncolored. Possible solutions to these drawbacks could be training the model on a machine with more powerful GPUs, such as an Amazon EC2 p3.16xlarge instance. Furthermore, a good option could be adding class rebalancing based on probabilities, as in the paper [2].

Resources

[1] Federico Baldassarre, Diego González Morin, Lucas Rodés-Guirao, Deep Koalarization: Image Colorization using CNNs and Inception-Resnet-v2, (https://arxiv.org/abs/1712.03400)
[2] Richard Zhang, Phillip Isola, Alexei A. Efros, Colorful Image Colorization, (https://arxiv.org/abs/1603.08511)
[3] Gustav Larsson, Michael Maire, Gregory Shakhnarovich, Learning Representations for Automatic Colorization, (https://arxiv.org/abs/1603.06668)
[4] Satoshi Iizuka, Edgar Simo-Serra, Hiroshi Ishikawa, Let there be Color!: Joint End-to-end Learning of Global and Local Image Priors for Automatic Image Colorization with Simultaneous Classification, (https://www.researchgate.net/publication/305218105_Let_there_be_color_joint_end-to-end_learning_of_global_and_local_image_priors_for_automatic_image_colorization_with_simultaneous_classification)
[5] Dipanjan Sarkar, Raghav Bali, Tamoghna Ghosh, Hands-On Transfer Learning with Python: Implement advanced deep learning and neural network models using TensorFlow and Keras
[6] https://becominghuman.ai/auto-colorization-of-black-and-white-images-using-machine-learning-auto-encoders-technique-a213b47f7339
[7] https://fairyonice.github.io/Color-gray-scale-images-and-manga-using-deep-learning.html
[8] https://towardsdatascience.com/review-xception-with-depthwise-separable-convolution-better-than-inception-v3-image-dc967dd42568

About

A Keras repository that colorizes black-and-white images.
