The goal of this assignment is to train a color/texture transfer model using CycleGAN.
In this assignment, we use the iphone2dslr_flower
dataset, which is introduced in the CycleGAN paper. iphone2dslr_flower
contains iPhone and DSLR photos of flowers downloaded from Flickr. The main difference between these two classes is the depth of field in the images. Depth of field (DOF) is the distance between the nearest and the furthest objects that are in acceptably sharp focus in an image. We summarize the total number of examples and the DOF type of each class in the table below.
class | iPhone | DSLR |
---|---|---|
DOF | Shallow | Deep |
examples | 1813 | 3316 |
The CycleGAN paper introduces two cycle-consistency losses that capture the intuition that if we translate an image from one domain to the other and back again, we should arrive back where we started. More specifically, the two losses are the forward cycle-consistency loss and the backward cycle-consistency loss:
- forward cycle-consistency loss: x → G(x) → F(G(x)) ≈ x
- backward cycle-consistency loss: y → F(y) → G(F(y)) ≈ y
In the original paper, the total cycle loss is defined as:

`loss = self.lambda1*forward_loss + self.lambda2*backward_loss`

which is a weighted sum of the two L1 losses (`forward_loss` and `backward_loss`).
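The total cycle loss above can be sketched in PyTorch as follows. Here `G` (iPhone → DSLR) and `F` (DSLR → iPhone) stand in for the two generators, and `lambda1`/`lambda2` mirror the weights in the snippet above; the function names and default values are illustrative, not the assignment's actual implementation.

```python
import torch
import torch.nn as nn

def cycle_loss(real_x, real_y, G, F, lambda1=10.0, lambda2=10.0):
    """Total cycle-consistency loss: a weighted sum of two L1 terms."""
    l1 = nn.L1Loss()
    # forward cycle: x -> G(x) -> F(G(x)) should reconstruct x
    forward_loss = l1(F(G(real_x)), real_x)
    # backward cycle: y -> F(y) -> G(F(y)) should reconstruct y
    backward_loss = l1(G(F(real_y)), real_y)
    return lambda1 * forward_loss + lambda2 * backward_loss
```

If both generators were perfect inverses of each other, both L1 terms would vanish and the loss would be zero.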
Apart from the cycle loss, the paper also introduces an identity loss. The intuition behind the identity loss is to encourage the mapping to preserve the color composition between the input and output.
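A minimal sketch of the identity loss, under the same assumed generator naming as above (`G`: iPhone → DSLR, `F`: DSLR → iPhone; `lambda_idt` is a hypothetical weight): each generator, when fed an image already from its *target* domain, should leave it roughly unchanged.

```python
import torch
import torch.nn as nn

def identity_loss(real_x, real_y, G, F, lambda_idt=5.0):
    """Identity loss: each generator should act as the identity
    on images that already belong to its output domain."""
    l1 = nn.L1Loss()
    # G outputs DSLR-style images, so G(y) should stay close to y;
    # F outputs iPhone-style images, so F(x) should stay close to x.
    return lambda_idt * (l1(G(real_y), real_y) + l1(F(real_x), real_x))
```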
We use our personal images (e.g., taken with our iPhones) as inputs. As shown in the table below, our model can learn to generate photos in the style of both DSLR and iPhone.
DSLR ← iPhone | iPhone ← DSLR |
---|---|
We compare our result with Color Transfer between Images [Reinhard et al., 2001]. The authors work in the L*a*b* color space and match the mean and standard deviation of each of the L*, a*, and b* channels to transfer color between two images. We use the OpenCV implementation provided here.
Source (iPhone) | Target (DSLR) | Results |
---|---|---|
Source (DSLR) | Target (iPhone) | Results |
---|---|---|
Our CycleGAN model shows significantly better results compared to color transfer. Unlike color transfer, which only matches the statistics of a single selected target image, our CycleGAN model learns to mimic the style of an entire collection of images.