Deep-SR-ITM

This is the official repository of Deep SR-ITM (ICCV2019).

We provide the training and test code, along with the trained weights and the dataset (train+test) used for Deep SR-ITM. Our paper was accepted for oral presentation at ICCV 2019. If you find this repository useful, please consider citing our paper.

Reference:

Soo Ye Kim, Jihyong Oh, Munchurl Kim, "Deep SR-ITM: Joint Learning of Super-Resolution and Inverse Tone-Mapping for 4K UHD HDR Applications", IEEE International Conference on Computer Vision, 2019.

Bibtex:

@inproceedings{kim2019deep,
      title = {Deep SR-ITM: Joint Learning of Super-Resolution and Inverse Tone-Mapping for 4K UHD HDR Applications}, 
      author = {Kim, Soo Ye and Oh, Jihyong and Kim, Munchurl},
      booktitle = {Proceedings of the IEEE International Conference on Computer Vision},
      year = {2019}
}

Supplementary Material is provided here.
Presentation material at ICCV: Poster, PPT

Requirements

Our code is implemented using MatConvNet (MATLAB required).

An appropriate installation of MatConvNet via the official website is necessary.
Detailed instructions on installing MatConvNet can be found here.
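
For reference, a typical MatConvNet build with GPU and cuDNN support looks like the following (a sketch; the CUDA and cuDNN paths are placeholders for your local installation):

    % Run inside MATLAB from the MatConvNet root directory <MatConvNet>.
    cd('<MatConvNet>');
    addpath matlab
    vl_compilenn('enableGpu', true, ...
                 'cudaRoot', '/usr/local/cuda-9.0', ...   % your CUDA install
                 'enableCudnn', true, ...
                 'cudnnRoot', '<cudnn_path>');            % your cuDNN install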

The code was tested under the following setting:

  • MATLAB 2017a
  • MatConvNet 1.0-beta25
  • CUDA 9.0, 10.0
  • cuDNN 7.1.4
  • NVIDIA TITAN Xp GPU

Measuring mPSNR (with Deep-SR-ITM/utils/mPSNR_HDR.m) additionally depends on the HDR Toolbox.
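
For intuition, mPSNR averages the mean squared error over several virtual exposures of the HDR image before converting to PSNR. A generic MATLAB sketch of this metric (not the repository's exact implementation; use utils/mPSNR_HDR.m with the HDR Toolbox for the authoritative version):

    % Generic mPSNR sketch: expose, gamma-encode and quantize both HDR images
    % at several exposure scales, then average the MSE across exposures.
    function val = mpsnr_sketch(hdr_pred, hdr_gt, exposures)
    % hdr_pred, hdr_gt : linear-light HDR images (same size, double)
    % exposures        : vector of exposure scales, e.g. 2.^(-2:2)
    mse = 0;
    for c = exposures
        p = min(max(round(255 * (c * hdr_pred) .^ (1/2.2)), 0), 255);
        g = min(max(round(255 * (c * hdr_gt  ) .^ (1/2.2)), 0), 255);
        mse = mse + mean((p(:) - g(:)).^2);
    end
    mse = mse / numel(exposures);
    val = 10 * log10(255^2 / mse);
    end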

Test code

Quick Start

  1. Download the source code in a directory of your choice <source_path>.
  2. Download the test dataset from this link and place the 'test' folder in <source_path>/data
  3. Copy the files in <source_path>/+dagnn/ to <MatConvNet>/matlab/+dagnn/
  4. Run test_mat.m or test_yuv.m
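
Once MatConvNet is compiled, a minimal test session might look like this (a sketch; <MatConvNet> and <source_path> are the directories from the steps above):

    run('<MatConvNet>/matlab/vl_setupnn.m');   % add MatConvNet to the MATLAB path
    cd('<source_path>');
    test_mat                                   % or: test_yuv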

Description

We provide the pre-trained weights for the x2 and x4 models in <source_path>/net.
The test dataset can be downloaded from here.
(Note: both the SDR and HDR video files have a resolution of 3840x2160.)

We provide three test code files:

  • test_mat.m for testing the provided .mat test set. (fast)
    • Please refer to the Quick Start section in order to run this code.
    • You can change the SR scale factor (2 or 4) by modifying the scale parameter in the initial settings.
    • You can choose which metrics to use for evaluation (PSNR, SSIM, mPSNR, MS-SSIM).
    • When you run this code, evaluation will be performed on the selected metrics and the .mat prediction file will be saved in <source_path>/pred/
  • test_yuv.m for testing the provided .yuv test set. (slow)
    • Please refer to the Quick Start section in order to run this code.
    • You can change the SR scale factor (2 or 4) by modifying the scale parameter in the initial settings.
    • You can choose which metrics to use for evaluation (PSNR, SSIM, mPSNR, MS-SSIM).
    • When you run this code, evaluation will be performed on the selected metrics and the .yuv prediction file will be saved in <source_path>/pred/
    • This test code generates a 10-bit HDR YUV video file, after the PQ-OETF, in the BT.2020 color container (see the PQ sketch after this list); the YUV file can be viewed on HDR TVs after encoding with these specifications.
  • test_myyuv.m for testing your own YUV files with this code.
    • You must specify the settings part according to your YUV file specifications.
    • After setting up the specifications, follow steps 1 to 3 in the Quick Start section and then run this code.
    • You can change the SR scale factor (2 or 4) by modifying the scale parameter in the initial settings.
    • This test code also generates a 10-bit HDR YUV video file, after the PQ-OETF, in the BT.2020 color container; the YUV file can be viewed on HDR TVs after encoding with these specifications.
  • test_mat_cpu.m for testing the provided .mat test set on a CPU. (very slow)
    • This version can be executed in the same way as test_mat.m.
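
For reference, the PQ-OETF mentioned above is the SMPTE ST 2084 transfer function, which maps normalized linear light to the nonlinear signal that is then quantized to 10 bits. A minimal MATLAB sketch of the standard formula (not the repository's exact implementation):

    % SMPTE ST 2084 (PQ) OETF: maps normalized linear light Y in [0,1] to the
    % nonlinear PQ signal E in [0,1]; a 10-bit code is then e.g. round(1023*E).
    function E = pq_oetf(Y)
    m1 = 2610 / 16384;        % 0.1593017578125
    m2 = 2523 / 4096 * 128;   % 78.84375
    c1 = 3424 / 4096;         % 0.8359375
    c2 = 2413 / 4096 * 32;    % 18.8515625
    c3 = 2392 / 4096 * 32;    % 18.6875
    Yp = Y .^ m1;
    E  = ((c1 + c2 .* Yp) ./ (1 + c3 .* Yp)) .^ m2;
    end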

Training code

Quick Start

  1. Download the source code in a directory of your choice <source_path>.
  2. Download the train dataset from here and place the 'train' folder in <source_path>/data
  3. Copy the files in <source_path>/+dagnn/ to <MatConvNet>/matlab/+dagnn/
  4. Run train_base_net.m (pre-training) then run train_full_net.m (full training with modulation components)
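
A minimal training session might look like this (a sketch; directories as in the steps above):

    run('<MatConvNet>/matlab/vl_setupnn.m');   % add MatConvNet to the MATLAB path
    cd('<source_path>');
    train_base_net   % pre-training  (use train_base_net_x4 for the x4 model)
    train_full_net   % full training (use train_full_net_x4 for the x4 model)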

Description

The train dataset can be downloaded from here. We create the training set prior to training (instead of cropping training patches at every mini-batch), as reading the 4K frames at every iteration is inefficient.

We provide two training code files: one for pre-training and one for fully training the whole network.

  • train_base_net.m for pre-training.
    • Please refer to the Quick Start section in order to run this code.
    • The trained weights will be saved in <source_path>/net/net_base
    • The network model (net_base) can be found in the file net_base.m (net_base_x4.m for x4 model)
    • Use train_base_net_x4.m for training the x4 model.
  • train_full_net.m for full training of the whole network including the modulation components.
    • Please refer to the Quick Start section in order to run this code.
    • This code initializes the corresponding weights of net_full with those trained on net_base. (Hence, it requires pre-training net_base with the train_base_net.m code.)
    • The trained weights will be saved in <source_path>/net/net_full
    • The network model (net_full) can be found in the file net_full.m (net_full_x4.m for x4 model)
    • Use train_full_net_x4.m for training the x4 model.

Testing with the trained model
When testing with your own trained model, make sure to modify the lines in the test code that load the network, as sketched below.
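
A minimal loading sketch using the MatConvNet DagNN API (the snapshot filename pattern and the stored variable name are assumptions; check the save calls in the training scripts):

    % Load a trained snapshot and wrap it as a DagNN object.
    % 'net-epoch-XX.mat' and the field 'net' are assumptions, not verified names.
    tmp = load(fullfile('<source_path>', 'net', 'net_full', 'net-epoch-XX.mat'));
    net = dagnn.DagNN.loadobj(tmp.net);
    net.mode = 'test';   % disable training-only behavior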

Multi-purpose CNN (New update 09.09)

We also provide the pre-trained parameters of our previous work (Multi-purpose CNN, referenced below), re-trained on the same data as Deep SR-ITM for the comparison reported in our ICCV paper.

Reference:

Soo Ye Kim and Munchurl Kim, "A Multi-purpose Convolutional Neural Network for Simultaneous Super-Resolution and High Dynamic Range Image Reconstruction," Asian Conference on Computer Vision, 2018.

Bibtex:

@inproceedings{kim2018a,
      title = {A Multi-purpose Convolutional Neural Network for Simultaneous Super-Resolution and High Dynamic Range Image Reconstruction}, 
      author = {Kim, Soo Ye and Kim, Munchurl},
      booktitle = {Proceedings of the Asian Conference on Computer Vision},
      year = {2018}
}

Description

We provide the pre-trained weights for the x2 and x4 models in <source_path>/net. The testing procedure is the same as for Deep SR-ITM, and you can easily test the Multi-purpose CNN by specifying 'Multi-purpose CNN' as the model in the Settings part of the test code files, as sketched below.
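
For example, in the Settings part of the test scripts (the variable name below is an assumption; match it to the actual settings code):

    model = 'Multi-purpose CNN';   % hypothetical settings line; use the
                                   % Deep SR-ITM model name to switch back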

Contact

Please contact me via email (sooyekim@kaist.ac.kr) for any problems regarding the released code.
