Deformable 3D Convolution for Video Super-Resolution

Pytorch implementation of deformable 3D convolution network (D3Dnet). [PDF]

Our code is based on cuda and can perform deformation in any dimension of 3D convolution.

Overview

Architecture of D3Dnet

Architecture of D3D

Requirements

Python 3
pytorch (1.0.0), torchvision (0.2.2) / pytorch (1.2.0), torchvision (0.4.0)
numpy, PIL
Visual Studio 2015

Build

Compile deformable 3D convolution:

Cd to code/dcn.
For Windows users, run cmd make.bat. For Linux users, run bash make.sh. The scripts will build D3D automatically and create some folders.
We offer customized settings for any dimension (e.g., Temporal, Height, Width) you want to deform. See code/dcn/test.py for more details.

Datasets

Training dataset

Download the Vimeo dataset and put the images in code/data/Vimeo.
Cd to code/data/Vimeo and run generate_LR_Vimeo90K.m to generate training data as below:

  Vimeo
    └── sequences
           ├── 00001
           ├── 00002
           ├── ...
    └── LR_x4
           ├── 00001
           ├── 00002
           ├── ...		
    ├── readme.txt 
    ├── sep_trainlist.txt
    ├── sep_testlist.txt
    └── generate_LR_Vimeo90K.m

Test dataset

Download the dataset Vid4 and SPMC-11 dataset in https://pan.baidu.com/s/1PKZeTo8HVklHU5Pe26qUtw (Code: 4l5r) and put the folder in code/data.
(optional) You can also download Vid4 and SPMC-11 or other video datasets and prepare test data in code/data as below:

 data
  └── dataset_1
         └── scene_1
               └── hr    
                  ├── hr_01.png  
                  ├── hr_02.png  
                  ├── ...
                  └── hr_M.png    
               └── lr_x4
                  ├── lr_01.png  
                  ├── lr_02.png  
                  ├── ...
                  └── lr_M.png   
         ├── ...		  
         └── scene_M
  ├── ...    
  └── dataset_N

Results

Quantitative Results

We have organized the Matlab code framework of Video Quality Assessment metric T-MOVIE and MOVIE. [Code]
Welcome to have a look and use our code.

Qualitative Results

A demo video is available at https://wyqdatabase.s3-us-west-1.amazonaws.com/D3Dnet.mp4

Citiation

@article{D3Dnet,
  author = {Ying, Xinyi and Wang, Longguang and Wang, Yingqian and Sheng, Weidong and An, Wei and Guo, Yulan},
  title = {Deformable 3D Convolution for Video Super-Resolution},
  journal = {IEEE Signal Processing Letters},
  volume = {27},
  pages = {1500-1504}，
  year = {2020},
}

Acknowledgement

This code is built on [DCNv2] and [SOF-VSR]. We thank the authors for sharing their codes.

Contact

Please contact us at yingxinyi18@nudt.edu.cn for any question.

Name		Name	Last commit message	Last commit date
Latest commit History 92 Commits
code		code
images		images
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

code

code

images

images

LICENSE

LICENSE

README.md

README.md

Repository files navigation

Deformable 3D Convolution for Video Super-Resolution

Overview

Architecture of D3Dnet

Architecture of D3D

Requirements

Build

Datasets

Training dataset

Test dataset

Results

Quantitative Results

Qualitative Results

Citiation

Acknowledgement

Contact

About

Releases

Packages

Contributors 3

Languages

License

XinyiYing/D3Dnet

Folders and files

Latest commit

History

Repository files navigation

Deformable 3D Convolution for Video Super-Resolution

Overview

Architecture of D3Dnet

Architecture of D3D

Requirements

Build

Datasets

Training dataset

Test dataset

Results

Quantitative Results

Qualitative Results

Citiation

Acknowledgement

Contact

About

Resources

License

Stars

Watchers

Forks

Languages