BSRT: Improving Burst Super-Resolution with Swin Transformer and Flow-Guided Deformable Alignment (CVPRW 2022)


This work addresses the Burst Super-Resolution (BurstSR) task, which requires restoring a high-quality image from a sequence of noisy, misaligned, low-resolution RAW frames. To overcome the challenges in BurstSR, we propose a Burst Super-Resolution Transformer (BSRT), which significantly improves the ability to extract inter-frame information and reconstruct the final image. To achieve this goal, we propose a Pyramid Flow-Guided Deformable Convolution Network (Pyramid FG-DCN) and incorporate Swin Transformer Blocks and Groups as our main backbone. More specifically, we combine optical flows and deformable convolutions, so BSRT can handle misalignment and aggregate texture information across multiple frames more effectively. In addition, our Transformer-based structure captures long-range dependencies to further improve performance. Evaluations on both the synthetic and real-world tracks demonstrate that our approach achieves a new state-of-the-art on the BurstSR task. Furthermore, BSRT won first place in the NTIRE 2022 Burst Super-Resolution Challenge.

Read the full paper on arXiv: https://arxiv.org/abs/2204.08332.

Note: BSRT is the winner of the NTIRE 2022 Burst Super-Resolution Challenge Real-World Track! You can also find our winning method from the NTIRE 2021 Burst Super-Resolution Challenge here.

Warning: This fork of the repo is under heavy development. The code is likely to change and I make no promises of stability. It follows that this README is also likely to change and should not be considered authoritative or even correct!

Comparison with State-of-the-art Burst Super-Resolution Methods


Overview Architecture

overview.png
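The central alignment idea (optical flow supplies a coarse per-pixel offset, and a deformable convolution learns residual refinements on top of it) can be sketched in a few lines of PyTorch. This is a minimal illustration of the principle only, not the repository's Pyramid FG-DCN: the module, its names, and the offset layout below are assumptions built on torchvision's DeformConv2d rather than this codebase's API.

import torch
import torch.nn as nn
import torch.nn.functional as F
from torchvision.ops import DeformConv2d

def flow_warp(feat, flow):
    """Warp feat (N, C, H, W) towards the reference frame using flow (N, 2, H, W) in pixels."""
    n, _, h, w = feat.shape
    ys, xs = torch.meshgrid(
        torch.arange(h, device=feat.device),
        torch.arange(w, device=feat.device),
        indexing="ij",
    )
    base = torch.stack((xs, ys)).float()   # (2, H, W), (x, y) order
    coords = base.unsqueeze(0) + flow      # absolute sampling positions
    # Normalize to [-1, 1] for grid_sample.
    gx = 2.0 * coords[:, 0] / max(w - 1, 1) - 1.0
    gy = 2.0 * coords[:, 1] / max(h - 1, 1) - 1.0
    grid = torch.stack((gx, gy), dim=-1)   # (N, H, W, 2)
    return F.grid_sample(feat, grid, align_corners=True)

class FlowGuidedDCN(nn.Module):
    """One alignment step: flow gives coarse offsets, a deformable conv refines them."""
    def __init__(self, channels, kernel_size=3):
        super().__init__()
        self.k2 = kernel_size * kernel_size
        # Predict residual offsets from [reference feature, flow-warped feature, flow].
        self.offset_conv = nn.Conv2d(2 * channels + 2, 2 * self.k2, 3, padding=1)
        self.dcn = DeformConv2d(channels, channels, kernel_size, padding=kernel_size // 2)

    def forward(self, ref_feat, nbr_feat, flow):
        warped = flow_warp(nbr_feat, flow)
        residual = self.offset_conv(torch.cat([ref_feat, warped, flow], dim=1))
        # The flow is the base offset for every kernel sample; DCN offsets are
        # (dy, dx) pairs, so the (dx, dy) flow is flipped before being tiled.
        offset = flow.flip(1).repeat(1, self.k2, 1, 1) + residual
        return self.dcn(nbr_feat, offset)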

Dependencies

  • Docker: >= 20
  • Python: 3.10
  • CUDA: Runtime must be available
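Before building images or launching training, it's worth confirming that the CUDA runtime is actually visible to PyTorch. A quick sanity check (plain PyTorch, nothing repo-specific):

import torch

# Fails fast if no CUDA runtime/driver is visible to PyTorch.
assert torch.cuda.is_available(), "No CUDA runtime found"
print(torch.cuda.get_device_name(0))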

Building the Docker Images

Dockerfile.micromamba

TODO: Description

sudo docker buildx build -t bsrt_micromamba -f dockerfiles/Dockerfile.micromamba .

Dockerfile.from_scratch

TODO: Description

TODO: Does not include BSRT yet

sudo docker buildx build -t bsrt_from_scratch -f dockerfiles/Dockerfile.from_scratch .

docker-bake.hcl

TODO: Description

bash bake.sh

Running a Docker Image

The command below bind-mounts $HOME/datasets into the container and follows NVIDIA's recommended settings for PyTorch containers: --ipc=host and the raised memlock/stack ulimits give DataLoader workers enough shared memory. Substitute the image tag you built above for bsrt.

sudo docker run \
  --gpus all \
  --mount type=bind,source="$HOME/datasets",target="$HOME/datasets" \
  --ipc=host \
  --ulimit memlock=-1 \
  --ulimit stack=67108864 \
  bsrt bash -c "python3 /bsrt/bsrt/main.py"

Training

Hyperparameter search

  1. Create a ramdisk to store the training dataset:
mkdir "$HOME/ramdisk"
sudo mount -t tmpfs tmpfs "$HOME/ramdisk"
sudo chown $USER:$USER -R "$HOME/ramdisk"
mkdir "$HOME/ramdisk/datasets"
  2. Download and run the Docker image:
docker run \
  --gpus all \
  --mount type=bind,source="$HOME/ramdisk/datasets",target=/datasets \
  --ipc=host \
  --ulimit memlock=-1 \
  --ulimit stack=67108864 \
  --rm connorbaker01/bsrt:main
  3. Run a hyperparameter search backed by Optuna (a sketch for inspecting the finished study follows the command):
python -m bsrt.tuning.tuner \
  --experiment_name "model_with_adam_plateau" \
  --optimizer "AdamW" \
  --scheduler "ReduceLROnPlateau" \
  --precision "bfloat16" \
  --num_trials 1 \
  --max_epochs 20 \
  --batch_size 16 \
  --limit_train_batches 0.1 \
  --limit_val_batches 0.1 \
  --data_dir /datasets \
  --wandb_api_key="<your WandB API key>" \
  --db_uri="<your DB connection string>"
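Once trials have finished, the study can be inspected from the same storage backend. A minimal sketch, assuming --experiment_name is used verbatim as the Optuna study name (an assumption; check bsrt/tuning/tuner.py for the actual naming):

import optuna

# Connect to the same storage the tuner wrote to (the --db_uri value).
study = optuna.load_study(
    study_name="model_with_adam_plateau",  # assumes --experiment_name == study name
    storage="<your DB connection string>",
)

print("Best value:", study.best_value)
print("Best hyperparameters:", study.best_trial.params)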

Pre-training with synthetic data

TODO(connor)

Original parameters used for pre-training (8 GPUs, mixed precision via --fp16, the small "S" model variant with Swin feature extraction, 14-frame bursts, 256-pixel patches):

python main.py \
  --n_GPUs 8 \
  --print_every 40 \
  --lr 0.0001 \
  --decay 150-300 \
  --save bsrt_tiny \
  --model BSRT \
  --fp16 \
  --model_level S \
  --swinfeature \
  --batch_size 32 \
  --burst_size 14 \
  --patch_size 256

Training with real-world data

TODO(connor)

Original parameters used for training on real-world data (fine-tuning from the synthetic pre-trained checkpoint at a lower learning rate with 80-pixel patches):

python main.py \
  --n_GPUs 8 \
  --print_every 20 \
  --lr 0.00005 \
  --decay 40-80 \
  --save bsrt_tiny \
  --model BSRT \
  --fp16 \
  --model_level S \
  --swinfeature \
  --batch_size 8 \
  --burst_size 14 \
  --patch_size 80 \
  --pre_train ../../synthetic/train_log/bsrt/real_models/bsrt_tiny/bsrt_best_epoch.pth

Inference

TODO(connor): add inference script and pretrained weights
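Until that script lands, the general shape of burst inference can be sketched. Everything below is hypothetical: the import path, constructor, checkpoint name, and input layout are assumptions inferred from the training flags above (14-frame RAW bursts), not this repository's actual API.

import torch
from bsrt.model import BSRT  # hypothetical import path

model = BSRT()  # hypothetical constructor; real arguments will differ
model.load_state_dict(torch.load("bsrt_best_epoch.pth", map_location="cpu"))
model = model.eval().cuda()

# A burst of 14 packed RAW frames (4 channels each), per --burst_size 14 above.
burst = torch.rand(1, 14, 4, 48, 48, device="cuda")

with torch.no_grad():
    sr = model(burst)  # expected output: one super-resolved RGB image
print(sr.shape)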

Results

Comparison on Synthetic dataset

cmp_syn.png

Comparison on Real-World dataset

cmp_real.png

Citations

If our code helps your research or work, please consider citing our paper with the following BibTeX entry.

@inproceedings{luo2022bsrt,
  title={BSRT: Improving Burst Super-Resolution with Swin Transformer and Flow-Guided Deformable Alignment},
  author={Luo, Ziwei and Li, Youwei and Cheng, Shen and Yu, Lei and Wu, Qi and Wen, Zhihong and Fan, Haoqiang and Sun, Jian and Liu, Shuaicheng},
  booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  pages={998--1008},
  year={2022}
}

Contact

Email: ziwei.ro@gmail.com
