GitHub - Bei-Jin/STMFANet

Introduction

This repository contains the implementation of "Exploring Spatial-Temporal Multi-Frequency Analysis for High-Fidelity and Temporal-Consistency Video Prediction". The paper is available as an arXiv report.

Video prediction is a pixel-wise dense prediction task that infers future frames from past frames. Missing appearance details and motion blur remain two major problems for current models, and they lead to image distortion and temporal inconsistency. We argue that multi-frequency analysis is necessary to address both problems. Inspired by the frequency band decomposition characteristic of the Human Visual System (HVS), we propose a video prediction network based on multi-level wavelet analysis that handles spatial and temporal information in a unified way. Specifically, a multi-level spatial discrete wavelet transform decomposes each video frame into anisotropic sub-bands at multiple frequencies, which enriches structural information and preserves fine details. In addition, a multi-level temporal discrete wavelet transform, operating along the time axis, decomposes the frame sequence into sub-band groups of different frequencies, so that multi-frequency motions can be captured accurately at a fixed frame rate. Extensive experiments on diverse datasets demonstrate that our model significantly improves fidelity and temporal consistency over state-of-the-art works.
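To make the two decompositions described above concrete, here is a minimal sketch in plain Python. It is not the repository's implementation: the Haar wavelet is an assumption chosen for brevity, frames are represented as nested lists rather than tensors, and the names `haar_1d`, `haar_2d`, `temporal_haar`, `multilevel`, and `energy` are hypothetical helpers introduced only for this illustration.

```python
import math

SQRT2 = math.sqrt(2.0)


def haar_1d(signal):
    """One level of the orthonormal Haar DWT on an even-length sequence.

    Returns (low, high): scaled pairwise sums and differences.
    """
    if len(signal) % 2 != 0:
        raise ValueError("sequence length must be even")
    pairs = [(signal[i], signal[i + 1]) for i in range(0, len(signal), 2)]
    low = [(a + b) / SQRT2 for a, b in pairs]
    high = [(a - b) / SQRT2 for a, b in pairs]
    return low, high


def transpose(matrix):
    return [list(col) for col in zip(*matrix)]


def haar_2d(frame):
    """One spatial level: filter rows, then columns.

    Returns (ll, lh, hl, hh). The first letter refers to the row filter and
    the second to the column filter; sub-band naming conventions vary.
    """
    row_low, row_high = [], []
    for row in frame:
        lo, hi = haar_1d(row)
        row_low.append(lo)
        row_high.append(hi)

    def filter_columns(matrix):
        lows, highs = [], []
        for col in transpose(matrix):
            lo, hi = haar_1d(col)
            lows.append(lo)
            highs.append(hi)
        return transpose(lows), transpose(highs)

    ll, lh = filter_columns(row_low)
    hl, hh = filter_columns(row_high)
    return ll, lh, hl, hh


def temporal_haar(frames):
    """One temporal level: Haar filtering of every pixel along the time axis.

    Returns (low_frames, high_frames), each with half as many frames.
    """
    if len(frames) % 2 != 0:
        raise ValueError("number of frames must be even")
    low, high = [], []
    for t in range(0, len(frames), 2):
        a, b = frames[t], frames[t + 1]
        low.append([[(x + y) / SQRT2 for x, y in zip(ra, rb)] for ra, rb in zip(a, b)])
        high.append([[(x - y) / SQRT2 for x, y in zip(ra, rb)] for ra, rb in zip(a, b)])
    return low, high


def multilevel(decompose_once, data, levels):
    """Repeatedly decompose the low-frequency band.

    Returns (approximation, details), where details[k] holds the
    high-frequency band(s) produced at level k + 1.
    """
    approx, details = data, []
    for _ in range(levels):
        approx, *high_bands = decompose_once(approx)
        details.append(high_bands)
    return approx, details


def energy(x):
    """Sum of squares over an arbitrarily nested list of numbers."""
    if isinstance(x, (list, tuple)):
        return sum(energy(v) for v in x)
    return float(x) ** 2
```

Calling `multilevel(haar_2d, frame, 2)` yields a coarse approximation of the frame plus three detail sub-bands per level, and `multilevel(temporal_haar, frames, 2)` splits a clip into a slowly varying component plus progressively faster motion components. Because the Haar filters used here are orthonormal, the total energy of the sub-bands equals that of the input, which `energy` can be used to check.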

Folders and files (46 commits)

.idea
data
options
util
videos
wavenet_models
README.md
train_kth.py
