nkjcqvcpi/ASR

Overview

This repository provides the Large-Scale Anime Style Recognition Dataset (LSASRD) and the baseline approaches of the benchmark.

Dataset Download | Citation

Anime Style Recognition (ASR)

Given two images of different anime roles, anime style recognition (ASR) aims to learn abstract painting style in order to determine whether the two images come from the same work, which is an interesting but challenging problem. Unlike biometric recognition, such as face recognition, iris recognition, and person re-identification, ASR suffers from a much larger semantic gap yet receives far less attention. In this paper, we propose a challenging ASR benchmark. Firstly, we collect a large-scale ASR dataset (LSASRD), which contains 20,937 images from 190 anime works, with each work having at least ten different roles. In addition to its large scale, LSASRD covers a range of challenging factors, such as complex illumination, varied poses, theatrical colors, and exaggerated compositions. Secondly, we design a cross-role protocol to evaluate ASR performance, in which query and gallery images must come from different roles, so that an ASR model is validated on learning abstract painting style rather than discriminative features of individual roles. Finally, we apply two powerful person re-identification methods, AGW and TransReID, to establish baseline performance on LSASRD. Surprisingly, the recent transformer model (i.e., TransReID) achieves only 42.24% mAP on LSASRD. We therefore believe that the ASR task, with its huge semantic gap, deserves deep and long-term research.
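
To make the cross-role protocol concrete, here is a minimal evaluation sketch in Python/NumPy. It assumes a precomputed query-to-gallery distance matrix plus per-image work and role labels; the function name, data layout, and toy data below are illustrative assumptions, not the repository's actual evaluation code.

# Minimal sketch of cross-role mean average precision (mAP), assuming a
# precomputed query-to-gallery distance matrix and integer work/role labels.
# Names, layout, and toy data are hypothetical, not this repository's code.
import numpy as np


def cross_role_map(dist, q_work, q_role, g_work, g_role):
    # For each query, gallery images of the query's own role are excluded,
    # so a correct retrieval must be a different role from the same anime
    # work, i.e. a match on painting style rather than on the role itself.
    aps = []
    for i in range(dist.shape[0]):
        keep = g_role != q_role[i]                  # cross-role constraint
        order = np.argsort(dist[i][keep])           # nearest gallery first
        matches = g_work[keep][order] == q_work[i]  # same work = correct
        if not matches.any():                       # no valid ground truth
            continue
        hits = np.cumsum(matches)
        precision = hits / (np.arange(matches.size) + 1)
        aps.append((precision * matches).sum() / matches.sum())
    return float(np.mean(aps))


# Toy usage: random 8-D features for two works with a few roles each.
rng = np.random.default_rng(0)
q_feat, g_feat = rng.normal(size=(4, 8)), rng.normal(size=(6, 8))
dist = np.linalg.norm(q_feat[:, None] - g_feat[None], axis=-1)
q_work, q_role = np.array([0, 0, 1, 1]), np.array([0, 1, 4, 5])
g_work, g_role = np.array([0, 0, 0, 1, 1, 1]), np.array([0, 2, 3, 4, 6, 7])
print(f"cross-role mAP: {cross_role_map(dist, q_work, q_role, g_work, g_role):.4f}")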

Challenge

LSASRD is a challenging dataset established for anime style recognition. The figure above visualizes the statistics of the proposed dataset.

Reference methods: AGW and TransReID

Dataset

Acknowledgement

This project was supported by the National Training Program on Undergraduate Innovation and Entrepreneurship of China (No. 202110385018).

Citation

If you use the LSASRD or the benchmark for your research, please cite our paper as follows.

@InProceedings{Li_2022_CVPR,
    author    = {Li, Haotang and Guo, Shengtao and Lyu, Kailin and Yang, Xiao and Chen, Tianchen and Zhu, Jianqing and Zeng, Huanqiang},
    title     = {A Challenging Benchmark of Anime Style Recognition},
    booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops},
    month     = {June},
    year      = {2022},
    pages     = {4721-4730}
}
