Attentive Normalization

Please check our refactorized code at iVMCL-Release.

This repo contains the code and pretrained models for AOGNets: Compositional Grammatical Architectures for Deep Learning (CVPR 2019) and Attentive Normalization. The models are trained on COCO object detection and instance segmentation task with Mask-RCNN and Cascade-Mask-RCNN model. We replace the backbone with our imagenet pretrained backbones and head normalization with our Attentive Normalization. The results and trained models could be found in the table below.

@article{li2019attentive,
  title={Attentive Normalization},
  author={Li, Xilai and Sun, Wei and Wu, Tianfu},
  journal={arXiv preprint arXiv:1908.01259},
  year={2019}
}

@inproceedings{li2019aognets,
  title={AOGNets: Compositional Grammatical Architectures for Deep Learning},
  author={Li, Xilai and Song, Xi and Wu, Tianfu},
  booktitle={Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition},
  pages={6220--6230},
  year={2019}
}

Getting Started

The ImageNet pretrained models are needed for training, and they can be downloaded from the https://github.com/iVMCL/AOGNet-v2.

git clone https://github.com/iVMCL/AttentiveNorm_Detection.git
cd AttentiveNorm_Detection
mkdir pretrained_models   # And put all the pretrained models under that directory.

Installation, data preparation, and training/evaluating models are the same as it in the original mmdetection repo.

Results and Models

Mask-RCNN

Architecture	Backbone	Head	#Params	box AP	mask AP	Download
ResNet-50	BN	-	45.71M	39.2	35.2	Google Drive
	AN (w/ BN)	-	45.91M	40.8	36.4	Google Drive
	*GN	GN	45.72M	40.3	35.7	-
	*SN	SN	-	41.0	36.5	-
	AN (w/ BN)	AN (w/ GN)	45.96M	41.6	37.4	Google Drive
ResNet-101	BN	-	64.70M	41.4	36.8	Google Drive
	AN (w/ BN)	-	65.15M	43.1	38.2	Google Drive
	*GN	GN	64.71M	41.8	36.8	-
	AN (w/ BN)	AN (w/ GN)	65.20M	43.2	38.8	Google Drive
AOGNet-12m	BN	-	33.09M	40.7	36.4	Google Drive
	AN (w/ BN)	-	33.21M	42.0	37.8	Google Drive
	AN (w/ BN)	AN (w/ GN)	33.26M	43.0	38.7	Google Drive
AOGNet-40m	BN	-	60.73M	43.4	38.5	Google Drive
	AN (w/ BN)	-	60.97M	44.1	39.0	Google Drive
	AN (w/ BN)	AN (w/ GN)	61.02M	44.9	40.2	Google Drive

Cascade Mask-RCNN

Architecture	Backbone	Head	#Params	box AP	mask AP	Download
ResNet-101	BN	-	96.32M	44.4	38.2	Google Drive
ResNet-101	AN (w/ BN)	-	96.77M	45.8	39.6	Google Drive
AOGNet-40m	BN	-	92.35M	45.6	39.3	Google Drive
AOGNet-40m	AN (w/ BN)	-	92.58M	46.5	40.0	Google Drive

MMDetection

News: We released the technical report on ArXiv.

Introduction

The master branch works with PyTorch 1.1 or higher.

mmdetection is an open source object detection toolbox based on PyTorch. It is a part of the open-mmlab project developed by Multimedia Laboratory, CUHK.

Major features

Modular Design

We decompose the detection framework into different components and one can easily construct a customized object detection framework by combining different modules.
Support of multiple frameworks out of box

The toolbox directly supports popular and contemporary detection frameworks, e.g. Faster RCNN, Mask RCNN, RetinaNet, etc.
High efficiency

All basic bbox and mask operations run on GPUs now. The training speed is faster than or comparable to other codebases, including Detectron, maskrcnn-benchmark and SimpleDet.
State of the art

The toolbox stems from the codebase developed by the MMDet team, who won COCO Detection Challenge in 2018, and we keep pushing it forward.

Apart from MMDetection, we also released a library mmcv for computer vision research, which is heavily depended on by this toolbox.

License

This project is released under the Apache 2.0 license.

Updates

v1.0rc0 (27/07/2019)

Implement lots of new methods and components (Mixed Precision Training, HTC, Libra R-CNN, Guided Anchoring, Empirical Attention, Mask Scoring R-CNN, Grid R-CNN (Plus), GHM, GCNet, FCOS, HRNet, Weight Standardization, etc.). Thank all collaborators!
Support two additional datasets: WIDER FACE and Cityscapes.
Refactoring for loss APIs and make it more flexible to adopt different losses and related hyper-parameters.
Speed up multi-gpu testing.
Integrate all compiling and installing in a single script.

v0.6.0 (14/04/2019)

Up to 30% speedup compared to the model zoo.
Support both PyTorch stable and nightly version.
Replace NMS and SigmoidFocalLoss with Pytorch CUDA extensions.

v0.6rc0(06/02/2019)

Migrate to PyTorch 1.0.

v0.5.7 (06/02/2019)

Add support for Deformable ConvNet v2. (Many thanks to the authors and @chengdazhi)
This is the last release based on PyTorch 0.4.1.

v0.5.6 (17/01/2019)

Add support for Group Normalization.
Unify RPNHead and single stage heads (RetinaHead, SSDHead) with AnchorHead.

v0.5.5 (22/12/2018)

Add SSD for COCO and PASCAL VOC.
Add ResNeXt backbones and detection models.
Refactoring for Samplers/Assigners and add OHEM.
Add VOC dataset and evaluation scripts.

v0.5.4 (27/11/2018)

Add SingleStageDetector and RetinaNet.

v0.5.3 (26/11/2018)

Add Cascade R-CNN and Cascade Mask R-CNN.
Add support for Soft-NMS in config files.

v0.5.2 (21/10/2018)

Add support for custom datasets.
Add a script to convert PASCAL VOC annotations to the expected format.

v0.5.1 (20/10/2018)

Add BBoxAssigner and BBoxSampler, the train_cfg field in config files are restructured.
ConvFCRoIHead / SharedFCRoIHead are renamed to ConvFCBBoxHead / SharedFCBBoxHead for consistency.

Benchmark and model zoo

Supported methods and backbones are shown in the below table. Results and models are available in the Model zoo.

	ResNet	ResNeXt	SENet	VGG	HRNet
RPN	✓	✓	☐	✗	✓
Fast R-CNN	✓	✓	☐	✗	✓
Faster R-CNN	✓	✓	☐	✗	✓
Mask R-CNN	✓	✓	☐	✗	✓
Cascade R-CNN	✓	✓	☐	✗	✓
Cascade Mask R-CNN	✓	✓	☐	✗	✓
SSD	✗	✗	✗	✓	✗
RetinaNet	✓	✓	☐	✗	✓
GHM	✓	✓	☐	✗	✓
Mask Scoring R-CNN	✓	✓	☐	✗	✓
FCOS	✓	✓	☐	✗	✓
Double-Head R-CNN	✓	✓	☐	✗	✓
Grid R-CNN (Plus)	✓	✓	☐	✗	✓
Hybrid Task Cascade	✓	✓	☐	✗	✓
Libra R-CNN	✓	✓	☐	✗	✓
Guided Anchoring	✓	✓	☐	✗	✓

Other features

Installation

Please refer to INSTALL.md for installation and dataset preparation.

Get Started

Please see GETTING_STARTED.md for the basic usage of MMDetection.

Contributing

We appreciate all contributions to improve MMDetection. Please refer to CONTRIBUTING.md for the contributing guideline.

Acknowledgement

MMDetection is an open source project that is contributed by researchers and engineers from various colleges and companies. We appreciate all the contributors who implement their methods or add new features, as well as users who give valuable feedbacks. We wish that the toolbox and benchmark could serve the growing research community by providing a flexible toolkit to reimplement existing methods and develop their own new detectors.

Citation

If you use this toolbox or benchmark in your research, please cite this project.

@article{mmdetection,
  title   = {{MMDetection}: Open MMLab Detection Toolbox and Benchmark},
  author  = {Chen, Kai and Wang, Jiaqi and Pang, Jiangmiao and Cao, Yuhang and
             Xiong, Yu and Li, Xiaoxiao and Sun, Shuyang and Feng, Wansen and
             Liu, Ziwei and Xu, Jiarui and Zhang, Zheng and Cheng, Dazhi and
             Zhu, Chenchen and Cheng, Tianheng and Zhao, Qijie and Li, Buyu and
             Lu, Xin and Zhu, Rui and Wu, Yue and Dai, Jifeng and Wang, Jingdong
             and Shi, Jianping and Ouyang, Wanli and Loy, Chen Change and Lin, Dahua},
  journal= {arXiv preprint arXiv:1906.07155},
  year={2019}
}

Contact

This repo is currently maintained by Kai Chen (@hellock), Jiangmiao Pang (@OceanPang), Jiaqi Wang (@myownskyW7) and Yuhang Cao (@yhcao6).

Name		Name	Last commit message	Last commit date
Latest commit History 667 Commits
.github		.github
configs		configs
demo		demo
docker		docker
docs		docs
mmdet		mmdet
tests		tests
tools		tools
.gitignore		.gitignore
.isort.cfg		.isort.cfg
.pre-commit-config.yaml		.pre-commit-config.yaml
.style.yapf		.style.yapf
.travis.yml		.travis.yml
LICENSE		LICENSE
README.md		README.md
pytest.ini		pytest.ini
requirements.txt		requirements.txt
setup.py		setup.py

License

iVMCL/AttentiveNorm_Detection

Folders and files

Latest commit

History

Repository files navigation

Attentive Normalization

Getting Started

Results and Models

MMDetection

Introduction

Major features

License

Updates

Benchmark and model zoo

Installation

Get Started

Contributing

Acknowledgement

Citation

Contact

About

Resources

License

Code of conduct

Stars

Watchers

Forks

Languages