Context-Encoding-of-Detected-Buildings

Bounding Boxes Are All We Need: Street View Image Classification via Context Encoding of Detected Buildings

IEEE Transactions on Geoscience and Remote Sensing

Introduction

  1. In this paper, an approach based on a detector-encoder-classifier framework is proposed. Unlike common end-to-end models, our approach does not use visual features of the whole image directly. Instead, the framework obtains the bounding boxes of buildings in a street view image from a detector; their contextual information, such as building classes and positions, is then encoded into metadata and finally classified by a recurrent neural network (a minimal sketch of this pipeline is given after this list).
  2. To verify our approach, we built a dataset of 19,070 street view images and 38,857 buildings based on the BIC_GSV dataset, through a combination of automatic label acquisition and expert annotation. The dataset can be used not only for street view image classification aimed at urban land use analysis, but also for multi-class building detection. Experiments show that the proposed method achieves a significant performance improvement over models based on end-to-end convolutional neural networks.
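The snippet below is a minimal, self-contained sketch of the encoder and recurrent classifier stages. The feature layout (one-hot building class plus normalized box coordinates), class counts, and network sizes are illustrative assumptions, not the paper's exact implementation.

```python
# Illustrative sketch of the detector-encoder-classifier idea (assumptions, not the paper's exact setup).
import torch
import torch.nn as nn

NUM_BUILDING_CLASSES = 8   # assumed number of detected building categories
NUM_SCENE_CLASSES = 4      # assumed number of street view image classes

def encode_detections(boxes, labels, img_w, img_h):
    """Encode detected buildings as a sequence of [one-hot class, normalized box]."""
    feats = []
    # Order buildings left-to-right so the RNN sees a consistent spatial layout.
    for (x1, y1, x2, y2), c in sorted(zip(boxes, labels), key=lambda t: t[0][0]):
        onehot = torch.zeros(NUM_BUILDING_CLASSES)
        onehot[c] = 1.0
        pos = torch.tensor([x1 / img_w, y1 / img_h, x2 / img_w, y2 / img_h])
        feats.append(torch.cat([onehot, pos]))
    return torch.stack(feats)          # shape: (num_buildings, NUM_BUILDING_CLASSES + 4)

class ContextClassifier(nn.Module):
    """Recurrent classifier over the encoded building sequence."""
    def __init__(self, in_dim=NUM_BUILDING_CLASSES + 4, hidden=128):
        super().__init__()
        self.rnn = nn.LSTM(in_dim, hidden, batch_first=True)
        self.fc = nn.Linear(hidden, NUM_SCENE_CLASSES)

    def forward(self, seq):            # seq: (batch, num_buildings, in_dim)
        _, (h, _) = self.rnn(seq)
        return self.fc(h[-1])          # logits over street view image classes

# Example: two detected buildings in a 640x480 image
seq = encode_detections([(50, 100, 200, 400), (300, 120, 500, 420)], [2, 5], 640, 480)
logits = ContextClassifier()(seq.unsqueeze(0))
print(logits.shape)                    # torch.Size([1, 4])
```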

Results

| Models                     | M-P (%) | M-R (%) | M-F1 (%) |
|----------------------------|---------|---------|----------|
| layout + perfect detector  | 95.54   | 92.15   | 93.82    |
| layout + Ca101 best        | 81.81   | 80.94   | 81.37    |
| co-occurrence + Ca101 best | 81.47   | 80.53   | 81.00    |
| baseline: ResNet50         | 69.16   | 68.94   | 69.05    |
  • Building bounding boxes were generated using MMDetection with its default hyperparameters and pre-trained models (an illustrative inference call is sketched below).
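As a rough illustration of that detection step, MMDetection inference typically looks like the following; the config, checkpoint, and image paths are placeholders rather than files shipped with this repository.

```python
# Illustrative MMDetection inference call (paths and model choice are placeholders).
from mmdet.apis import init_detector, inference_detector

config_file = 'configs/faster_rcnn/faster_rcnn_r50_fpn_1x_coco.py'   # placeholder config
checkpoint_file = 'checkpoints/faster_rcnn_r50_fpn_1x.pth'           # placeholder weights

model = init_detector(config_file, checkpoint_file, device='cuda:0')
result = inference_detector(model, 'demo/street_view.jpg')           # placeholder image path
# `result` holds per-class bounding boxes with confidence scores, which would then
# be passed to the context encoder described above.
```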

BEAUTY Dataset

Download BEAUTY Dataset

BEAUTY (Building dEtection And Urban funcTional-zone portraYing): a street view image dataset with a dual label system, built on the existing BIC_GSV dataset.

The authors would like to thank the authors of reference [1] for publishing the BIC_GSV dataset, including city-scale GSV images. We would also like to thank Mengshuo Fan and Zhiwei He, experts in architecture and urban planning from the BIM Research Center, Qingdao Research Institute of Urban and Rural Construction, for their professional guidance on manual annotation. Thanks to those who participated in manual annotation for building detection: Yu Ma, Shanshan Lin, Ying Guo and Kaixin Li, and to those who participated in manual annotation for street view image classification: Ying Zhang, Jiaojie Wang, Shujing Ma and Yue Wang.

[1] J. Kang, M. Körner, Y. Wang, H. Taubenböck, and X. X. Zhu, “Building instance classification using street view images,” ISPRS Journal of Photogrammetry and Remote Sensing, vol. 145, pp. 44–59, 2018.

Experiments and requirements

All experiments are based on the same hardware and software conditions as follows: GPU: GeForce GTX 1080 × 2; OS: Ubuntu 18.04.3 LTS; CUDA Version: 10.0.130; PyTorch Version: 1.4.0 for cu100; and TorchVision Version: 0.5.0 for cu100.
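A quick way to confirm that a local environment matches these versions (this check is not part of the original repository) is:

```python
# Sanity check of the local PyTorch/CUDA environment against the versions listed above.
import torch
import torchvision

print(torch.__version__)          # expected: 1.4.0
print(torchvision.__version__)    # expected: 0.5.0
print(torch.version.cuda)         # expected: 10.0
print(torch.cuda.device_count())  # expected: 2 (GeForce GTX 1080 x 2)
```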

Get Docker

Citation

If you find our work useful, please kindly cite the following:

  1. Plain Text
K. Zhao, Y. Liu, S. Hao, S. Lu, H. Liu and L. Zhou, 
"Bounding Boxes Are All We Need: Street View Image Classification via Context Encoding of Detected Buildings," 
in IEEE Transactions on Geoscience and Remote Sensing, 
doi: 10.1109/TGRS.2021.3064316.
  2. BibTeX
@ARTICLE{9380541,  
  author={K. {Zhao} and Y. {Liu} and S. {Hao} and S. {Lu} and H. {Liu} and L. {Zhou}},  
  journal={IEEE Transactions on Geoscience and Remote Sensing},   
  title={Bounding Boxes Are All We Need: Street View Image Classification via Context Encoding of Detected Buildings},   
  year={2021},  
  volume={},  
  number={},  
  pages={1-17},  
  doi={10.1109/TGRS.2021.3064316}
}
