
cross-view-image-matching

[Paper, ICCV 2019] [Presentation Video]

Poster


Abstract

The visual entities in cross-view (e.g., ground and aerial) images exhibit drastic domain changes due to the differences in the viewpoints from which each set of images is captured. Existing state-of-the-art methods address the problem by learning view-invariant image descriptors. We propose a novel method for solving this task by exploiting the generative powers of conditional GANs to synthesize an aerial representation of a ground-level panorama query and use it to minimize the domain gap between the two views. Because the synthesized image comes from the same view as the reference (target) image, it helps the network preserve important cues in aerial images, following our Joint Feature Learning approach. We fuse the complementary features from the synthesized aerial image with the original ground-level panorama features to obtain a robust query representation. In addition, we employ multi-scale feature aggregation to preserve image representations at different scales, which is useful for solving this complex task. Experimental results show that our proposed approach performs significantly better than the state-of-the-art methods on the challenging CVUSA dataset in terms of top-1 and top-1% retrieval accuracies. Furthermore, we evaluate the generalization of the proposed method for urban landscapes on our newly collected cross-view localization dataset with geo-reference information.
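To make the retrieval pipeline described above concrete, here is a minimal NumPy sketch of the data flow: synthesize an aerial view from the query, extract and fuse features, then retrieve the nearest reference. The random projections stand in for the learned networks; every name and dimension is illustrative, not from the released code.

```python
import numpy as np

rng = np.random.default_rng(0)
D = 64                                     # descriptor dimension (toy)
N = 100                                    # number of reference aerial images

# Toy "images" as flat vectors; real inputs are panoramas and aerial crops.
ground_query = rng.standard_normal(256)
gallery_aerial = rng.standard_normal((N, 256))

# Random projections standing in for the learned networks.
G = rng.standard_normal((256, 256))        # cGAN generator: ground -> aerial
W_g = rng.standard_normal((D, 256))        # ground feature branch
W_a = rng.standard_normal((D, 256))        # aerial feature branch
W_f = rng.standard_normal((D, 2 * D))      # fusion of ground + synthesized feats

def l2norm(x):
    return x / np.linalg.norm(x, axis=-1, keepdims=True)

synth_aerial = G @ ground_query                       # 1) synthesize aerial view
f_query = l2norm(W_f @ np.concatenate(
    [W_g @ ground_query, W_a @ synth_aerial]))        # 2) fused query descriptor
f_gallery = l2norm(gallery_aerial @ W_a.T)            # 3) reference descriptors

best = int(np.argmax(f_gallery @ f_query))            # 4) nearest-neighbor match
print("retrieved reference index:", best)
```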

Code

xview-synthesis:

Code to synthesize cross-view images, i.e., generate an aerial image for a given ground panorama and vice versa. The code is borrowed from cross-view image synthesis and is implemented in Torch (Lua). Refer to that repo for basic instructions on getting started with the code.
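The released synthesis code is Torch/Lua; purely for orientation, the sketch below shows a comparable conditional encoder-decoder generator in tf.keras. The layer counts, channel widths, and the 256x256 input size are illustrative assumptions, not the paper's exact architecture.

```python
import tensorflow as tf
from tensorflow.keras import layers

def make_generator(in_shape=(256, 256, 3)):
    """Toy encoder-decoder: conditioning image in, cross-view image out."""
    x = inp = layers.Input(in_shape)
    for ch in (64, 128, 256):                         # encoder: downsample
        x = layers.Conv2D(ch, 4, strides=2, padding="same",
                          activation="relu")(x)
    for ch in (128, 64, 3):                           # decoder: upsample
        x = layers.Conv2DTranspose(ch, 4, strides=2, padding="same",
                                   activation="tanh" if ch == 3 else "relu")(x)
    return tf.keras.Model(inp, x)

generator = make_generator()
fake_aerial = generator(tf.random.normal([1, 256, 256, 3]))
print(fake_aerial.shape)                              # (1, 256, 256, 3)
```

In the actual code the generator is trained adversarially against a discriminator conditioned on the input view; that training loop is omitted here.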

two_stream:

Code to train the two-stream baseline network.
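As a structural illustration of such a baseline, the following tf.keras sketch builds one CNN stream per view with L2-normalized embeddings, trained with an in-batch soft-margin ranking loss in the spirit of cvmnet. All layer sizes and the scale alpha are illustrative assumptions, not the released configuration.

```python
import tensorflow as tf
from tensorflow.keras import layers

def make_branch(in_shape, dim=512):
    """One CNN stream producing an L2-normalized descriptor."""
    x = inp = layers.Input(in_shape)
    for ch in (64, 128, 256):
        x = layers.Conv2D(ch, 3, strides=2, padding="same",
                          activation="relu")(x)
    x = layers.GlobalAveragePooling2D()(x)
    x = layers.Dense(dim)(x)
    out = layers.Lambda(lambda t: tf.math.l2_normalize(t, axis=-1))(x)
    return tf.keras.Model(inp, out)

ground_net = make_branch((128, 512, 3))   # panorama stream
aerial_net = make_branch((256, 256, 3))   # aerial stream

def soft_margin_ranking_loss(f_g, f_a, alpha=10.0):
    """In-batch ranking loss: matched pairs sit on the diagonal."""
    sim = tf.matmul(f_g, f_a, transpose_b=True)        # pairwise similarities
    pos = tf.linalg.diag_part(sim)[:, None]            # similarity of true pairs
    loss = tf.math.log1p(tf.exp(alpha * (sim - pos)))  # penalize close negatives
    mask = 1.0 - tf.eye(tf.shape(sim)[0])              # drop the diagonal terms
    return tf.reduce_sum(loss * mask) / tf.reduce_sum(mask)

f_g = ground_net(tf.random.normal([4, 128, 512, 3]))
f_a = aerial_net(tf.random.normal([4, 256, 256, 3]))
print(float(soft_margin_ranking_loss(f_g, f_a)))
```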

joint_feature_learning:

Code to jointly learn features for the ground panorama (query), the aerial image synthesized from that query, and the reference aerial images. (The aerial images must be synthesized first using xview-synthesis.)
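The key idea is that the synthesized aerial image is embedded with the same aerial branch as the reference images, so both land in one embedding space. The NumPy toy below only illustrates this weight sharing, with random projections standing in for the CNN branches; all names and sizes are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(1)
D = 64

W_ground = rng.standard_normal((D, 256))   # ground (query) branch
W_aerial = rng.standard_normal((D, 256))   # aerial branch, weights SHARED

ground_pano  = rng.standard_normal(256)    # query panorama
synth_aerial = rng.standard_normal(256)    # output of xview-synthesis
real_aerial  = rng.standard_normal(256)    # reference aerial image

f_ground = W_ground @ ground_pano
f_synth  = W_aerial @ synth_aerial         # same weights as the reference ...
f_real   = W_aerial @ real_aerial          # ... so both live in one space

# Joint training would pull both query-side features toward the matching
# reference (toy distances shown; the real objective is a ranking loss).
print(np.linalg.norm(f_ground - f_real), np.linalg.norm(f_synth - f_real))
```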

feature_fusion:

Code to learn a fused representation of the ground panorama and the aerial image synthesized from it, yielding a robust query descriptor, together with the aerial image descriptors used for matching.

The image matching code is partly borrowed from cvmnet and is implemented in TensorFlow.
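A hedged tf.keras sketch of the fusion step follows: the two query-side feature vectors are concatenated and projected into a single L2-normalized query descriptor. The descriptor dimension and the single dense layer are illustrative assumptions, not the released architecture.

```python
import tensorflow as tf
from tensorflow.keras import layers

dim = 512
f_ground = layers.Input((dim,))            # from the ground-panorama stream
f_synth = layers.Input((dim,))             # from the synthesized-aerial stream

x = layers.Concatenate()([f_ground, f_synth])
x = layers.Dense(dim)(x)                   # project back to descriptor size
out = layers.Lambda(lambda t: tf.math.l2_normalize(t, axis=-1))(x)
fusion_head = tf.keras.Model([f_ground, f_synth], out)

# The fused descriptor is then matched against the aerial descriptors by
# cosine similarity, as in the pipeline sketch near the top of this README.
q = fusion_head([tf.random.normal([2, dim]), tf.random.normal([2, dim])])
print(q.shape)                             # (2, 512)
```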

Training and Test data

Datasets

The original datasets are available here:

  1. CVUSA
  2. UCF-OP (please feel free to contact me via email if the link doesn't work)

Models

CVUSA Dataset

Pretrained models can be downloaded individually here: [xview-synthesis] [two-stream] [joint_feature_learning] [feature_fusion]

All of these models can be downloaded at once via this link (~4.2 GB): [CVUSA Pretrained Models]

UCF-OP Dataset

Coming soon...

Evaluation

To ease future comparisons with our method on the CVUSA and CVACT datasets, we provide the following:

Feature files for the CVUSA test set: [CVUSA Test Features]

We also conducted experiments on the [CVACT Dataset] and provide the feature files here: [CVACT Test Features]
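For reference, a small NumPy script like the one below can score such feature files with the paper's metrics (top-1 and top-1% recall). The file names and the assumption that query row i matches gallery row i are hypothetical; adapt them to the actual download format.

```python
import numpy as np

query = np.load("cvusa_query_features.npy")     # (N, D), row i <-> query i
gallery = np.load("cvusa_aerial_features.npy")  # (N, D), row i is its match

q = query / np.linalg.norm(query, axis=1, keepdims=True)
g = gallery / np.linalg.norm(gallery, axis=1, keepdims=True)
sim = q @ g.T                                   # cosine similarity matrix

# Rank of the true match for each query (0 = retrieved first).
true_sim = np.diag(sim)[:, None]
rank = (sim > true_sim).sum(axis=1)

n = sim.shape[0]
print("top-1  recall:", (rank < 1).mean())
print("top-1% recall:", (rank < max(1, n // 100)).mean())
```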

Citation

If you find our work useful for your research, please cite the following papers:

  • Bridging the Domain Gap for Ground-to-Aerial Image Matching, ICCV 2019 [pdf] [bibtex]

  • Cross-View Image Synthesis Using Conditional GANs, CVPR 2018 [pdf] [bibtex]

  • Cross-view image synthesis using geometry-guided conditional GANs, CVIU 2019 [pdf] [bibtex]

Questions

Please contact: krishna.regmi7@gmail.com
