research-article

Visual Self-localization via Inferring View-to-Map Correspondences

Authors:

Xiaobai LiuAuthors Info & Claims

VSCC '17: Proceedings of the Workshop on Visual Analysis in Smart and Connected Communities

Pages 33 - 39

https://doi.org/10.1145/3132734.3132740

Published: 23 October 2017 Publication History

Abstract

This paper poses self-localization of a moving camera as a view-to-map correspondence problem, utilizing a large set of ground-view maps rendered by existing mapping tool, like Google Earth. To address the viewpoint and appearance differences between the rendered maps and camera views, we present a unified solution that comprises of three components. Firstly, we represent each rendered map with a set of view-dependent feature patches and discriminatively train an appearance model for each patch. With these models we cast the view-to-map correspondence task as a ranking task. Secondly, we introduce a programming based method to discover feature correspondences over consecutive frames which are used to estimate visual odometry of camera. The programming formula is regularized with both flow type constraints and spatial smoothness constraints to account for scene noises. Thirdly, we present a joint probabilistic formula to integrate both visual odometry and view-to-map correspondences for reasoning with uncertainties. Evaluations with comparisons over challenging monocular videos demonstrated that our method clearly outperforms the alternative methods. In particular, our method is capable of localizing a moving camera with sub-meter accuracies in the scenario of about 13,000 square meters.

References

[1]

M. Aubry, B. Russell, and J. Sivic. 2014. Painting-to-3D model alignment via discriminative visual elements. SIGGRAPH (2014).

[2]

H. Badino, A. Yamamoto, and T. Kanade. 2013. Visual odometry by multiple-frame feature integration. International workshop on Computer Vision for Autonomous Driving at ICCV.

Digital Library

[3]

H. Bay, A. Ess T. Tuytelaars, and L. Van Gool. 2008. SURF: Speeded Up Robust Features. CVIU, Vol. 110, 3 (2008), 346--359.

Digital Library

[4]

J. Berclaz, F. Fleuret, E. Turetken, and P. Fual. 2011. Multiple object tracking using k-shortest paths optimization. TPAMI, Vol. 33, 9 (2011), 1806--1819.

Digital Library

[5]

M. Brubaker, A. Geiger, and R. Urtasun. 2013. Lost! Leveraging the Crowd for Probabilistic Visual Self-Localization. CVPR.

Digital Library

[6]

N. Dalal and B. Triggs. 2005. Histograms of Oriented Gradients for Human Detection. CVPR.

Digital Library

[7]

A. J. Davison. 2003. Real-time Simultaneous Localisation and Mapping with a Single Camera. ICCV.

Digital Library

[8]

P. Felzenszwalb, R. Girshick, D. McAllester, and D. Ramanan. 2010. Object detection with discriminatively trained part based models. TPAMI (2010).

Digital Library

[9]

P. Felzenszwalb and D. Huttenlocher. 2005. Efficient Belief Propagation for early vision. IJCV (2005).

Digital Library

[10]

A. Geiger, J. Ziegler, and C. Stiller. 2011. StereoScan: dense 3D reconstruction in real-time. Intelligent Vehicles Symposium.

[11]

J. Hays and A. Efros. 2008. im2gps: estimating geographic information from a single image. CVPR.

[12]

D. Fox J.-S. Gutmann, W. Burgard and K. Konolige. 1998. An experimental comparison of localization methods. ICIRS.

[13]

S. Khan and M. Shah. 2009. Tracking Multiple Occluding People by Localizing on Multiple Scene Planes. TPAMI, Vol. 31, 3 (2009), 505--519.

Digital Library

[14]

W. K. Leow, C.-C. Chiang, and Y.-P. Hung. 2008. Localization and Mapping of Surveillance Cameras in City Map MM.

Digital Library

[15]

Y. Li, N. Snavely, D. Huttenlocher, and P. Fua. 2012. Worldwide pose estimation using 3d point clouds. ECCV.

Digital Library

[16]

C. Liu, J. Yuen, and A. Torralba. 2009. SIFT Flow: dense correspondence across scenes and its applications. TPAMI (2009).

Digital Library

[17]

T. Malisiewicz, A. Gupta, and A. A. Efros. 2011. Ensemble of exemplar-svms for object detection and beyond. ICCV.

Digital Library

[18]

C. Mei, G. Sibley, M. Cummins, P. Newman, and I. Reid. 2010. A system for large-scale mapping in constant-time using stereos. IJCV (2010).

Digital Library

[19]

P. Merriaux, Y. Dupuis, and P. Vasseur. 2015. Fast and robust vehicle positioning on graph-based representation of drivable maps. ICRA.

[20]

R. Mohedano, A. Cavallaro, and N. Garcia. 2014. Camera Localization Using Trajectories and Maps. TPAMI (2014).

Digital Library

[21]

O. Pink, F. Moosmann, and A. Bachmann. 2009. Visual features for vehicle localization and ego-motion estimation. In IV.

[22]

G. Schindler, M. Brown, and R. Szeliski. 2007. City-Scale Location Recognition. In CVPR.

[23]

S. Song and M. Chandraker. 2014. Robust Scale Estimation in Real-Time Monocular SFM for Autonomous Driving. In CVPR.

Digital Library

[24]

G. Vaca-Castano, A. Zamir, and M. Shah. 2012. City scale geo-spatial trajectory estimation of a moving camera. In CVPR.

Digital Library

[25]

S. Wang, S. Fidler, and R. Urtasun. 2015. Lost Shopping! Monocular Localization in Large Indoor Spaces. In CVPR.

[26]

B. Williams, G. Klein, and I. Reid. 2011. Automatic Relocalization and Loop Closing for Real-Time Monocular SLAM. TPAMI (2011).

Digital Library

[27]

A. Zamir and M. Shah. 2010. Accurate Image Localization Based on Google Maps Street View. In ECCV.

Digital Library

[28]

J. Zhang and S. Singh. 2014. LOAM: Lidar Odometry and mapping in real-time. In Robotics: science and Systems conference.

Index Terms

Visual Self-localization via Inferring View-to-Map Correspondences
1. Computing methodologies
  1. Artificial intelligence
    1. Computer vision
  2. Machine learning

Recommendations

Semi-dense Visual Odometry for a Monocular Camera
ICCV '13: Proceedings of the 2013 IEEE International Conference on Computer Vision

We propose a fundamentally novel approach to real-time visual odometry for a monocular camera. It allows to benefit from the simplicity and accuracy of dense tracking - which does not depend on visual features - while running in real-time on a CPU. The ...
Selective visual odometry for accurate AUV localization

In this paper we present a stereo visual odometry system developed for autonomous underwater vehicle localization tasks. The main idea is to make use of only highly reliable data in the estimation process, employing a robust keypoint tracking approach ...
Real-time Quadrifocal Visual Odometry

In this paper we describe a new image-based approach to tracking the six-degree-of-freedom trajectory of a stereo camera pair. The proposed technique estimates the pose and subsequently the dense pixel matching between temporal image pairs in a sequence ...

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences

VSCC '17: Proceedings of the Workshop on Visual Analysis in Smart and Connected Communities

October 2017

58 pages

ISBN:9781450355063

DOI:10.1145/3132734

Program Chairs:
Xiaobai Liu
San Diego State University, USA
,
Yadong Mu
Peking University, China
,
Yu-Gang Jiang
Fudan University, China
,
Jiebo Luo
University of Rochester, USA

Copyright © 2017 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

SIGMM: ACM Special Interest Group on Multimedia

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 23 October 2017

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article

Conference

MM '17

Sponsor:

SIGMM

MM '17: ACM Multimedia Conference

October 23, 2017

California, Mountain View, USA

Acceptance Rates

VSCC '17 Paper Acceptance Rate 6 of 12 submissions, 50%;

Overall Acceptance Rate 6 of 12 submissions, 50%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

0
Total Citations
64
Total Downloads

Downloads (Last 12 months)3
Downloads (Last 6 weeks)0

Reflects downloads up to 03 Nov 2024

Other Metrics

View Author Metrics

Citations

View Options

Get Access

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Media

Figures

Other

Tables

View Table of Contents