skip to main content
10.1145/3132734.3132740acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
research-article

Visual Self-localization via Inferring View-to-Map Correspondences

Published: 23 October 2017 Publication History

Abstract

This paper poses self-localization of a moving camera as a view-to-map correspondence problem, utilizing a large set of ground-view maps rendered by existing mapping tool, like Google Earth. To address the viewpoint and appearance differences between the rendered maps and camera views, we present a unified solution that comprises of three components. Firstly, we represent each rendered map with a set of view-dependent feature patches and discriminatively train an appearance model for each patch. With these models we cast the view-to-map correspondence task as a ranking task. Secondly, we introduce a programming based method to discover feature correspondences over consecutive frames which are used to estimate visual odometry of camera. The programming formula is regularized with both flow type constraints and spatial smoothness constraints to account for scene noises. Thirdly, we present a joint probabilistic formula to integrate both visual odometry and view-to-map correspondences for reasoning with uncertainties. Evaluations with comparisons over challenging monocular videos demonstrated that our method clearly outperforms the alternative methods. In particular, our method is capable of localizing a moving camera with sub-meter accuracies in the scenario of about 13,000 square meters.

References

[1]
M. Aubry, B. Russell, and J. Sivic. 2014. Painting-to-3D model alignment via discriminative visual elements. SIGGRAPH (2014).
[2]
H. Badino, A. Yamamoto, and T. Kanade. 2013. Visual odometry by multiple-frame feature integration. International workshop on Computer Vision for Autonomous Driving at ICCV.
[3]
H. Bay, A. Ess T. Tuytelaars, and L. Van Gool. 2008. SURF: Speeded Up Robust Features. CVIU, Vol. 110, 3 (2008), 346--359.
[4]
J. Berclaz, F. Fleuret, E. Turetken, and P. Fual. 2011. Multiple object tracking using k-shortest paths optimization. TPAMI, Vol. 33, 9 (2011), 1806--1819.
[5]
M. Brubaker, A. Geiger, and R. Urtasun. 2013. Lost! Leveraging the Crowd for Probabilistic Visual Self-Localization. CVPR.
[6]
N. Dalal and B. Triggs. 2005. Histograms of Oriented Gradients for Human Detection. CVPR.
[7]
A. J. Davison. 2003. Real-time Simultaneous Localisation and Mapping with a Single Camera. ICCV.
[8]
P. Felzenszwalb, R. Girshick, D. McAllester, and D. Ramanan. 2010. Object detection with discriminatively trained part based models. TPAMI (2010).
[9]
P. Felzenszwalb and D. Huttenlocher. 2005. Efficient Belief Propagation for early vision. IJCV (2005).
[10]
A. Geiger, J. Ziegler, and C. Stiller. 2011. StereoScan: dense 3D reconstruction in real-time. Intelligent Vehicles Symposium.
[11]
J. Hays and A. Efros. 2008. im2gps: estimating geographic information from a single image. CVPR.
[12]
D. Fox J.-S. Gutmann, W. Burgard and K. Konolige. 1998. An experimental comparison of localization methods. ICIRS.
[13]
S. Khan and M. Shah. 2009. Tracking Multiple Occluding People by Localizing on Multiple Scene Planes. TPAMI, Vol. 31, 3 (2009), 505--519.
[14]
W. K. Leow, C.-C. Chiang, and Y.-P. Hung. 2008. Localization and Mapping of Surveillance Cameras in City Map MM.
[15]
Y. Li, N. Snavely, D. Huttenlocher, and P. Fua. 2012. Worldwide pose estimation using 3d point clouds. ECCV.
[16]
C. Liu, J. Yuen, and A. Torralba. 2009. SIFT Flow: dense correspondence across scenes and its applications. TPAMI (2009).
[17]
T. Malisiewicz, A. Gupta, and A. A. Efros. 2011. Ensemble of exemplar-svms for object detection and beyond. ICCV.
[18]
C. Mei, G. Sibley, M. Cummins, P. Newman, and I. Reid. 2010. A system for large-scale mapping in constant-time using stereos. IJCV (2010).
[19]
P. Merriaux, Y. Dupuis, and P. Vasseur. 2015. Fast and robust vehicle positioning on graph-based representation of drivable maps. ICRA.
[20]
R. Mohedano, A. Cavallaro, and N. Garcia. 2014. Camera Localization Using Trajectories and Maps. TPAMI (2014).
[21]
O. Pink, F. Moosmann, and A. Bachmann. 2009. Visual features for vehicle localization and ego-motion estimation. In IV.
[22]
G. Schindler, M. Brown, and R. Szeliski. 2007. City-Scale Location Recognition. In CVPR.
[23]
S. Song and M. Chandraker. 2014. Robust Scale Estimation in Real-Time Monocular SFM for Autonomous Driving. In CVPR.
[24]
G. Vaca-Castano, A. Zamir, and M. Shah. 2012. City scale geo-spatial trajectory estimation of a moving camera. In CVPR.
[25]
S. Wang, S. Fidler, and R. Urtasun. 2015. Lost Shopping! Monocular Localization in Large Indoor Spaces. In CVPR.
[26]
B. Williams, G. Klein, and I. Reid. 2011. Automatic Relocalization and Loop Closing for Real-Time Monocular SLAM. TPAMI (2011).
[27]
A. Zamir and M. Shah. 2010. Accurate Image Localization Based on Google Maps Street View. In ECCV.
[28]
J. Zhang and S. Singh. 2014. LOAM: Lidar Odometry and mapping in real-time. In Robotics: science and Systems conference.

Index Terms

  1. Visual Self-localization via Inferring View-to-Map Correspondences

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      VSCC '17: Proceedings of the Workshop on Visual Analysis in Smart and Connected Communities
      October 2017
      58 pages
      ISBN:9781450355063
      DOI:10.1145/3132734
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 23 October 2017

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. localization
      2. maps
      3. visual odometry

      Qualifiers

      • Research-article

      Conference

      MM '17
      Sponsor:
      MM '17: ACM Multimedia Conference
      October 23, 2017
      California, Mountain View, USA

      Acceptance Rates

      VSCC '17 Paper Acceptance Rate 6 of 12 submissions, 50%;
      Overall Acceptance Rate 6 of 12 submissions, 50%

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • 0
        Total Citations
      • 64
        Total Downloads
      • Downloads (Last 12 months)3
      • Downloads (Last 6 weeks)0
      Reflects downloads up to 03 Nov 2024

      Other Metrics

      Citations

      View Options

      Get Access

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media