NeuMap: Neural Coordinate Mapping by Auto-Transdecoder for Camera Localization

Tang, Shitao; Tang, Sicong; Tagliasacchi, Andrea; Tan, Ping; Furukawa, Yasutaka

arXiv:2211.11177 (cs)

[Submitted on 21 Nov 2022 (v1), last revised 26 Mar 2023 (this version, v2)]

Abstract: This paper presents an end-to-end neural mapping method for camera localization, dubbed NeuMap, which encodes a whole scene into a grid of latent codes, from which a Transformer-based auto-decoder regresses the 3D coordinates of query pixels. State-of-the-art feature matching methods require each scene to be stored as a 3D point cloud with per-point features, consuming several gigabytes of storage per scene. While compression is possible, performance drops significantly at high compression rates. Conversely, coordinate regression methods achieve high compression by storing scene information in a neural network but suffer from reduced robustness. NeuMap combines the advantages of both approaches by utilizing 1) learnable latent codes for efficient scene representation and 2) a scene-agnostic Transformer-based auto-decoder to infer coordinates for query pixels. This scene-agnostic network design learns robust matching priors from large-scale data and enables rapid optimization of codes for new scenes while keeping the network weights fixed. Extensive evaluations on five benchmarks show that NeuMap significantly outperforms other coordinate regression methods and achieves performance comparable to feature matching methods while requiring a much smaller scene representation. For example, NeuMap achieves 39.1% accuracy on the Aachen night benchmark with only 6MB of data, whereas alternative methods require 100MB or several gigabytes and fail completely under high compression settings. The code is available at this https URL
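The auto-decoder idea in the abstract (fit per-scene latent codes while the shared decoder weights stay fixed) can be illustrated with a deliberately tiny sketch. This is not the NeuMap architecture: a frozen linear map stands in for the Transformer, a single code vector stands in for the grid of latent codes, and every name here (`WEIGHTS`, `TRUE_CODE`, `SAMPLES`, `decode`, `fit_code`, `mean_sq_error`) is hypothetical and chosen only for illustration.

```python
# Toy auto-decoder sketch (assumption: a linear decoder replaces the Transformer).
# Only the latent code is optimised; the decoder weights are never updated.

# Frozen decoder weights: 3 output coordinates, inputs = 3 code dims + 2 query dims.
WEIGHTS = [
    [1.0, 0.0, 0.0, 0.5, -0.2],
    [0.0, 1.0, 0.0, 0.1, 0.3],
    [0.0, 0.0, 1.0, -0.4, 0.2],
]

def decode(weights, code, query):
    """Map a latent code plus a query feature to a 3D coordinate."""
    inputs = code + query  # list concatenation
    return [sum(w * x for w, x in zip(row, inputs)) for row in weights]

def mean_sq_error(weights, code, samples):
    """Mean squared 3D error of the decoder over (query, target) pairs."""
    total = 0.0
    for query, target in samples:
        pred = decode(weights, code, query)
        total += sum((p - t) ** 2 for p, t in zip(pred, target))
    return total / len(samples)

def fit_code(weights, samples, code_dim, steps=500, lr=0.05):
    """Gradient descent on the latent code only, starting from zeros."""
    code = [0.0] * code_dim
    for _ in range(steps):
        grad = [0.0] * code_dim
        for query, target in samples:
            pred = decode(weights, code, query)
            for k in range(3):
                err = pred[k] - target[k]
                for j in range(code_dim):
                    # d(err^2)/d(code_j) = 2 * err * weights[k][j]
                    grad[j] += 2.0 * err * weights[k][j] / len(samples)
        code = [c - lr * g for c, g in zip(code, grad)]
    return code

# A synthetic "new scene": targets generated from a hidden code, without noise.
TRUE_CODE = [1.0, -2.0, 0.5]
QUERIES = [[0.0, 1.0], [1.0, 0.0], [0.5, -0.5], [-1.0, 2.0]]
SAMPLES = [(q, decode(WEIGHTS, TRUE_CODE, q)) for q in QUERIES]

fitted = fit_code(WEIGHTS, SAMPLES, code_dim=3)
print("fitted code:", [round(c, 4) for c in fitted])
print("residual error:", mean_sq_error(WEIGHTS, fitted, SAMPLES))
```

In this sketch the scene is represented entirely by the fitted code vector, which is the sense in which an auto-decoder can keep per-scene storage small; the real method's code layout, loss, and decoder are described in the paper, not here.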

Comments:	CVPR 2023
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2211.11177 [cs.CV]
	(or arXiv:2211.11177v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2211.11177

Submission history

From: Shitao Tang
[v1] Mon, 21 Nov 2022 04:46:22 UTC (6,892 KB)
[v2] Sun, 26 Mar 2023 06:22:15 UTC (13,947 KB)
