MonoLoco: Monocular 3D Pedestrian Localization and Uncertainty Estimation

Bertoni, Lorenzo; Kreiss, Sven; Alahi, Alexandre

arXiv:1906.06059 (cs)

[Submitted on 14 Jun 2019 (v1), last revised 20 Aug 2019 (this version, v2)]

Abstract: We tackle the fundamentally ill-posed problem of 3D human localization from monocular RGB images. Driven by the limitation of neural networks outputting point estimates, we address the ambiguity in the task by predicting confidence intervals through a loss function based on the Laplace distribution. Our architecture is a light-weight feed-forward neural network that predicts 3D locations and corresponding confidence intervals given 2D human poses. The design is particularly well suited for small training data, cross-dataset generalization, and real-time applications. Our experiments show that we (i) outperform state-of-the-art results on KITTI and nuScenes datasets, (ii) even outperform a stereo-based method for far-away pedestrians, and (iii) estimate meaningful confidence intervals. We further share insights on our model of uncertainty in cases of limited observations and out-of-distribution samples.
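
To make the idea of a Laplace-based loss concrete, the following is a minimal sketch and not the authors' implementation. It assumes the network outputs a location estimate `mu` and a log-spread `log_b` for a single scalar target such as distance; the function names, the log-parameterization, and the example numbers are all hypothetical. It shows the negative log-likelihood of a Laplace distribution and how a symmetric confidence interval can be read off the predicted spread.

```python
import math


def laplace_nll(y_true, mu, log_b):
    """Negative log-likelihood of y_true under Laplace(mu, b), with b = exp(log_b).

    Predicting log_b instead of b keeps the spread positive (an assumption of
    this sketch, not something stated in the abstract).
    """
    b = math.exp(log_b)
    return abs(y_true - mu) / b + log_b + math.log(2.0)


def laplace_interval(mu, log_b, coverage=0.95):
    """Symmetric interval around mu containing `coverage` of the Laplace mass.

    For a Laplace distribution, P(|x - mu| <= k) = 1 - exp(-k / b).
    """
    b = math.exp(log_b)
    half_width = -b * math.log(1.0 - coverage)
    return mu - half_width, mu + half_width


# Hypothetical numbers: ground-truth distance 20 m, predicted 21 m, spread b = 1 m.
loss = laplace_nll(20.0, 21.0, 0.0)
low, high = laplace_interval(21.0, 0.0)
print(f"loss={loss:.4f}, 95% interval=({low:.2f}, {high:.2f})")
```

Minimizing this quantity rewards a small spread when the point estimate is accurate and a larger spread when it is not, which is one way a network can learn to report its own uncertainty alongside a location.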

Comments: International Conference on Computer Vision (ICCV) 2019
Subjects: Computer Vision and Pattern Recognition (cs.CV)
Cite as: arXiv:1906.06059 [cs.CV]
(or arXiv:1906.06059v2 [cs.CV] for this version)
https://doi.org/10.48550/arXiv.1906.06059

Submission history

From: Lorenzo Bertoni
[v1] Fri, 14 Jun 2019 07:39:03 UTC (5,485 KB)
[v2] Tue, 20 Aug 2019 15:43:44 UTC (5,487 KB)
