Curiosity-driven 3D Object Detection Without Labels

Griffiths, David; Boehm, Jan; Ritschel, Tobias

Computer Science > Computer Vision and Pattern Recognition

arXiv:2012.01230 (cs)

[Submitted on 2 Dec 2020 (v1), last revised 15 Oct 2021 (this version, v3)]

Abstract: In this paper we set out to solve the task of 6-DOF 3D object detection from 2D images, where the only supervision is a geometric representation of the objects we aim to find. In doing so, we remove the need for 6-DOF labels (i.e., position, orientation, etc.), allowing our network to be trained on unlabeled images in a self-supervised manner. We achieve this through a neural network that learns an explicit scene parameterization, which is subsequently passed into a differentiable renderer. We analyze why analysis-by-synthesis-like losses for supervising 3D scene structure through differentiable rendering are not practical: they almost always get stuck in local minima caused by visual ambiguities. This can be overcome by a novel form of training in which an additional network steers the optimization itself to explore the entire parameter space, i.e., to be curious, and hence to resolve those ambiguities and find workable minima.
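
The local-minimum problem named in the abstract can be illustrated with a toy sketch. Everything below is an assumption made for illustration and is not taken from the paper: the "renderer" is a hypothetical 1D signal over viewing angle for an almost-symmetric object, the single scene parameter is a rotation `theta`, and the loss is a plain mean squared image difference. Gradient descent on this loss recovers the true rotation from a nearby start but settles in the flipped pose from a start near the half-turn ambiguity. The final step uses evenly spaced restarts, a crude stand-in for exploring the parameter space; it is not the curiosity network the authors describe.

```python
import numpy as np

# Sample positions of the toy 1D "image" (256 angles around the object).
PHI = np.linspace(0.0, 2.0 * np.pi, 256, endpoint=False)

def render(theta):
    """Hypothetical toy renderer: an almost 180-degree-symmetric appearance.

    The strong cos(2x) term looks identical after a half turn; the weak
    cos(x) term is the only cue that breaks the ambiguity.
    """
    return np.cos(2.0 * (PHI - theta)) + 0.3 * np.cos(PHI - theta)

def loss(theta, target):
    """Analysis-by-synthesis style loss: mean squared image difference."""
    return float(np.mean((render(theta) - target) ** 2))

def descend(theta, target, lr=0.1, steps=500, eps=1e-4):
    """Plain gradient descent using a central finite-difference gradient."""
    for _ in range(steps):
        grad = (loss(theta + eps, target) - loss(theta - eps, target)) / (2.0 * eps)
        theta -= lr * grad
    return theta

TRUE_THETA = 0.5
TARGET = render(TRUE_THETA)

# A start close to the true pose converges to it.
good = descend(TRUE_THETA + 1.0, TARGET)

# A start near the flipped pose converges to the ambiguous local minimum.
stuck = descend(TRUE_THETA + 2.8, TARGET)

# Stand-in for exploration (NOT the paper's method): restart from evenly
# spaced rotations and keep the hypothesis with the lowest loss.
starts = np.linspace(0.0, 2.0 * np.pi, 8, endpoint=False)
candidates = [descend(s, TARGET) for s in starts]
best = min(candidates, key=lambda t: loss(t, TARGET))

print("near start :", good, loss(good, TARGET))
print("far start  :", stuck, loss(stuck, TARGET))
print("multi-start:", best % (2.0 * np.pi), loss(best, TARGET))
```

In this toy setting the flipped pose is a genuine local minimum because the loss reduces to `(1 - cos 2d) + 0.09 (1 - cos d)` for rotation error `d`, which has zero gradient and positive curvature at `d = pi`. Exhaustive restarts only work here because there is one parameter; a full 6-DOF scene parameterization is far larger, which is presumably why a learned exploration strategy is of interest.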

Comments:	19 pages, 17 figures
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2012.01230 [cs.CV]
	(or arXiv:2012.01230v3 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2012.01230

Submission history

From: David Griffiths Mr
[v1] Wed, 2 Dec 2020 14:17:16 UTC (25,683 KB)
[v2] Fri, 19 Feb 2021 13:55:40 UTC (4,231 KB)
[v3] Fri, 15 Oct 2021 19:11:40 UTC (21,624 KB)
