Auto-Rectify Network for Unsupervised Indoor Depth Estimation

Bian, Jia-Wang; Zhan, Huangying; Wang, Naiyan; Chin, Tat-Jun; Shen, Chunhua; Reid, Ian

arXiv:2006.02708 (cs)

[Submitted on 4 Jun 2020 (v1), last revised 14 Dec 2021 (this version, v2)]

Abstract: Single-view depth estimation using CNNs trained from unlabelled videos has shown significant promise. However, excellent results have mostly been obtained in street-scene driving scenarios, and such methods often fail in other settings, particularly indoor videos taken by handheld devices. In this work, we establish that the complex ego-motions exhibited in handheld settings are a critical obstacle for learning depth. Our fundamental analysis suggests that rotation behaves as noise during training, whereas translation (the baseline) provides the supervision signal. To address this challenge, we propose a data pre-processing method that rectifies training images by removing their relative rotations for effective learning. The significantly improved performance validates our motivation. Towards end-to-end learning without requiring pre-processing, we propose an Auto-Rectify Network with novel loss functions, which can automatically learn to rectify images during training. Consequently, our results outperform the previous unsupervised SOTA method by a large margin on the challenging NYUv2 dataset. We also demonstrate the generalization of our trained model on ScanNet and Make3D, and the universality of our proposed learning method on the 7-Scenes and KITTI datasets.
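
The claim that rotation carries no depth signal while translation does can be checked with a few lines of pinhole-camera geometry. The sketch below is an illustration only, not the authors' code or their pre-processing pipeline: the intrinsics `K`, the pixel, the 5-degree rotation, and the 0.1 m baseline are all made-up values, and the helper names `project`, `rot_y`, and `rectify` are hypothetical. It reprojects one pixel at several depths under a pure rotation and under a pure translation, then removes a known rotation with the standard homography K R^T K^-1. The abstract does not say whether the proposed rectification uses exactly this warp.

```python
import numpy as np

def project(K, R, t, p, depth):
    """Reproject pixel p = (u, v) at the given depth from view a into view b.

    The 3D point in view a is X = depth * K^-1 [u, v, 1]^T,
    and view b observes it at R @ X + t.
    """
    ray = np.linalg.inv(K) @ np.array([p[0], p[1], 1.0])
    x = K @ (R @ (depth * ray) + t)
    return x[:2] / x[2]

def rot_y(angle):
    """Rotation matrix about the camera y-axis (angle in radians)."""
    c, s = np.cos(angle), np.sin(angle)
    return np.array([[c, 0.0, s], [0.0, 1.0, 0.0], [-s, 0.0, c]])

def rectify(K, R, p_b):
    """Undo a known relative rotation R by warping with H = K R^T K^-1."""
    H = K @ R.T @ np.linalg.inv(K)
    x = H @ np.array([p_b[0], p_b[1], 1.0])
    return x[:2] / x[2]

# Assumed values, chosen only for illustration.
K = np.array([[500.0, 0.0, 320.0],
              [0.0, 500.0, 240.0],
              [0.0, 0.0, 1.0]])
p = (400.0, 260.0)
depths = [1.0, 2.0, 5.0]
R = rot_y(np.deg2rad(5.0))
t = np.array([0.1, 0.0, 0.0])

# Pure rotation: the reprojected pixel is identical for every depth,
# so a photometric loss on it gives no information about depth.
pure_rotation = [project(K, R, np.zeros(3), p, d) for d in depths]

# Pure translation: the pixel shifts by f * t_x / depth, so the shift
# depends on depth and can supervise it.
pure_translation = [project(K, np.eye(3), t, p, d) for d in depths]

# Rotation plus translation, then rectified: the result matches a
# rotation-free pair whose baseline is R^T t.
rectified = [rectify(K, R, project(K, R, t, p, d)) for d in depths]
rotation_free = [project(K, np.eye(3), R.T @ t, p, d) for d in depths]
```

With these numbers the translated pixel lands at u = 400 + 50 / depth, while the rotated pixel stays at the same location for all three depths.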

Comments:	Accepted to TPAMI. Find code at this https URL
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2006.02708 [cs.CV]
	(or arXiv:2006.02708v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2006.02708

Submission history

From: Jiawang Bian
[v1] Thu, 4 Jun 2020 08:59:17 UTC (1,643 KB)
[v2] Tue, 14 Dec 2021 06:17:08 UTC (7,832 KB)
