Analysis of NaN Divergence in Training Monocular Depth Estimation Model

Kim, Bum Jun; Jang, Hyeonah; Kim, Sang Woo

arXiv:2311.03938 (cs)

[Submitted on 7 Nov 2023]

Abstract: Recent advances in deep learning have enabled highly accurate monocular depth estimation models. However, when training a monocular depth estimation network, practitioners and researchers have observed not-a-number (NaN) loss, which disrupts gradient descent optimization. Although several practitioners have reported the stochastic, seemingly mysterious occurrence of NaN loss during training, its root cause has not been discussed in the literature. This study presents an in-depth analysis of NaN loss during the training of a monocular depth estimation network and identifies three types of vulnerabilities that cause it: 1) the use of a square root loss, which leads to an unstable gradient; 2) the log-sigmoid function, which exhibits numerical stability issues; and 3) certain variance implementations, which yield incorrect computations. For each vulnerability, the occurrence of NaN loss is demonstrated and practical guidelines to prevent it are presented. Experiments showed that following these guidelines improves both optimization stability and performance on monocular depth estimation.
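The three vulnerability types named in the abstract can be illustrated with a minimal standard-library Python sketch. It is not the authors' code or their guidelines: the function names, the test values, and the mitigations shown (an epsilon inside the square root, a rearranged log-sigmoid, a two-pass variance) are assumptions chosen here as common, generic remedies. Note also that plain Python raises an exception on overflow or division by zero, whereas array libraries typically return inf, which can later turn into NaN.

```python
import math


# 1) Square root: d/dx sqrt(x) = 1 / (2 * sqrt(x)) grows without bound as x -> 0+.
def sqrt_grad(x, eps=0.0):
    """Derivative of sqrt(x + eps) with respect to x (eps is a hypothetical guard)."""
    return 0.5 / math.sqrt(x + eps)


# 2) Log-sigmoid: the textbook form overflows inside exp() for very negative x.
def naive_log_sigmoid(x):
    return math.log(1.0 / (1.0 + math.exp(-x)))


def stable_log_sigmoid(x):
    # Identity: log(sigmoid(x)) = min(x, 0) - log(1 + exp(-|x|)).
    # exp() only ever receives a non-positive argument, so it cannot overflow.
    return min(x, 0.0) - math.log1p(math.exp(-abs(x)))


# 3) Variance: E[x^2] - E[x]^2 subtracts two huge, nearly equal numbers.
def naive_variance(xs):
    n = len(xs)
    mean = sum(xs) / n
    mean_sq = sum(x * x for x in xs) / n
    return mean_sq - mean * mean


def two_pass_variance(xs):
    # Subtract the mean first, then square: no catastrophic cancellation.
    n = len(xs)
    mean = sum(xs) / n
    return sum((x - mean) ** 2 for x in xs) / n


if __name__ == "__main__":
    print("sqrt gradient near zero:", sqrt_grad(1e-12))
    print("sqrt gradient with eps :", sqrt_grad(1e-12, eps=1e-6))

    try:
        naive_log_sigmoid(-800.0)
    except OverflowError as err:
        print("naive log-sigmoid failed:", err)
    print("stable log-sigmoid:", stable_log_sigmoid(-800.0))

    data = [1e8, 1e8 + 1.0, 1e8 + 2.0]  # true population variance is 2/3
    print("naive variance   :", naive_variance(data))
    print("two-pass variance:", two_pass_variance(data))
```

On the sample data the one-pass formula returns a value far from the true variance of 2/3, while the two-pass version recovers it. With less favorable rounding the one-pass result can even become negative, which makes a subsequent square root undefined.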

Comments:	10 pages, 3 figures
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2311.03938 [cs.CV]
	(or arXiv:2311.03938v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2311.03938

Submission history

From: Bum Jun Kim
[v1] Tue, 7 Nov 2023 12:19:30 UTC (947 KB)