An Extremely Effective Spatial Pyramid and Pixel Shuffle Upsampling Decoder for Multiscale Monocular Depth Estimation

Comput Intell Neurosci. 2022 Aug 1:2022:4668001. doi: 10.1155/2022/4668001. eCollection 2022.

Abstract

To estimate the accurate depth from a single image, we proposed a novel and effective depth estimation architecture to solve the problem of missing and blurred contours of small objects in the depth map. The architecture consists of Extremely Effective Spatial Pyramid modules (EESP) and Pixel Shuffle upsampling Decoders (PSD). The results of this study show that multilevel information and the upsampling method in the decoders are essential for recovering the accurate depth map. Through the model we proposed, competitive performance compared with state-of-the-art methods in terms of reconstruction of object boundaries and the detection rate of small objects has been demonstrated. Our approach has wide applications in higher-level visual tasks, including 3D reconstruction and autonomous driving.