Robust RGB-D Fusion for Saliency Detection

Wu, Zongwei; Gobichettipalayam, Shriarulmozhivarman; Tamadazte, Brahim; Allibert, Guillaume; Paudel, Danda Pani; Demonceaux, Cédric


arXiv:2208.01762 (cs)

[Submitted on 2 Aug 2022 (v1), last revised 30 Aug 2022 (this version, v2)]



Abstract: Efficiently exploiting multi-modal inputs for accurate RGB-D saliency detection is a topic of high interest. Most existing works leverage cross-modal interactions to fuse the RGB and depth streams and thereby enhance intermediate features. In this process, one practical issue has not yet been fully considered: the low quality of the available depth maps. In this work, we aim for RGB-D saliency detection that is robust to low-quality depth, which primarily appears in two forms: inaccuracy due to noise and misalignment with the RGB image. To this end, we propose a robust RGB-D fusion method that benefits from two attention mechanisms: (1) layer-wise attention and (2) trident spatial attention. On the one hand, layer-wise attention (LWA) learns the trade-off between early and late fusion of RGB and depth features, depending on the depth accuracy. On the other hand, trident spatial attention (TSA) aggregates features from a wider spatial context to address the depth misalignment problem. The proposed LWA and TSA mechanisms allow us to efficiently exploit the multi-modal inputs for saliency detection while remaining robust to low-quality depth. Our experiments on five benchmark datasets demonstrate that the proposed fusion method performs consistently better than state-of-the-art fusion alternatives.
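The two ideas in the abstract can be illustrated with a toy sketch in plain Python. This is not the authors' implementation. The function names (`layer_gate`, `multi_scale_context`, `fuse`), the cosine-similarity gate, and the three averaging windows are all assumptions made for illustration; the paper learns its attention weights inside a network. The sketch only shows the general shape of the approach: a scalar gate decides how much depth to mix in at a given layer, and depth is first aggregated over several spatial extents so that a small shift relative to RGB matters less.

```python
import math

def layer_gate(rgb_feat, depth_feat):
    """Hypothetical stand-in for a layer-wise gate in (0, 1).

    Uses the cosine similarity between the two feature vectors,
    squashed by a sigmoid. A learned gate would replace this.
    """
    dot = sum(r * d for r, d in zip(rgb_feat, depth_feat))
    norm = (math.sqrt(sum(r * r for r in rgb_feat))
            * math.sqrt(sum(d * d for d in depth_feat)))
    cos = dot / norm if norm > 0 else 0.0
    return 1.0 / (1.0 + math.exp(-cos))

def multi_scale_context(signal, radii=(0, 1, 2)):
    """Average a 1-D signal over three window sizes (three branches).

    Each output position is the mean of the per-branch window means,
    so a depth edge shifted by a pixel or two still contributes.
    """
    n = len(signal)
    out = []
    for i in range(n):
        branch_means = []
        for r in radii:
            window = signal[max(0, i - r): min(n, i + r + 1)]
            branch_means.append(sum(window) / len(window))
        out.append(sum(branch_means) / len(branch_means))
    return out

def fuse(rgb_feat, depth_feat):
    """Add gated, context-aggregated depth features to RGB features."""
    gate = layer_gate(rgb_feat, depth_feat)
    depth_ctx = multi_scale_context(depth_feat)
    return [r + gate * d for r, d in zip(rgb_feat, depth_ctx)]
```

In this toy version, an all-zero depth vector leaves the RGB features unchanged, and a depth vector that disagrees with RGB receives a gate below 0.5, which loosely mirrors the stated goal of relying less on depth when it is unreliable.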

Comments:	Accepted to 3DV 2022
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2208.01762 [cs.CV]
	(or arXiv:2208.01762v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2208.01762

Submission history

From: Zongwei Wu
[v1] Tue, 2 Aug 2022 21:23:00 UTC (5,276 KB)
[v2] Tue, 30 Aug 2022 15:17:06 UTC (5,274 KB)
