Multi-modal Sensor Fusion for Auto Driving Perception: A Survey

Huang, Keli; Shi, Botian; Li, Xiang; Li, Xin; Huang, Siyuan; Li, Yikang

Computer Science > Computer Vision and Pattern Recognition

arXiv:2202.02703 (cs)

[Submitted on 6 Feb 2022 (v1), last revised 16 Dec 2024 (this version, v3)]

Title:Multi-modal Sensor Fusion for Auto Driving Perception: A Survey

Authors:Keli Huang, Botian Shi, Xiang Li, Xin Li, Siyuan Huang, Yikang Li

View PDF HTML (experimental)

Abstract:Multi-modal fusion is a fundamental task for the perception of an autonomous driving system, which has recently intrigued many researchers. However, achieving a rather good performance is not an easy task due to the noisy raw data, underutilized information, and the misalignment of multi-modal sensors. In this paper, we provide a literature review of the existing multi-modal-based methods for perception tasks in autonomous driving. Generally, we make a detailed analysis including over 50 papers leveraging perception sensors including LiDAR and camera trying to solve object detection and semantic segmentation tasks. Different from traditional fusion methodology for categorizing fusion models, we propose an innovative way that divides them into two major classes, four minor classes by a more reasonable taxonomy in the view of the fusion stage. Moreover, we dive deep into the current fusion methods, focusing on the remaining problems and open-up discussions on the potential research opportunities. In conclusion, what we expect to do in this paper is to present a new taxonomy of multi-modal fusion methods for the autonomous driving perception tasks and provoke thoughts of the fusion-based techniques in the future.

Comments:	14 pages, 8 figures
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
ACM classes:	I.2.10
Cite as:	arXiv:2202.02703 [cs.CV]
	(or arXiv:2202.02703v3 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2202.02703

Submission history

From: Keli Huang [view email]
[v1] Sun, 6 Feb 2022 04:18:45 UTC (2,548 KB)
[v2] Sun, 27 Feb 2022 05:48:08 UTC (2,545 KB)
[v3] Mon, 16 Dec 2024 23:42:56 UTC (2,547 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Multi-modal Sensor Fusion for Auto Driving Perception: A Survey

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Multi-modal Sensor Fusion for Auto Driving Perception: A Survey

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators