LiCROcc: Teach Radar for Accurate Semantic Occupancy Prediction using LiDAR and Camera

Ma, Yukai; Mei, Jianbiao; Yang, Xuemeng; Wen, Licheng; Xu, Weihua; Zhang, Jiangning; Shi, Botian; Liu, Yong; Zuo, Xingxing

Computer Science > Computer Vision and Pattern Recognition

arXiv:2407.16197v1 (cs)

[Submitted on 23 Jul 2024]

Title:LiCROcc: Teach Radar for Accurate Semantic Occupancy Prediction using LiDAR and Camera

Authors:Yukai Ma, Jianbiao Mei, Xuemeng Yang, Licheng Wen, Weihua Xu, Jiangning Zhang, Botian Shi, Yong Liu, Xingxing Zuo

View PDF HTML (experimental)

Abstract:Semantic Scene Completion (SSC) is pivotal in autonomous driving perception, frequently confronted with the complexities of weather and illumination changes. The long-term strategy involves fusing multi-modal information to bolster the system's robustness. Radar, increasingly utilized for 3D target detection, is gradually replacing LiDAR in autonomous driving applications, offering a robust sensing alternative. In this paper, we focus on the potential of 3D radar in semantic scene completion, pioneering cross-modal refinement techniques for improved robustness against weather and illumination changes, and enhancing SSC this http URL model architecture, we propose a three-stage tight fusion approach on BEV to realize a fusion framework for point clouds and images. Based on this foundation, we designed three cross-modal distillation modules-CMRD, BRD, and PDD. Our approach enhances the performance in both radar-only (R-LiCROcc) and radar-camera (RC-LiCROcc) settings by distilling to them the rich semantic and structural information of the fused features of LiDAR and camera. Finally, our LC-Fusion (teacher model), R-LiCROcc and RC-LiCROcc achieve the best performance on the nuScenes-Occupancy dataset, with mIOU exceeding the baseline by 22.9%, 44.1%, and 15.5%, respectively. The project page is available at this https URL.

Subjects:	Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
Cite as:	arXiv:2407.16197 [cs.CV]
	(or arXiv:2407.16197v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2407.16197

Submission history

From: Yukai Ma [view email]
[v1] Tue, 23 Jul 2024 05:53:05 UTC (4,978 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:LiCROcc: Teach Radar for Accurate Semantic Occupancy Prediction using LiDAR and Camera

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:LiCROcc: Teach Radar for Accurate Semantic Occupancy Prediction using LiDAR and Camera

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators