CRT-Fusion: Camera, Radar, Temporal Fusion Using Motion Information for 3D Object Detection
Neural Information Processing Systems
Accurate and robust 3D object detection is a critical component in autonomous vehicles and robotics. While recent radar-camera fusion methods have made significant progress by fusing information in the bird's-eye view (BEV) representation, they often struggle to effectively capture the motion of dynamic objects, leading to limited performance in real-world scenarios. In this paper, we introduce CRT-Fusion, a novel framework that integrates temporal information into radar-camera fusion to address this challenge. Our approach comprises three key modules: Multi-View Fusion (MVF), Motion Feature Estimator (MFE), and Motion Guided Temporal Fusion (MGTF).
Dec-27-2025, 06:18:01 GMT