Unleash the Potential of Image Branch for Cross-modal 3D Object Detection
–Neural Information Processing Systems
To achieve reliable and precise scene understanding, autonomous vehicles typically incorporate multiple sensing modalities to capitalize on their complementary attributes. However, existing cross-modal 3D detectors do not fully utilize the image domain information to address the bottleneck issues of the LiDAR-based detectors. This paper presents a new cross-modal 3D object detector, namely UPIDet, which aims to unleash the potential of the image branch from two aspects. First, UPIDet introduces a new 2D auxiliary task called normalized local coordinate map estimation. This approach enables the learning of local spatial-aware features from the image modality to supplement sparse point clouds.
Neural Information Processing Systems
Jan-19-2025, 17:42:40 GMT
- Technology:
- Information Technology > Artificial Intelligence > Vision (0.74)