Goto

Collaborating Authors

 South America









Unifying Voxel-based Representation with Transformer for 3D Object Detection Y anwei Li

Neural Information Processing Systems

Detecting 3D objects with multi-modality sensors ( i.e., LiDAR and camera) is regarded as a fundamental task in real-world scenes. For accurate object detection, data from different modalities are utilized to provide complementary knowledge, like accurate positions from point clouds and rich context from images.