Unifying Voxel-based Representation with Transformer for 3D Object Detection Y anwei Li
–Neural Information Processing Systems
Detecting 3D objects with multi-modality sensors ( i.e., LiDAR and camera) is regarded as a fundamental task in real-world scenes. For accurate object detection, data from different modalities are utilized to provide complementary knowledge, like accurate positions from point clouds and rich context from images.
Neural Information Processing Systems
Aug-15-2025, 23:44:26 GMT
- Country:
- South America > Brazil (0.04)
- Asia > China
- Hong Kong (0.04)
- Guangdong Province > Shenzhen (0.04)
- Genre:
- Research Report (0.46)
- Technology:
- Information Technology > Artificial Intelligence
- Vision (1.00)
- Representation & Reasoning (1.00)
- Machine Learning (1.00)
- Information Technology > Artificial Intelligence