Unifying Voxel-based Representation with Transformer for 3D Object Detection Y anwei Li

Aug-15-2025, 23:44:26 GMT–Neural Information Processing Systems

Detecting 3D objects with multi-modality sensors ( i.e., LiDAR and camera) is regarded as a fundamental task in real-world scenes. For accurate object detection, data from different modalities are utilized to provide complementary knowledge, like accurate positions from point clouds and rich context from images.

artificial intelligence, detection, machine learning, (14 more...)

Neural Information Processing Systems

Aug-15-2025, 23:44:26 GMT

Conferences PDF

Add feedback

Country:
- South America > Brazil (0.04)
- Asia > China
  - Hong Kong (0.04)
  - Guangdong Province > Shenzhen (0.04)

Genre:
- Research Report (0.46)

Technology:
- Information Technology > Artificial Intelligence
  - Vision (1.00)
  - Representation & Reasoning (1.00)
  - Machine Learning (1.00)

Duplicate Docs Excel Report

Title
752df938681b2cf15e5fc9689f0bcf3a-Paper-Conference.pdf

Similar Docs Excel Report more

Title	Similarity	Source
None found