Ret3D: Rethinking Object Relations for Efficient 3D Object Detection in Driving Scenes

Wu, Yu-Huan, Zhang, Da, Zhang, Le, Zhan, Xin, Dai, Dengxin, Liu, Yun, Cheng, Ming-Ming

Aug-17-2022–arXiv.org Artificial Intelligence

Current efficient LiDAR-based detection frameworks are lacking in exploiting object relations, which naturally present in both spatial and temporal manners. To this end, we introduce a simple, efficient, and effective two-stage detector, termed as Ret3D. At the core of Ret3D is the utilization of novel intra-frame and inter-frame relation modules to capture the spatial and temporal relations accordingly. More Specifically, intra-frame relation module (IntraRM) encapsulates the intra-frame objects into a sparse graph and thus allows us to refine the object features through efficient message passing. On the other hand, inter-frame relation module (InterRM) densely connects each object in its corresponding tracked sequences dynamically, and leverages such temporal information to further enhance its representations efficiently through a lightweight transformer network. We instantiate our novel designs of IntraRM and InterRM with general center-based or anchor-based detectors and evaluate them on Waymo Open Dataset (WOD). With negligible extra overhead, Ret3D achieves the state-of-the-art performance, being 5.5% and 3.2% higher than the recent competitor in terms of the LEVEL 1 and LEVEL 2 mAPH metrics on vehicle detection, respectively.

comput, detection, relation, (14 more...)

arXiv.org Artificial Intelligence

Aug-17-2022

arXiv.org PDF

Add feedback

Country:
- Europe
  - Germany (0.04)
  - Switzerland > Zürich
    - Zürich (0.14)
- Asia > China
  - Tianjin Province > Tianjin (0.04)

Genre:
- Research Report (0.64)

Technology:
- Information Technology > Artificial Intelligence
  - Vision (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (0.46)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found