HEDNet: A Hierarchical Encoder-Decoder Network for 3D Object Detection in Point Clouds

Jan-19-2025, 18:12:39 GMT–Neural Information Processing Systems

A primary challenge in 3D object detection stems from the sparse distribution of points within the 3D scene. Existing high-performance methods typically employ 3D sparse convolutional neural networks with small kernels to extract features. To reduce computational costs, these methods resort to submanifold sparse convolutions, which prevent the information exchange among spatially disconnected features. Some recent approaches have attempted to address this problem by introducing large-kernel convolutions or self-attention mechanisms, but they either achieve limited accuracy improvements or incur excessive computational costs. We propose HEDNet, a hierarchical encoder-decoder network for 3D object detection, which leverages encoder-decoder blocks to capture long-range dependencies among features in the spatial space, particularly for large and distant objects.

hednet, hierarchical encoder-decoder network, object detection, (3 more...)

Neural Information Processing Systems

Jan-19-2025, 18:12:39 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence
  - Vision (0.95)
  - Machine Learning > Neural Networks (0.64)