Optimized CNNs for Rapid 3D Point Cloud Object Recognition

Lyu, Tianyi, Gu, Dian, Chen, Peiyuan, Jiang, Yaoting, Zhang, Zhenhong, Pang, Huadong, Zhou, Li, Dong, Yiping

Dec-3-2024–arXiv.org Artificial Intelligence

This study introduces a method for efficiently detecting objects within 3D point clouds using convolutional neural networks (CNNs). Our approach adopts a unique feature-centric voting mechanism to construct convolutional layers that capitalize on the typical sparsity observed in input data. We explore the trade-off between accuracy and speed across diverse network architectures and advocate for integrating an $\mathcal{L}_1$ penalty on filter activations to augment sparsity within intermediate layers. This research pioneers the proposal of sparse convolutional layers combined with $\mathcal{L}_1$ regularization to effectively handle large-scale 3D data processing. Our method's efficacy is demonstrated on the MVTec 3D-AD object detection benchmark. The Vote3Deep models, with just three layers, outperform the previous state-of-the-art in both laser-only approaches and combined laser-vision methods. Additionally, they maintain competitive processing speeds. This underscores our approach's capability to substantially enhance detection performance while ensuring computational efficiency suitable for real-time applications.

artificial intelligence, detection, machine learning, (16 more...)

arXiv.org Artificial Intelligence

Dec-3-2024

arXiv.org PDF

Add feedback

Country:
- North America
  - Canada > Quebec
    - Montreal (0.14)
  - United States > Pennsylvania
    - Philadelphia County > Philadelphia (0.14)

Genre:
- Research Report (1.00)

Industry:
- Health & Medicine > Therapeutic Area (0.93)
- Information Technology (1.00)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning > Neural Networks
    - Deep Learning (0.89)
  - Vision (1.00)