Multi-Modal Data-Efficient 3D Scene Understanding for Autonomous Driving

Kong, Lingdong, Xu, Xiang, Ren, Jiawei, Zhang, Wenwei, Pan, Liang, Chen, Kai, Ooi, Wei Tsang, Liu, Ziwei

May-8-2024–arXiv.org Artificial Intelligence

Efficient data utilization is crucial for advancing 3D scene understanding in autonomous driving, where reliance on heavily human-annotated LiDAR point clouds challenges fully supervised methods. Addressing this, our study extends into semi-supervised learning for LiDAR semantic segmentation, leveraging the intrinsic spatial priors of driving scenes and multi-sensor complements to augment the efficacy of unlabeled datasets. We introduce LaserMix++, an evolved framework that integrates laser beam manipulations from disparate LiDAR scans and incorporates LiDAR-camera correspondences to further assist data-efficient learning. Our framework is tailored to enhance 3D scene consistency regularization by incorporating multi-modality, including 1) multi-modal LaserMix operation for fine-grained cross-sensor interactions; 2) camera-to-LiDAR feature distillation that enhances LiDAR feature learning; and 3) language-driven knowledge guidance generating auxiliary supervisions using open-vocabulary models. The versatility of LaserMix++ enables applications across LiDAR representations, establishing it as a universally applicable solution. Our framework is rigorously validated through theoretical analysis and extensive experiments on popular driving perception datasets. Results demonstrate that LaserMix++ markedly outperforms fully supervised alternatives, achieving comparable accuracy with five times fewer annotations and significantly improving the supervised-only baselines. This substantial advancement underscores the potential of semi-supervised approaches in reducing the reliance on extensive labeled data in LiDAR-based 3D scene understanding systems.

comput, conf, segmentation, (14 more...)

arXiv.org Artificial Intelligence

May-8-2024

arXiv.org PDF

Add feedback

Country:
- North America > United States
  - New Mexico > Bernalillo County > Albuquerque (0.04)
- Asia
  - Singapore (0.04)
  - China
    - Shanghai > Shanghai (0.04)
    - Jiangsu Province > Nanjing (0.04)

Genre:
- Research Report > New Finding (0.66)

Industry:
- Information Technology > Robotics & Automation (0.61)
- Transportation > Ground
  - Road (0.61)

Technology:
- Information Technology > Artificial Intelligence
  - Vision (1.00)
  - Robots > Autonomous Vehicles (0.70)
  - Machine Learning > Neural Networks
    - Deep Learning (0.67)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found