Voxel Proposal Network via Multi-Frame Knowledge Distillation for Semantic Scene Completion Kairui Yang

Jun-1-2025, 05:31:30 GMT–Neural Information Processing Systems

Semantic scene completion is a difficult task that involves completing the geometry and semantics of a scene from point clouds in a large-scale environment. Many current methods use 3D/2D convolutions or attention mechanisms, but these have limitations in directly constructing geometry and accurately propagating features from related voxels, the completion likely fails while propagating features in a single pass without considering multiple potential pathways. And they are generally only suitable for static scenes and struggle to handle dynamic aspects. This paper introduces Voxel Proposal Network (VPNet) that completes scenes from 3D and Bird's-Eye-View (BEV) perspectives. It includes Confident Voxel Proposal based on voxel-wise coordinates to propose confident voxels with high reliability for completion.

completion, machine learning, natural language, (16 more...)

Neural Information Processing Systems

Jun-1-2025, 05:31:30 GMT

Conferences PDF

Add feedback

Country:
- Asia > China (0.68)

Genre:
- Research Report > Experimental Study (0.93)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning > Neural Networks (0.68)
  - Natural Language (0.68)
  - Representation & Reasoning (0.68)
  - Robots (0.93)
  - Vision (1.00)