Voxel Proposal Network via Multi-Frame Knowledge Distillation for Semantic Scene Completion Kairui Yang
–Neural Information Processing Systems
Semantic scene completion is a difficult task that involves completing the geometry and semantics of a scene from point clouds in a large-scale environment. Many current methods use 3D/2D convolutions or attention mechanisms, but these have limitations in directly constructing geometry and accurately propagating features from related voxels, the completion likely fails while propagating features in a single pass without considering multiple potential pathways. And they are generally only suitable for static scenes and struggle to handle dynamic aspects. This paper introduces Voxel Proposal Network (VPNet) that completes scenes from 3D and Bird's-Eye-View (BEV) perspectives. It includes Confident Voxel Proposal based on voxel-wise coordinates to propose confident voxels with high reliability for completion.
Neural Information Processing Systems
Jun-1-2025, 05:31:30 GMT
- Genre:
- Research Report > Experimental Study (0.93)
- Technology:
- Information Technology > Artificial Intelligence
- Machine Learning > Neural Networks (0.68)
- Natural Language (0.68)
- Representation & Reasoning (0.68)
- Robots (0.93)
- Vision (1.00)
- Information Technology > Artificial Intelligence