MsSVT: Mixed-scale Sparse Voxel Transformer for 3D Object Detection on Point Clouds
Neural Information Processing Systems
To mitigate this gap, we present a novel Mixed-scale Sparse Voxel Transformer, named MsSVT, which captures both types of information simultaneously through a divide-and-conquer approach.
Aug-14-2025, 16:47:59 GMT