Voxel Mamba: Group-Free State Space Models for Point Cloud based 3D Object Detection
–Neural Information Processing Systems
Serialization-based methods, which serialize the 3D voxels and group them into multiple sequences before inputting to Transformers, have demonstrated their effectiveness in 3D object detection. However, serializing 3D voxels into 1D sequences will inevitably sacrifice the voxel spatial proximity. Such an issue is hard to be addressed by enlarging the group size with existing serialization-based methods due to the quadratic complexity of Transformers with feature sizes.
Neural Information Processing Systems
Oct-10-2025, 10:08:19 GMT
- Genre:
- Research Report > Experimental Study (0.93)
- Industry:
- Information Technology > Services (0.50)
- Technology: