S 3 : Sign-Sparse-Shift Reparametrization for Effective Training of Low-bit Shift Networks
–Neural Information Processing Systems
Shift neural networks reduce computation complexity by removing expensive multiplication operations and quantizing continuous weights into low-bit discrete values, which are fast and energy-efficient compared to conventional neural networks. However, existing shift networks are sensitive to the weight initialization and yield a degraded performance caused by vanishing gradient and weight sign freezing problem. To address these issues, we propose S$^3$ re-parameterization, a novel technique for training low-bit shift networks.
Neural Information Processing Systems
Feb-9-2026, 11:27:57 GMT
- Technology: