S 3 : Sign-Sparse-Shift Reparametrization for Effective Training of Low-bit Shift Networks