$L_0$-ARM: Network Sparsification via Stochastic Binary Optimization