Bayesian Bits: Unifying Quantization and Pruning

Neural Information Processing Systems 

We introduce Bayesian Bits, a practical method for joint mixed-precision quantization and pruning through gradient-based optimization.