Numerical Pruning for Efficient Autoregressive Models