Linearity-based neural network compression