Successive Pruning for Model Compression via Rate Distortion Theory