Review for NeurIPS paper: On the Convergence of Smooth Regularized Approximate Value Iteration Schemes

Neural Information Processing Systems 

This analysis provides theoretical insights explaining their empirical success. After author feedback and discussion all reviewers agree that this is a meaningful contribution to the better understanding of existing RL algorithms. This is thus a clear « Accept » decision. That being said, I would like to ask the authors to please add a discussion w.r.t.