Review for NeurIPS paper: Agnostic Q -learning with Function Approximation in Deterministic Systems: Near-Optimal Bounds on Approximation Error and Sample Complexity

Neural Information Processing Systems 

This paper makes progress on our theoretical understanding of function approximation in RL, a crucial and tricky topic. The paper is technically strong and proposes a highly novel recursion-based algorithm that could open the door to future innovations.