Review for NeurIPS paper: A new convergent variant of Q-learning with linear function approximation

Open in new window