$Q^\star$ Approximation Schemes for Batch Reinforcement Learning: A Theoretical Comparison

Open in new window