Convergence and sample complexity of gradient methods for the model-free linear quadratic regulator problem

Open in new window