Least-Squares Temporal Difference Learning for the Linear Quadratic Regulator
