Derivative-Free Methods for Policy Optimization: Guarantees for Linear Quadratic Systems
