Linear Complementarity for Regularized Policy Evaluation and Improvement

Johns, Jeffrey, Painter-wakefield, Christopher, Parr, Ronald

Feb-15-2020, 19:44:04 GMT–Neural Information Processing Systems

Recent work in reinforcement learning has emphasized the power of L1 regularization to perform feature selection and prevent overfitting. We propose formulating the L1 regularized linear fixed point problem as a linear complementarity problem (LCP). This formulation offers several advantages over the LARS-inspired formulation, LARS-TD. The LCP formulation allows the use of efficient off-the-shelf solvers, leads to a new uniqueness result, and can be initialized with starting points from similar problems (warm starts). We demonstrate that warm starts, as well as the efficiency of LCP solvers, can speed up policy iteration.

artificial intelligence, machine learning, regularized policy evaluation and improvement, (6 more...)

Neural Information Processing Systems

Feb-15-2020, 19:44:04 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.31)