Efficient Reinforcement Learning for High Dimensional Linear Quadratic Systems

Ibrahimi, Morteza, Javanmard, Adel, Roy, Benjamin V.

Feb-15-2020, 00:11:20 GMT–Neural Information Processing Systems

We study the problem of adaptive control of a high dimensional linear quadratic (LQ) system. Previous work established the asymptotic convergence to an optimal controller for various adaptive control schemes. More recently, an asymptotic regret bound of $\tilde{O}(\sqrt{T})$ was shown for $T \gg p$ where $p$ is the dimension of the state space. In this work we consider the case where the matrices describing the dynamic of the LQ system are sparse and their dimensions are large. We present an adaptive control scheme that for $p \gg 1$ and $T \gg \polylog(p)$ achieves a regret bound of $\tilde{O}(p \sqrt{T})$.

adaptive control scheme, efficient reinforcement learning, high dimensional linear quadratic system, (4 more...)

Neural Information Processing Systems

Feb-15-2020, 00:11:20 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.40)