Reinforcement Learning using Kernel-Based Stochastic Factorization

Barreto, Andre, Precup, Doina, Pineau, Joelle

Feb-14-2020, 22:12:00 GMT–Neural Information Processing Systems

Kernel-based reinforcement-learning (KBRL) is a method for learning a decision policy from a set of sample transitions which stands out for its strong theoretical guarantees. However, the size of the approximator grows with the number of transitions, which makes the approach impractical for large problems. In this paper we introduce a novel algorithm to improve the scalability of KBRL. We resort to a special decomposition of a transition matrix, called stochastic factorization, to fix the size of the approximator while at the same time incorporating all the information contained in the data. The resulting algorithm, kernel-based stochastic factorization (KBSF), is much faster but still converges to a unique solution.

algorithm, kernel-based stochastic factorization, reinforcement learning, (4 more...)

Neural Information Processing Systems

Feb-14-2020, 22:12:00 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.87)