Reinforcement Learning using Kernel-Based Stochastic Factorization
Barreto, Andre, Precup, Doina, Pineau, Joelle
–Neural Information Processing Systems
Kernel-based reinforcement-learning (KBRL) is a method for learning a decision policy from a set of sample transitions which stands out for its strong theoretical guarantees. However, the size of the approximator grows with the number of transitions, which makes the approach impractical for large problems. In this paper we introduce a novel algorithm to improve the scalability of KBRL. We resort to a special decomposition of a transition matrix, called stochastic factorization, to fix the size of the approximator while at the same time incorporating all the information contained in the data. The resulting algorithm, kernel-based stochastic factorization (KBSF), is much faster but still converges to a unique solution.
Neural Information Processing Systems
Feb-14-2020, 22:12:00 GMT
- Technology: