Accelerating Multi-Task Temporal Difference Learning under Low-Rank Representation

Bai, Yitao, Zeng, Sihan, Romberg, Justin, Doan, Thinh T.

arXiv.org Artificial Intelligence 

We study policy evaluation problems in multi-task reinforcement learning (RL) under a low-rank representation setting. In this setting, we are given $N$ learning tasks where the corresponding value function of these tasks lie in an $r$-dimensional subspace, with $r