Provable Benefit of Multitask Representation Learning in Reinforcement Learning

Jan-18-2025, 22:11:17 GMT–Neural Information Processing Systems

As representation learning becomes a powerful technique to reduce sample complexity in reinforcement learning (RL) in practice, theoretical understanding of its advantage is still limited. In this paper, we theoretically characterize the benefit of representation learning under the low-rank Markov decision process (MDP) model. We first study multitask low-rank RL (as upstream training), where all tasks share a common representation, and propose a new multitask reward-free algorithm called REFUEL. REFUEL learns both the transition kernel and the near-optimal policy for each task, and outputs a well-learned representation for downstream tasks. Our result demonstrates that multitask representation learning is provably more sample-efficient than learning each task individually, as long as the total number of tasks is above a certain threshold.

learning, multitask representation learning, representation, (6 more...)

Neural Information Processing Systems

Jan-18-2025, 22:11:17 GMT

Conferences Web Page

Add feedback

Genre:
- Research Report > New Finding (0.60)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.63)