Sharing Knowledge in Multi-Task Deep Reinforcement Learning

D'Eramo, Carlo, Tateo, Davide, Bonarini, Andrea, Restelli, Marcello, Peters, Jan

Jan-17-2024–arXiv.org Artificial Intelligence

We study the benefit of sharing representations among tasks to enable the effective use of deep neural networks in Multi-Task Reinforcement Learning. We leverage the assumption that learning from different tasks, sharing common properties, is helpful to generalize the knowledge of them resulting in a more effective feature extraction compared to learning a single task. Intuitively, the resulting set of features offers performance benefits when used by Reinforcement Learning algorithms. We prove this by providing theoretical guarantees that highlight the conditions for which is convenient to share representations among tasks, extending the wellknown finite-time bounds of Approximate Value-Iteration to the multi-task setting. In addition, we complement our analysis by proposing multi-task extensions of three Reinforcement Learning algorithms that we empirically evaluate on widely used Reinforcement Learning benchmarks showing significant improvements over the single-task counterparts in terms of sample efficiency and performance. Multi-Task Learning (MTL) ambitiously aims to learn multiple tasks jointly instead of learning them separately, leveraging the assumption that the considered tasks have common properties which can be exploited by Machine Learning (ML) models to generalize the learning of each of them. For instance, the features extracted in the hidden layers of a neural network trained on multiple tasks have the advantage of being a general representation of structures common to each other.

avg, conference paper, representation, (15 more...)

arXiv.org Artificial Intelligence

Jan-17-2024

arXiv.org PDF

Add feedback

Country:
- North America > United States
  - California > Los Angeles County > Santa Monica (0.04)
- Europe > Germany
  - Hesse > Darmstadt Region
    - Darmstadt (0.05)
  - Baden-Württemberg > Tübingen Region
    - Tübingen (0.04)

Genre:
- Research Report > New Finding (0.46)

Industry:
- Education (0.46)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning
  - Reinforcement Learning (1.00)
  - Neural Networks > Deep Learning (0.67)