A Tensor Low-Rank Approximation for Value Functions in Multi-Task Reinforcement Learning

Rozada, Sergio, Paternain, Santiago, Bazerque, Juan Andres, Marques, Antonio G.

Jan-17-2025–arXiv.org Artificial Intelligence

In pursuit of reinforcement learning systems that could train in physical environments, we investigate multi-task approaches as a means to alleviate the need for massive data acquisition. In a tabular scenario where the Q-functions are collected across tasks, we model our learning problem as optimizing a higher order tensor structure. Recognizing that close-related tasks may require similar actions, our proposed method imposes a low-rank condition on this aggregated Q-tensor. The rationale behind this approach to multi-task learning is that the low-rank structure enforces the notion of similarity, without the need to explicitly prescribe which tasks are similar, but inferring this information from a reduced amount of data simultaneously with the stochastic optimization of the Q-tensor. The efficiency of our low-rank tensor approach to multi-task learning is demonstrated in two numerical experiments, first in a benchmark environment formed by a collection of inverted pendulums, and then into a practical scenario involving multiple wireless communication devices.

learning, multi-task learning, tensor, (15 more...)

arXiv.org Artificial Intelligence

Jan-17-2025

arXiv.org PDF

Add feedback

Country:
- South America > Chile
  - Santiago Metropolitan Region > Santiago Province > Santiago (0.04)
- North America > United States
  - Pennsylvania > Allegheny County
    - Pittsburgh (0.04)
  - New York > Rensselaer County
    - Troy (0.04)
- Europe > Spain
  - Galicia > Madrid (0.05)

Genre:
- Research Report (0.40)

Industry:
- Energy (0.46)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning
  - Reinforcement Learning (1.00)
  - Statistical Learning (0.88)
  - Neural Networks > Deep Learning (0.46)