Continual Auxiliary Task Learning

May-29-2025, 02:28:38 GMT–Neural Information Processing Systems

Learning auxiliary tasks, such as multiple predictions about the world, can provide many benefits to reinforcement learning systems. A variety of off-policy learning algorithms have been developed to learn such predictions, but as yet there is little work on how to adapt the behavior to gather useful data for those off-policy predictions. In this work, we investigate a reinforcement learning system designed to learn a collection of auxiliary tasks, with a behavior policy learning to take actions to improve those auxiliary predictions. We highlight the inherent non-stationarity in this continual auxiliary task learning problem, for both prediction learners and the behavior learner. We develop an algorithm based on successor features that facilitates tracking under non-stationary rewards, and prove the separation into learning successor features and rewards provides convergence rate improvements. We conduct an in-depth study into the resulting multi-prediction learning system.

learner, reward feature, successor feature, (16 more...)

Neural Information Processing Systems

May-29-2025, 02:28:38 GMT

Conferences PDF

Add feedback

Country:
- North America
  - Canada > Alberta (0.14)
  - United States
    - California (0.04)
    - Massachusetts > Middlesex County
      - Cambridge (0.04)
- Asia
  - Middle East > Jordan (0.04)
  - Japan > Honshū
    - Chūbu > Toyama Prefecture > Toyama (0.04)

Industry:
- Education > Focused Education (0.34)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning
  - Statistical Learning (1.00)
  - Reinforcement Learning (1.00)

Duplicate Docs Excel Report

Title
68331ff0427b551b68e911eebe35233b-Supplemental.pdf

Similar Docs Excel Report more

Title	Similarity	Source
None found