The Option Keyboard: Combining Skills in Reinforcement Learning

Oct-9-2024, 16:56:21 GMT–Neural Information Processing Systems

The ability to combine known skills to create new ones may be crucial in the solution of complex reinforcement learning problems that unfold over extended periods. We argue that a robust way of combining skills is to define and manipulate them in the space of pseudo-rewards (or "cumulants"). Based on this premise, we propose a framework for combining skills using the formalism of options. We show that every deterministic option can be unambiguously represented as a cumulant defined in an extended domain. Building on this insight and on previous results on transfer learning, we show how to approximate options whose cumulants are linear combinations of the cumulants of known options.

cumulant, option keyboard, reinforcement learning, (2 more...)

Neural Information Processing Systems

Oct-9-2024, 16:56:21 GMT

Conferences Web Page

Add feedback

Country:
- Asia > Japan > Honshū > Chūbu > Toyama Prefecture > Toyama (0.09)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.65)