Value Function Decomposition for Iterative Design of Reinforcement Learning Agents

Dec-24-2025, 04:36:36 GMT–Neural Information Processing Systems

Designing reinforcement learning (RL) agents is typically a difficult process that requires numerous design iterations. Learning can fail for a multitude of reasons and standard RL methods provide too few tools to provide insight into the exact cause. In this paper, we show how to integrate \textit{value decomposition} into a broad class of actor-critic algorithms and use it to assist in the iterative agent-design process. Value decomposition separates a reward function into distinct components and learns value estimates for each. These value estimates provide insight into an agent's learning and decision-making process and enable new training methods to mitigate common problems.

iterative design, reinforcement learning agent, value function decomposition, (7 more...)

Neural Information Processing Systems

Dec-24-2025, 04:36:36 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.79)