Balancing Multiple Sources of Reward in Reinforcement Learning

Dec-31-2001–Neural Information Processing Systems

For many problems that would be natural for reinforcement learning, the reward signal is not a single scalar value but has multiple scalar components. Examples of such problems include agents with multiple goals and agents with multiple users. Creating a single reward value by combining the multiple components can throw away vital information and can lead to incorrect solutions. We describe the multiple reward source problem and discuss the problems with applying traditional reinforcement learning. We then present a new algorithm for finding a solution and results on simulated environments.

artificial intelligence, machine learning, reinforcement learning

Country:
- North America > United States > Massachusetts (0.28)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
