Reviews: Hybrid Reward Architecture for Reinforcement Learning

Oct-7-2024, 13:53:47 GMT–Neural Information Processing Systems

R5: Summary: This paper builds on the basic idea of the Horde architecture: learning many value functions in parallel with off-policy reinforcement learning. This paper shows that learning many value functions in parallel improves the performance on a single main task. The novelty here lies in a particular strategy for generating many different reward functions and how to combine them to generate behavior. The results show large improvements in performance in an illustrative grid world and Miss Pac-man. Decision: This paper is difficult to access.

hybrid reward architecture, representation, value function, (8 more...)

Neural Information Processing Systems

Oct-7-2024, 13:53:47 GMT

Conferences Web Page

Add feedback

Genre:
- Research Report (0.36)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.63)