The Value Equivalence Principle for Model-Based Reinforcement Learning

Neural Information Processing Systems 

Learning models of the environment from data is often viewed as an essential component to building intelligent reinforcement learning (RL) agents. The common practice is to separate the learning of the model from its use, by constructing a model of the environment's dynamics that correctly predicts the observed state transitions. In this paper we argue that the limited representational resources of model-based RL agents are better used to build models that are directly useful for value-based planning.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found