Deep Reinforcement and InfoMax Learning

Neural Information Processing Systems 

We begin with the hypothesis that a model-free agent whose representations are predictive of properties of future states (beyond expected rewards) will be more capable of solving and adapting to new RL problems.
