Continuous MDP Homomorphisms and Homomorphic Policy Gradient

Neural Information Processing Systems 

Abstraction has been widely studied as a way to improve the efficiency and generalization of reinforcement learning algorithms.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found