The Importance of Sampling inMeta-Reinforcement Learning

Bradly Stadie, Ge Yang, Rein Houthooft, Peter Chen, Yan Duan, Yuhuai Wu, Pieter Abbeel, Ilya Sutskever

Neural Information Processing Systems 

This sampling process is embodied as the policyπ, which is responsible for outputting an actiona conditioned on past environmental states{s}.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found