The Importance of Sampling inMeta-Reinforcement Learning
Bradly Stadie, Ge Yang, Rein Houthooft, Peter Chen, Yan Duan, Yuhuai Wu, Pieter Abbeel, Ilya Sutskever
–Neural Information Processing Systems
This sampling process is embodied as the policyπ, which is responsible for outputting an actiona conditioned on past environmental states{s}.
Neural Information Processing Systems
Feb-14-2026, 16:25:26 GMT