Sample-Efficient Reinforcement Learning with Stochastic Ensemble Value Expansion
Jacob Buckman, Danijar Hafner, George Tucker, Eugene Brevdo, Honglak Lee
–Neural Information Processing Systems
Neural Information Processing Systems
Mar-27-2025, 04:17:24 GMT