Sample-Efficient Reinforcement Learning with Stochastic Ensemble Value Expansion

Jacob Buckman, Danijar Hafner, George Tucker, Eugene Brevdo, Honglak Lee

Mar-13-2026, 20:43:02 GMT–Neural Information Processing Systems

We propose stochastic ensemble value expansion (STEVE), a novel model-based technique that addresses the problem of errors in a learned model degrading performance. By dynamically interpolating between model rollouts of various horizon lengths for each individual example, STEVE ensures that the model is only utilized when doing so does not introduce significant errors.
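The per-example interpolation across rollout horizons can be illustrated with a small sketch. It assumes that each horizon yields several candidate target estimates (one per ensemble member) and that horizons are blended by inverse-variance weighting, so horizons where the ensemble disagrees contribute little. The summary above does not spell out the weighting rule, so the function name `interpolate_targets`, the data layout, and the `eps` constant are illustrative assumptions rather than the authors' implementation.

```python
from statistics import fmean, pvariance


def interpolate_targets(candidates, eps=1e-8):
    """Blend per-horizon target estimates by inverse variance.

    candidates[h] holds the target estimates for rollout horizon h,
    one per ensemble member (hypothetical layout, for illustration).
    Returns the blended target and the normalized per-horizon weights.
    """
    means = [fmean(c) for c in candidates]
    # Ensemble disagreement stands in for uncertainty at each horizon.
    variances = [pvariance(c) for c in candidates]
    # eps avoids division by zero when the ensemble agrees exactly.
    inverse = [1.0 / (v + eps) for v in variances]
    total = sum(inverse)
    weights = [w / total for w in inverse]
    target = sum(w * m for w, m in zip(weights, means))
    return target, weights


# Made-up numbers: short horizons agree, the longest one does not.
candidates = [
    [1.00, 1.02, 0.98],   # horizon 0: no model rollout
    [1.10, 1.00, 1.20],   # horizon 1
    [3.00, -1.00, 0.50],  # horizon 2: rollouts disagree strongly
]
target, weights = interpolate_targets(candidates)
print(target, weights)
```

With these made-up numbers, almost all of the weight falls on horizon 0, so the blended target stays close to the estimate that does not rely on the model. This matches the behavior described above: the model is used only where it does not introduce significant error.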

artificial intelligence, machine learning, reinforcement learning

Country:
- North America
  - United States > California
    - Santa Clara County > Mountain View (0.04)
  - Canada > Quebec
    - Montreal (0.04)
- Europe > Sweden
  - Stockholm > Stockholm (0.04)
- Asia > Middle East
  - Jordan (0.04)

Genre:
- Research Report (1.00)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning (1.00)
  - Machine Learning
    - Reinforcement Learning (1.00)
    - Neural Networks > Deep Learning (0.68)