Search on the Replay Buffer: Bridging Planning and Reinforcement Learning
Ben Eysenbach, Russ R. Salakhutdinov, Sergey Levine
–Neural Information Processing Systems
The history of learning for control has been an exciting back and forth between twobroad classes ofalgorithms: planning andreinforcement learning.
Neural Information Processing Systems
Feb-12-2026, 07:32:30 GMT
- Country:
- Asia > Middle East
- Jordan (0.04)
- Europe > Italy
- North America
- Oceania > Australia
- Queensland > Brisbane (0.04)
- Asia > Middle East
- Technology: