Blazing the trails before beating the path: Sample-efficient Monte-Carlo planning
Jean-Bastien Grill, Michal Valko, Remi Munos
–Neural Information Processing Systems
Y ou are a robot and you live in a Markov decision process (MDP) with a finite or an infinite number of transitions from state-action to next states. Y ou got brains and so you plan before you act.
Neural Information Processing Systems
Nov-21-2025, 07:17:08 GMT
- Country:
- Europe
- France > Hauts-de-France
- Pas-de-Calais (0.04)
- Spain > Catalonia
- Barcelona Province > Barcelona (0.04)
- France > Hauts-de-France
- North America > United States
- Massachusetts > Middlesex County
- Belmont (0.04)
- New Jersey > Mercer County
- Princeton (0.04)
- Massachusetts > Middlesex County
- Europe