Better Exploration with Optimistic Actor Critic
Kamil Ciosek, Quan Vuong, Robert Loftin, Katja Hofmann
Neural Information Processing Systems
Actor-critic methods, a type of model-free Reinforcement Learning, have been successfully applied to challenging tasks in continuous control, often achieving state-of-the-art performance.
Feb-13-2026, 08:39:45 GMT
- Country:
  - Asia
    - China > Beijing
      - Beijing (0.05)
    - Middle East > Jordan (0.04)
  - Europe
  - North America
    - Canada
      - British Columbia > Metro Vancouver Regional District
        - Vancouver (0.14)
      - Quebec > Montreal (0.05)
    - United States
      - Arizona > Maricopa County
        - Phoenix (0.04)
      - Colorado > Denver County
        - Denver (0.14)
      - Louisiana (0.04)
      - Oregon > Benton County
        - Corvallis (0.04)
      - Pennsylvania > Allegheny County
        - Pittsburgh (0.04)
      - Tennessee > Davidson County
        - Nashville (0.04)
- Technology: