Near-Optimal Randomized Exploration for Tabular Markov Decision Processes
–Neural Information Processing Systems
We study algorithms using randomized value functions for exploration in reinforcement learning. This type of algorithms enjoys appealing empirical performance.
Neural Information Processing Systems
Oct-3-2025, 04:21:29 GMT
- Country:
- Asia > Middle East
- Jordan (0.04)
- Europe > United Kingdom
- England > Greater London > London (0.04)
- North America > United States
- California > Los Angeles County > Long Beach (0.04)
- Asia > Middle East
- Genre:
- Research Report (0.68)