Tight Regret Bounds for Model-Based Reinforcement Learning with Greedy Policies
–Neural Information Processing Systems
State-of-the-art efficient model-based Reinforcement Learning (RL) algorithms typically act by iteratively solving empirical models, i.e., by performing full-planning
Neural Information Processing Systems
Oct-2-2025, 09:51:04 GMT
- Country:
- Asia > Middle East
- North America
- Canada (0.04)
- United States > Massachusetts
- Middlesex County > Belmont (0.04)
- Technology: