Reinforcement Learning under Model Mismatch
Aurko Roy, Huan Xu, Sebastian Pokutta
–Neural Information Processing Systems
We scale up the robust algorithms to large MDPs via function approximation and prove convergence under two different settings.
Neural Information Processing Systems
Nov-21-2025, 11:32:39 GMT
- Country:
- North America > United States
- California > Los Angeles County
- Long Beach (0.04)
- Georgia > Fulton County
- Atlanta (0.04)
- Massachusetts > Middlesex County
- Cambridge (0.04)
- California > Los Angeles County
- North America > United States
- Genre:
- Research Report (0.46)
- Technology: