Safe Policy Improvement by Minimizing Robust Baseline Regret
Mohammad Ghavamzadeh, Marek Petrik, Yinlam Chow
–Neural Information Processing Systems
In this paper, we develop and analyze a new model-based approach that computes a safe policy, given an inaccurate model of the system's
Neural Information Processing Systems
Nov-21-2025, 08:13:53 GMT
- Country:
- Europe > Spain
- Catalonia > Barcelona Province > Barcelona (0.04)
- North America > United States
- California > Santa Clara County
- Palo Alto (0.04)
- New Hampshire (0.04)
- California > Santa Clara County
- Europe > Spain
- Industry:
- Energy (0.46)
- Technology: