Percentile Criterion Optimization in Offline Reinforcement Learning
–Neural Information Processing Systems
In reinforcement learning, robust policies for high-stakes decision-making problems with limited data are usually computed by optimizing the percentile criterion .
Neural Information Processing Systems
Feb-8-2026, 15:55:28 GMT
- Country:
- Asia > Singapore
- Central Region > Singapore (0.04)
- Europe > United Kingdom
- England
- Cambridgeshire > Cambridge (0.04)
- Oxfordshire > Oxford (0.04)
- England
- North America > United States
- Massachusetts
- Hampshire County > Amherst (0.04)
- Middlesex County > Cambridge (0.04)
- New Hampshire (0.04)
- Wisconsin > Dane County
- Madison (0.04)
- Massachusetts
- Asia > Singapore
- Genre:
- Research Report > New Finding (0.67)
- Technology: