Off-PolicyEvaluationforAction-Dependent Non-StationaryEnvironments
–Neural Information Processing Systems
Methods for sequential decision making are often built upon a foundational assumption that the underlying decision process is stationary [Sutton and Barto, 2018]. While this assumption was a cornerstone when laying the theoretical foundations of the field, and while is often reasonable, it isseldom trueinpractice andcanbeunreasonable [Dulac-Arnold etal.,2019].
Neural Information Processing Systems
Feb-8-2026, 10:56:25 GMT
- Country:
- Oceania > Australia
- North America > United States
- Massachusetts > Middlesex County > Cambridge (0.04)
- Europe > United Kingdom
- England > Cambridgeshire > Cambridge (0.04)
- Asia > Middle East
- Jordan (0.04)
- Industry:
- Government (0.68)
- Health & Medicine > Public Health (0.46)
- Technology: