Efficient and Sharp Off-Policy Evaluation in Robust Markov Decision Processes Andrew Bennett
–Neural Information Processing Systems
MDP, whether they are generated under the same or a different policy. This is an important problem when there is the possibility of a shift between historical and future environments, e.g.
Neural Information Processing Systems
Nov-20-2025, 04:12:19 GMT
- Country:
- Asia > India (0.04)
- Europe > United Kingdom
- England > Cambridgeshire > Cambridge (0.04)
- North America > United States (0.46)
- Genre:
- Research Report > Experimental Study (0.67)
- Industry:
- Government > Regional Government (0.45)
- Health & Medicine (0.93)