Worst-Case Offline Reinforcement Learning with Arbitrary Data Support
–Neural Information Processing Systems
We propose a method of offline reinforcement learning (RL) featuring the performance guarantee without any assumptions on the data support.
Neural Information Processing Systems
Nov-19-2025, 16:41:38 GMT
- Country:
- Asia > Japan
- Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.14)
- Europe > United Kingdom
- England > Cambridgeshire > Cambridge (0.04)
- North America > United States (0.04)
- Asia > Japan
- Genre:
- Research Report > Experimental Study (0.93)
- Industry:
- Information Technology (0.46)
- Technology: