Provably Efficient Offline Reinforcement Learning in Regular Decision Processes
–Neural Information Processing Systems
Most reinforcement learning (RL) algorithms hinge on the Markovian assumption, i.e. that the underlying system transitions and rewards are Markovian in some natural notion of (observable)
Neural Information Processing Systems
Feb-15-2026, 10:28:55 GMT
- Country:
- Europe
- Denmark > Capital Region
- Copenhagen (0.04)
- United Kingdom > England
- Cambridgeshire > Cambridge (0.04)
- Oxfordshire > Oxford (0.04)
- Denmark > Capital Region
- Europe