On Gap-dependent Bounds for Offline Reinforcement Learning

Open in new window