On Gap-dependent Boundsfor Offline Reinforcement Learning

Open in new window