The Generalization Gap in Offline Reinforcement Learning