Adversarial Model for Offline Reinforcement Learning