Model-based Offline Policy Optimization with Adversarial Network