Supported Trust Region Optimization for Offline Reinforcement Learning