Offline Reinforcement Learning with Behavioral Supervisor Tuning