Reframing Offline Reinforcement Learning as a Regression Problem