Supervised Reward Inference