Distance Minimization for Reward Learning from Scored Trajectories