Training Value-Aligned Reinforcement Learning Agents Using a Normative Prior