Value Internalization: Learning and Generalizing from Social Reward