Reward Shaping for User Satisfaction in a REINFORCE Recommender