Utility-inspired Reward Transformations Improve Reinforcement Learning Training of Language Models