Utility-inspired Reward Transformations Improve Reinforcement Learning Training of Language Models

Open in new window