Defining and Characterizing Reward Hacking

Open in new window