Scaling Laws for Reward Model Overoptimization
