RATE: Score Reward Models with Imperfect Rewrites of Rewrites
