Checklists Are Better Than Reward Models For Aligning Language Models

Open in new window