How to Evaluate Reward Models for RLHF

Open in new window