A Systematic Analysis of Base Model Choice for Reward Modeling
