HelpSteer2: Open-source dataset for training top-performing reward models

Open in new window