HelpSteer3-Preference: Open Human-Annotated Preference Data across Diverse Tasks and Languages
–Neural Information Processing Systems
Preference datasets are essential for training general-domain, instruction-following language models with Reinforcement Learning from Human Feedback (RLHF). Each subsequent data release raises expectations for future data collection, meaning there is a constant need to advance the quality and diversity of openly available preference data. To address this need, we introduce HelpSteer3-Preference, a permissively licensed (CC-BY-4.0),
Neural Information Processing Systems
Jun-16-2026, 16:27:04 GMT
- Country:
- Europe (0.67)
- North America > United States (0.67)
- Genre:
- Research Report > Experimental Study (1.00)
- Industry:
- Media > Music (1.00)
- Leisure & Entertainment > Sports (0.67)
- Education > Educational Setting
- K-12 Education (0.45)
- Technology: