Group Robust Preference Optimization in Reward-free RLHF
–Neural Information Processing Systems
While these data often come from diverse labelers' groups (e.g., different demographics, ethnicities, company teams, etc.), traditional RLHF approaches
Neural Information Processing Systems
Oct-10-2025, 00:29:06 GMT