Towards Reward Fairness in RLHF: From a Resource Allocation Perspective

Open in new window