RRHF: Rank Responses to Align Language Models with Human Feedback

Open in new window