Preference Tuning with Human Feedback on Language, Speech, and Vision Tasks: A Survey

Open in new window