Fine-Tuning Language Models from Human Preferences

Open in new window