Self-Exploring Language Models: Active Preference Elicitation for Online Alignment

Open in new window