Safer-Instruct: Aligning Language Models with Automated Preference Data
