Rethinking the Evaluation of Alignment Methods: Insights into Diversity, Generalisation, and Safety

Open in new window