Towards Improved Safety Alignment of LLM via a Human-Preference Dataset Jiaming Ji

Open in new window