Fundamental Safety-Capability Trade-offs in Fine-tuning Large Language Models

Open in new window