Defending MoE LLMs against Harmful Fine-Tuning via Safety Routing Alignment

Open in new window