NLSR: Neuron-Level Safety Realignment of Large Language Models Against Harmful Fine-Tuning

Open in new window