Unforgotten Safety: Preserving Safety Alignment of Large Language Models with Continual Learning
