Can Small Language Models Reliably Resist Jailbreak Attacks? A Comprehensive Evaluation

Open in new window