AntiDote: Bi-level Adversarial Training for Tamper-Resistant LLMs
