Inoculation Prompting: Eliciting traits from LLMs during training can suppress them at test-time
