No Offense Taken: Eliciting Offensiveness from Language Models
