Eliciting Latent Knowledge from Quirky Language Models

Open in new window