Eliciting Latent Predictions from Transformers with the Tuned Lens
