An Interpretability Evaluation Benchmark for Pre-trained Language Models

Open in new window