A Fine-grained Interpretability Evaluation Benchmark for Neural NLP

Open in new window