Holmes: Benchmark the Linguistic Competence of Language Models

Open in new window