UrBLiMP: A Benchmark for Evaluating the Linguistic Competence of Large Language Models in Urdu
