Measuring the Effect of Disfluency in Multilingual Knowledge Probing Benchmarks

Open in new window