Do Language Models Care About Text Quality? Evaluating Web-Crawled Corpora Across 11 Languages

Open in new window