This is the way: designing and compiling LEPISZCZE, a comprehensive NLP benchmark for Polish

Dec-24-2025, 17:30:22 GMT–Neural Information Processing Systems

The availability of compute and data to train larger and larger language models increases the demand for robust methods of benchmarking the true progress of LM training. Recent years witnessed significant progress in standardized benchmarking for English. Benchmarks such as GLUE, SuperGLUE, or KILT have become a de facto standard tools to compare large language models. Following the trend to replicate GLUE for other languages, the KLEJ benchmark\ (klej is the word for glue in Polish) has been released for Polish. In this paper, we evaluate the progress in benchmarking for low-resourced languages.

benchmark, comprehensive nlp benchmark, lepiszcze, (8 more...)

Neural Information Processing Systems

Dec-24-2025, 17:30:22 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence > Natural Language (0.81)