skLEP: A Slovak General Language Understanding Benchmark
Šuppa, Marek, Ridzik, Andrej, Hládek, Daniel, Javůrek, Tomáš, Ondrejová, Viktória, Sásiková, Kristína, Tamajka, Martin, Šimko, Marián
–arXiv.org Artificial Intelligence
In this work, we introduce skLEP, the first comprehensive benchmark specifically designed for evaluating Slovak natural language understanding (NLU) models. We have compiled skLEP to encompass nine diverse tasks that span token-level, sentence-pair, and document-level challenges, thereby offering a thorough assessment of model capabilities. To create this benchmark, we curated new, original datasets tailored for Slovak and meticulously translated established English NLU resources. Within this paper, we also present the first systematic and extensive evaluation of a wide array of Slovak-specific, multilingual, and English pre-trained language models using the skLEP tasks. Finally, we also release the complete benchmark data, an open-source toolkit facilitating both fine-tuning and evaluation of models, and a public leaderboard at https://github.com/slovak-nlp/sklep in the hopes of fostering reproducibility and drive future research in Slovak NLU.
arXiv.org Artificial Intelligence
Jun-27-2025
- Country:
- Asia
- India (0.04)
- Indonesia > Bali (0.04)
- Middle East > UAE
- Abu Dhabi Emirate > Abu Dhabi (0.14)
- Russia (0.04)
- Singapore (0.04)
- Europe
- United Kingdom (0.04)
- Ireland > Leinster
- County Dublin > Dublin (0.04)
- Croatia > Dubrovnik-Neretva County
- Dubrovnik (0.04)
- Ukraine (0.04)
- Spain > Catalonia
- Barcelona Province > Barcelona (0.04)
- Russia (0.04)
- Slovakia
- Banska Bystrica > Banská Bystrica (0.04)
- Bratislava > Bratislava (0.04)
- Košice > Košice (0.04)
- Finland (0.04)
- France > Provence-Alpes-Côte d'Azur
- Bouches-du-Rhône > Marseille (0.04)
- Italy > Tuscany
- Florence (0.04)
- North America
- Canada > Ontario
- Toronto (0.04)
- Dominican Republic (0.04)
- Mexico > Mexico City
- Mexico City (0.04)
- United States
- Minnesota > Hennepin County
- Minneapolis (0.14)
- Oregon > Multnomah County
- Portland (0.04)
- Minnesota > Hennepin County
- Canada > Ontario
- Asia
- Genre:
- Research Report > New Finding (0.46)
- Industry:
- Government > Regional Government (0.46)
- Media > News (0.46)
- Technology: