Pretraining Finnish ModernBERTs
Reunamo, Akseli, Peltonen, Laura-Maria, Moen, Hans, Pyysalo, Sampo
–arXiv.org Artificial Intelligence
This paper reports on pretraining ModernBERT encoder models in six different sizes, ranging from 51M to 475M parameters, with a focus on limited multilingualism, emphasizing languages relevant to Finland. Our models are competitive with, or superior to, existing multilingual models. They outperform monolingual models on tasks that require a context longer than 512 tokens. We present empirical results on using different data in the final stage of training. The code and models are publicly released.
arXiv.org Artificial Intelligence
Nov-13-2025
- Country:
- Asia (1.00)
- Europe > Finland (0.89)
- North America > United States
- Minnesota (0.28)
- Genre:
- Research Report (1.00)
- Technology: