TimeLMs: Diachronic Language Models from Twitter
Loureiro, Daniel, Barbieri, Francesco, Neves, Leonardo, Anke, Luis Espinosa, Camacho-Collados, Jose
–arXiv.org Artificial Intelligence
Despite its importance, the time variable has been largely neglected in the NLP and language model literature. In this paper, we present TimeLMs, a set of language models specialized on diachronic Twitter data. We show that a continual learning strategy contributes to enhancing Twitter-based language models' capacity to deal with future and out-of-distribution tweets, while making them competitive with standardized and more monolithic benchmarks. We also perform a number of qualitative analyses showing how they cope with trends and peaks in activity involving specific named entities or concept drift.
arXiv.org Artificial Intelligence
Feb-8-2022
- Country:
- North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
- Genre:
- Research Report > New Finding (0.46)
- Industry:
- Health & Medicine (0.48)
- Information Technology > Services (0.48)
- Technology: