Spanish Legalese Language Model and Corpora
Gutiérrez-Fandiño, Asier, Armengol-Estapé, Jordi, Gonzalez-Agirre, Aitor, Villegas, Marta
–arXiv.org Artificial Intelligence
There are many Language Models for the English language according to its worldwide relevance. However, for the Spanish language, even if it is a widely spoken language, there are very few Spanish Language Models which result to be small and too general. Legal slang could be think of a Spanish variant on its own as it is very complicated in vocabulary, semantics and phrase understanding. For this work we gathered legal-domain corpora from different sources, generated a model and evaluated against Spanish general domain tasks. The model provides reasonable results in those tasks.
arXiv.org Artificial Intelligence
Oct-23-2021
- Country:
- Europe > France
- Île-de-France > Paris
- Paris (0.05)
- Provence-Alpes-Côte d'Azur > Bouches-du-Rhône
- Marseille (0.05)
- Île-de-France > Paris
- Europe > France
- Genre:
- Research Report (0.50)
- Technology: