HiligayNER: A Baseline Named Entity Recognition Model for Hiligaynon
Teves, James Ald, Cal, Ray Daniel, Villaluz, Josh Magdiel, Malolos, Jean, Magtira, Mico, Rodriguez, Ramon, Abisado, Mideth, Imperial, Joseph Marvin
–arXiv.org Artificial Intelligence
The language of Hiligaynon, spoken predominantly by the people of Panay Island, Negros Occidental, and Soccsksargen in the Philippines, remains underrepresented in language processing research due to the absence of annotated corpora and baseline models. This study introduces HiligayNER, the first publicly available baseline model for the task of Named Entity Recognition (NER) in Hiligaynon. The dataset used to build HiligayNER contains over 8,000 annotated sentences collected from publicly available news articles, social media posts, and literary texts. Two Transformer-based models, mBERT and XLM-RoBERTa, were fine-tuned on this collected corpus to build versions of HiligayNER. Evaluation results show strong performance, with both models achieving over 80% in precision, recall, and F1-score across entity types. Furthermore, cross-lingual evaluation with Cebuano and Tagalog demonstrates promising transferability, suggesting the broader applicability of HiligayNER for multilingual NLP in low-resource settings. This work aims to contribute to language technology development for underrepresented Philippine languages, specifically for Hiligaynon, and support future research in regional language processing.
arXiv.org Artificial Intelligence
Oct-14-2025
- Country:
- Asia
- Japan > Honshū
- Kantō > Tokyo Metropolis Prefecture > Tokyo (0.14)
- India > Maharashtra
- Pune (0.04)
- Indonesia > Bali (0.04)
- Middle East > Israel
- Haifa District > Haifa (0.04)
- Philippines
- Mindanao > Soccsksargen (0.24)
- Visayas
- Negros Island Region > Province of Negros Occidental (0.24)
- Western Visayas (0.05)
- Southeast Asia (0.04)
- Thailand > Bangkok
- Bangkok (0.04)
- Singapore (0.04)
- Taiwan > Taiwan Province
- Taipei (0.04)
- China > Hong Kong (0.04)
- Japan > Honshū
- Europe > Austria
- Vienna (0.14)
- North America
- Canada > Ontario
- Toronto (0.04)
- United States
- Florida > Miami-Dade County
- Miami (0.04)
- Minnesota > Hennepin County
- Minneapolis (0.14)
- New Mexico > Santa Fe County
- Santa Fe (0.04)
- Texas > Dallas County
- Dallas (0.04)
- Florida > Miami-Dade County
- Canada > Ontario
- Asia
- Genre:
- Research Report > New Finding (0.34)
- Technology: