Local LLM Ensembles for Zero-shot Portuguese Named Entity Recognition
Sarcinelli, João Lucas Luz Lima; Silva, Diego Furtado
arXiv.org Artificial Intelligence
Large Language Models (LLMs) excel in many Natural Language Processing (NLP) tasks through in-context learning but often underperform in Named Entity Recognition (NER), especially for lower-resource languages like Portuguese. While open-weight LLMs enable local deployment, no single model dominates all tasks, which motivates ensemble approaches. However, existing LLM ensembles focus on text generation or classification, leaving NER underexplored. This work proposes a novel three-step ensemble pipeline for zero-shot NER using similarly capable, locally run LLMs. Our method outperforms individual LLMs on four out of five Portuguese NER datasets by using a heuristic that selects model combinations with minimal annotated data. Moreover, we show that ensembles selected on different source datasets generally outperform individual LLMs in cross-dataset configurations, potentially eliminating the need for annotated data for the target task.
Dec-12-2025