Leveraging Open-Source Large Language Models for Native Language Identification
–arXiv.org Artificial Intelligence
Native Language Identification (NLI) - the task of identifying the native language (L1) of a person based on their writing in the second language (L2) - has applications in forensics, marketing, and second language acquisition. Historically, conventional machine learning approaches that heavily rely on extensive feature engineering have outperformed transformer-based language models on this task. Recently, closed-source generative large language models (LLMs), e.g., GPT-4, have demonstrated remarkable performance on NLI in a zero-shot setting, including promising results in open-set classification. However, closed-source LLMs have many disadvantages, such as high costs and undisclosed nature of training data. This study explores the potential of using open-source LLMs for NLI. Our results indicate that open-source LLMs do not reach the accuracy levels of closed-source LLMs when used out-of-the-box. However, when fine-tuned on labeled training data, open-source LLMs can achieve performance comparable to that of commercial LLMs.
arXiv.org Artificial Intelligence
Sep-15-2024
- Country:
- North America
- United States
- New York > New York County
- New York City (0.04)
- Minnesota > Hennepin County
- Minneapolis (0.04)
- Georgia > Fulton County
- Atlanta (0.04)
- Colorado > Denver County
- Denver (0.04)
- New York > New York County
- Mexico > Mexico City
- Mexico City (0.05)
- Canada > British Columbia
- United States
- Europe
- Italy (0.04)
- United Kingdom
- Scotland > City of Edinburgh
- Edinburgh (0.04)
- England > Cambridgeshire
- Cambridge (0.04)
- Scotland > City of Edinburgh
- Spain > Catalonia
- Barcelona Province > Barcelona (0.04)
- Netherlands > North Holland
- Amsterdam (0.05)
- Middle East > Malta
- Eastern Region > Northern Harbour District > St. Julian's (0.04)
- Denmark > Capital Region
- Copenhagen (0.04)
- Belgium > Wallonia
- Walloon Brabant > Louvain-la-Neuve (0.04)
- Asia
- North America
- Genre:
- Research Report > New Finding (0.88)
- Technology: