Historical Ink: Exploring Large Language Models for Irony Detection in 19th-Century Spanish

Cohen, Kevin, Manrique-Gómez, Laura, Manrique, Rubén

Mar-28-2025–arXiv.org Artificial Intelligence

This study explores the use of large language models (LLMs) to enhance datasets and improve irony detection in 19th-century Latin American newspapers. Two strategies were employed to evaluate the efficacy of BERT and GPT-4o models in capturing the subtle nuances nature of irony, through both multi-class and binary classification tasks. First, we implemented dataset enhancements focused on enriching emotional and contextual cues; however, these showed limited impact on historical language analysis. The second strategy, a semi-automated annotation process, effectively addressed class imbalance and augmented the dataset with high-quality annotations. Despite the challenges posed by the complexity of irony, this work contributes to the advancement of sentiment analysis through two key contributions: introducing a new historical Spanish dataset tagged for sentiment analysis and irony detection, and proposing a semi-automated annotation methodology where human expertise is crucial for refining LLMs results, enriched by incorporating historical and cultural contexts as core features.

large language model, machine learning, natural language, (19 more...)

arXiv.org Artificial Intelligence

Mar-28-2025

arXiv.org PDF

Add feedback

Country:
- South America > Colombia
  - Bogotá D.C. > Bogotá (0.04)
- North America
  - Mexico (0.04)
  - Central America (0.04)
  - United States
    - Florida > Miami-Dade County
      - Miami (0.04)
    - California > San Diego County
      - San Diego (0.04)
- Asia
  - Singapore (0.04)
  - Thailand > Bangkok
    - Bangkok (0.04)

Genre:
- Research Report (1.00)

Industry:
- Media > News (0.48)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language > Large Language Model (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found