Curriculum-style Data Augmentation for LLM-based Metaphor Detection
Jia, Kaidi, Wu, Yanxia, Li, Rongsheng
–arXiv.org Artificial Intelligence
Recently, utilizing large language models (LLMs) for metaphor detection has achieved promising results. However, these methods heavily rely on the capabilities of closed-source LLMs, which come with relatively high inference costs and latency. To address this, we propose a method for metaphor detection by fine-tuning open-source LLMs, effectively reducing inference costs and latency with a single inference step. Furthermore, metaphor detection suffers from a severe data scarcity problem, which hinders effective fine-tuning of LLMs. To tackle this, we introduce Curriculum-style Data Augmentation (CDA). Specifically, before fine-tuning, we evaluate the training data to identify correctly predicted instances for fine-tuning, while incorrectly predicted instances are used as seed data for data augmentation. This approach enables the model to quickly learn simpler knowledge and progressively acquire more complex knowledge, thereby improving performance incrementally. Experimental results demonstrate that our method achieves state-of-the-art performance across all baselines. Additionally, we provide detailed ablation studies to validate the effectiveness of CDA.
arXiv.org Artificial Intelligence
Dec-3-2024
- Country:
- Oceania > Australia
- North America
- United States
- Maryland > Baltimore (0.04)
- California (0.04)
- Washington > King County
- Seattle (0.04)
- New York > New York County
- New York City (0.04)
- Minnesota > Hennepin County
- Minneapolis (0.14)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- Illinois > Cook County
- Chicago (0.04)
- Mexico > Mexico City
- Mexico City (0.04)
- Canada > Ontario
- Toronto (0.05)
- United States
- Europe
- Austria > Vienna (0.14)
- Germany > Berlin (0.04)
- United Kingdom > Scotland
- City of Edinburgh > Edinburgh (0.04)
- Spain > Valencian Community
- Valencia Province > Valencia (0.04)
- Italy > Trentino-Alto Adige/Südtirol
- Trentino Province > Trento (0.04)
- Denmark > Capital Region
- Copenhagen (0.04)
- Belgium > Brussels-Capital Region
- Brussels (0.04)
- Asia
- Africa > Rwanda
- Genre:
- Research Report > New Finding (0.34)
- Industry:
- Education (0.69)
- Technology: