Aligning Instruction Tasks Unlocks Large Language Models as Zero-Shot Relation Extractors

Kai Zhang, Bernal Jiménez Gutiérrez, Yu Su

May-18-2023–arXiv.org Artificial Intelligence

Recent work has shown that fine-tuning large language models (LLMs) on large-scale instruction-following datasets substantially improves their performance on a wide range of NLP tasks, especially in the zero-shot setting. However, even advanced instruction-tuned LLMs still fail to outperform small LMs on relation extraction (RE), a fundamental information extraction task. We hypothesize that instruction-tuning has been unable to elicit strong RE capabilities in LLMs due to RE's low incidence in instruction-tuning datasets, making up less than 1% of all tasks (Wang et al., 2022). To address this limitation, we propose QA4RE, a framework that aligns RE with question answering (QA), a predominant task in instruction-tuning datasets. Comprehensive zero-shot RE experiments over four datasets with two series of instruction-tuned LLMs (six LLMs in total) demonstrate that our QA4RE framework consistently improves LLM performance, strongly verifying our hypothesis and enabling LLMs to outperform strong zero-shot baselines by a large margin. Additionally, we provide thorough experiments and discussions to show the robustness, few-shot effectiveness, and strong transferability of our QA4RE framework. This work illustrates a promising way of adapting LLMs to challenging and underrepresented tasks by aligning these tasks with more common instruction-tuning tasks like QA.
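To illustrate the core idea of aligning RE with QA, the following is a minimal sketch of how a single RE instance could be recast as a multiple-choice question. It is not the authors' implementation: the relation labels, the templates in `TEMPLATES`, the prompt wording, and the helper names `build_qa_prompt` and `parse_answer` are all assumptions made for this example, and the call to an actual LLM is left out.

```python
from string import ascii_uppercase

# Hypothetical relation templates; labels and wording are illustrative only
# and are not taken from the paper.
TEMPLATES = {
    "org:founded_by": "{head} was founded by {tail}.",
    "per:employee_of": "{head} is an employee of {tail}.",
    "no_relation": "{head} has no known relation to {tail}.",
}


def build_qa_prompt(sentence, head, tail, templates=TEMPLATES):
    """Turn one RE instance into a multiple-choice question.

    Returns the prompt text and a mapping from option letter to relation label.
    """
    letter_to_relation = {}
    option_lines = []
    for letter, (relation, template) in zip(ascii_uppercase, templates.items()):
        letter_to_relation[letter] = relation
        option_lines.append(f"{letter}. {template.format(head=head, tail=tail)}")
    prompt = (
        "Determine which option can be inferred from the given sentence.\n\n"
        f"Sentence: {sentence}\n"
        "Options:\n" + "\n".join(option_lines) + "\n\n"
        "Which option can be inferred from the given sentence?\n"
        "Option:"
    )
    return prompt, letter_to_relation


def parse_answer(model_output, letter_to_relation, default="no_relation"):
    """Map the model's reply (e.g. 'A.' or ' b') back to a relation label."""
    reply = model_output.strip()
    if reply and reply[0].upper() in letter_to_relation:
        return letter_to_relation[reply[0].upper()]
    # Fall back to the default label when the reply is not a known option.
    return default


prompt, mapping = build_qa_prompt(
    "Steve Jobs co-founded Apple in 1976.", head="Apple", tail="Steve Jobs"
)
print(prompt)
```

The prompt would then be sent to an instruction-tuned LLM, and the returned option letter passed to `parse_answer` to recover the relation label. Phrasing each candidate relation as a natural-language option is what lets the task resemble the QA instances that dominate instruction-tuning data.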

large language model, machine learning, natural language

Country:
- North America > United States
  - Ohio (0.04)
  - California (0.04)
  - Washington > King County
    - Seattle (0.04)
- Europe
  - Austria (0.04)
  - Sweden > Uppsala County
    - Uppsala (0.04)
  - Ireland > Leinster
    - County Dublin > Dublin (0.04)
  - France > Auvergne-Rhône-Alpes
    - Lyon > Lyon (0.04)
  - Denmark > Capital Region
    - Copenhagen (0.04)
- Asia
  - China > Hong Kong (0.04)
  - Middle East > UAE
    - Abu Dhabi Emirate > Abu Dhabi (0.04)

Genre:
- Research Report (1.00)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language > Large Language Model (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (0.95)
