Learning In-context Learning for Named Entity Recognition
Chen, Jiawei, Lu, Yaojie, Lin, Hongyu, Lou, Jie, Jia, Wei, Dai, Dai, Wu, Hua, Cao, Boxi, Han, Xianpei, Sun, Le
–arXiv.org Artificial Intelligence
Named entity recognition in real-world applications suffers from the diversity of entity types, the emergence of new entity types, and the lack of high-quality annotations. To address the above problems, this paper proposes an in-context learning-based NER approach, which can effectively inject in-context NER ability into PLMs and recognize entities of novel types on-the-fly using only a few demonstrative instances. Specifically, we model PLMs as a meta-function $\mathcal{ \lambda_ {\text{instruction, demonstrations, text}}. M}$, and a new entity extractor can be implicitly constructed by applying new instruction and demonstrations to PLMs, i.e., $\mathcal{ (\lambda . M) }$(instruction, demonstrations) $\to$ $\mathcal{F}$ where $\mathcal{F}$ will be a new entity extractor, i.e., $\mathcal{F}$: text $\to$ entities. To inject the above in-context NER ability into PLMs, we propose a meta-function pre-training algorithm, which pre-trains PLMs by comparing the (instruction, demonstration)-initialized extractor with a surrogate golden extractor. Experimental results on 4 few-shot NER datasets show that our method can effectively inject in-context NER ability into PLMs and significantly outperforms the PLMs+fine-tuning counterparts.
arXiv.org Artificial Intelligence
May-26-2023
- Country:
- North America
- Dominican Republic (0.04)
- United States
- Washington > King County
- Seattle (0.04)
- Minnesota > Hennepin County
- Minneapolis (0.14)
- California > San Diego County
- San Diego (0.04)
- Washington > King County
- Canada > Alberta
- Europe
- Ireland > Leinster
- County Dublin > Dublin (0.05)
- Denmark > Capital Region
- Copenhagen (0.04)
- Ireland > Leinster
- Asia > China
- North America
- Genre:
- Research Report (0.82)
- Industry:
- Technology: