Unifying Token and Span Level Supervisions for Few-Shot Sequence Labeling
Cheng, Zifeng, Zhou, Qingyu, Jiang, Zhiwei, Zhao, Xuemin, Cao, Yunbo, Gu, Qing
–arXiv.org Artificial Intelligence
Few-shot sequence labeling aims to identify novel classes based on only a few labeled samples. Existing methods solve the data scarcity problem mainly by designing token-level or span-level labeling models based on metric learning. However, these methods are only trained at a single granularity (i.e., either token level or span level) and have some weaknesses of the corresponding granularity. In this paper, we first unify token and span level supervisions and propose a Consistent Dual Adaptive Prototypical (CDAP) network for few-shot sequence labeling. CDAP contains the token-level and span-level networks, jointly trained at different granularities. To align the outputs of two networks, we further propose a consistent loss to enable them to learn from each other. During the inference phase, we propose a consistent greedy inference algorithm that first adjusts the predicted probability and then greedily selects non-overlapping spans with maximum probability. Extensive experiments show that our model achieves new state-of-the-art results on three benchmark datasets.
arXiv.org Artificial Intelligence
Jul-19-2023
- Country:
- North America
- United States > Utah
- Salt Lake County > Salt Lake City (0.04)
- Canada > Quebec
- Montreal (0.04)
- United States > Utah
- Europe
- Monaco > Monaco (0.04)
- United Kingdom > England
- Oxfordshire > Oxford (0.04)
- Slovenia > Central Slovenia
- Municipality of Ljubljana > Ljubljana (0.04)
- Asia
- Taiwan > Taiwan Province
- Taipei (0.04)
- China
- Jiangsu Province > Nanjing (0.05)
- Beijing > Beijing (0.04)
- Sichuan Province > Chengdu (0.04)
- Taiwan > Taiwan Province
- Africa > Ethiopia
- Addis Ababa > Addis Ababa (0.04)
- North America
- Genre:
- Overview (0.67)
- Research Report (0.50)
- Industry:
- Education (0.68)
- Technology: