DualNER: A Dual-Teaching framework for Zero-shot Cross-lingual Named Entity Recognition
Jiali Zeng, Yufan Jiang, Yongjing Yin, Xu Wang, Binghuai Lin, Yunbo Cao
arXiv.org Artificial Intelligence
We present DualNER, a simple and effective framework that makes full use of both the annotated source-language corpus and unlabeled target-language text for zero-shot cross-lingual named entity recognition (NER). In particular, we combine two complementary learning paradigms of NER, i.e., sequence labeling and span prediction, into a unified multi-task framework. After training a sufficiently strong NER model on the source data, we further train it on the target data in a dual-teaching manner, in which the pseudo-labels for one task are constructed from the predictions of the other task. Moreover, based on the span prediction, we propose an entity-aware regularization that enhances the intrinsic cross-lingual alignment between the same entities in different languages. Experiments and analysis demonstrate the effectiveness of DualNER. Code is available at https://github.com/lemon0830/dualNER.
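To illustrate how the predictions of one task can serve as pseudo-labels for the other, the sketch below converts between the two label formats: BIO tags for sequence labeling and (start, end, type) spans for span prediction. It is a minimal illustration under stated assumptions, not the authors' implementation: the function names `bio_to_spans` and `spans_to_bio`, the inclusive-end span convention, and the BIO scheme itself are hypothetical choices, since the abstract does not specify how the pseudo-labels are constructed.

```python
from typing import List, Tuple

# Hypothetical span format: (start index, end index inclusive, entity type).
Span = Tuple[int, int, str]


def bio_to_spans(tags: List[str]) -> List[Span]:
    """Turn sequence-labeling output (BIO tags) into span pseudo-labels."""
    spans: List[Span] = []
    start, etype = None, None
    for i, tag in enumerate(tags):
        # An open entity continues only on an I- tag of the same type.
        continues = tag.startswith("I-") and etype == tag[2:]
        if start is not None and not continues:
            spans.append((start, i - 1, etype))
            start, etype = None, None
        if tag.startswith("B-"):
            start, etype = i, tag[2:]
        # A stray I- tag with no open entity is ignored (assumption).
    if start is not None:
        spans.append((start, len(tags) - 1, etype))
    return spans


def spans_to_bio(spans: List[Span], length: int) -> List[str]:
    """Turn span-prediction output into BIO pseudo-labels.

    Assumes the spans do not overlap.
    """
    tags = ["O"] * length
    for start, end, etype in spans:
        tags[start] = f"B-{etype}"
        for i in range(start + 1, end + 1):
            tags[i] = f"I-{etype}"
    return tags


# Predictions of the sequence-labeling head on an unlabeled target sentence.
seq_pred = ["B-PER", "I-PER", "O", "B-LOC"]
# Pseudo-labels for the span-prediction head, built from those predictions.
span_pseudo = bio_to_spans(seq_pred)
# Going the other way: span predictions become tag pseudo-labels.
tag_pseudo = spans_to_bio(span_pseudo, len(seq_pred))
```

In a dual-teaching loop of this kind, each head would be trained on the target text against the pseudo-labels derived from the other head's predictions; any confidence filtering or loss weighting is left out here because the abstract does not describe it.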
Dec-10-2022