DualNER: A Dual-Teaching framework for Zero-shot Cross-lingual Named Entity Recognition

Zeng, Jiali; Jiang, Yufan; Yin, Yongjing; Wang, Xu; Lin, Binghuai; Cao, Yunbo

Dec-10-2022, arXiv.org Artificial Intelligence

We present DualNER, a simple and effective framework that makes full use of both an annotated source-language corpus and unlabeled target-language text for zero-shot cross-lingual named entity recognition (NER). In particular, we combine two complementary learning paradigms of NER, i.e., sequence labeling and span prediction, into a unified multi-task framework. After obtaining a sufficiently trained NER model on the source data, we further train it on the target data in a dual-teaching manner, in which the pseudo-labels for one task are constructed from the predictions of the other task. Moreover, based on the span prediction, an entity-aware regularization is proposed to enhance the intrinsic cross-lingual alignment between the same entities in different languages. Experiments and analysis demonstrate the effectiveness of DualNER. Code is available at https://github.com/lemon0830/dualNER.
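
The dual-teaching step relies on translating the output of one task into labels for the other. The following is a minimal sketch of that conversion only, not the authors' implementation: it assumes a BIO tagging scheme for sequence labeling and inclusive token-index spans for span prediction, neither of which is specified in the abstract, and the function names `bio_to_spans` and `spans_to_bio` are hypothetical.

```python
from typing import List, Tuple

# Assumed span format: (start index, end index inclusive, entity type).
Span = Tuple[int, int, str]


def bio_to_spans(tags: List[str]) -> List[Span]:
    """Turn sequence-labeling predictions into pseudo-labels for span prediction."""
    spans: List[Span] = []
    start, etype = None, None
    # A trailing "O" sentinel flushes an entity that ends at the last token.
    for i, tag in enumerate(tags + ["O"]):
        continues = tag.startswith("I-") and etype == tag[2:]
        if start is not None and not continues:
            spans.append((start, i - 1, etype))
            start, etype = None, None
        if tag.startswith("B-"):
            start, etype = i, tag[2:]
    return spans


def spans_to_bio(spans: List[Span], length: int) -> List[str]:
    """Turn span predictions into pseudo-labels for sequence labeling."""
    tags = ["O"] * length
    for start, end, etype in sorted(spans):
        # Assumption: on overlap, the earlier span wins and the later is dropped.
        if any(t != "O" for t in tags[start:end + 1]):
            continue
        tags[start] = f"B-{etype}"
        for i in range(start + 1, end + 1):
            tags[i] = f"I-{etype}"
    return tags


if __name__ == "__main__":
    predicted_tags = ["B-PER", "I-PER", "O", "B-LOC"]
    pseudo_spans = bio_to_spans(predicted_tags)
    print(pseudo_spans)
    print(spans_to_bio(pseudo_spans, len(predicted_tags)))
```

In a dual-teaching loop, the tags predicted by the sequence-labeling head on unlabeled target text would be passed through `bio_to_spans` to supervise the span head, and the span head's predictions through `spans_to_bio` to supervise the sequence-labeling head. How low-confidence predictions are filtered or weighted is left out of this sketch.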



