NER-BERT: A Pre-trained Model for Low-Resource Entity Tagging

Zihan Liu, Feijun Jiang, Yuxiang Hu, Chen Shi, Pascale Fung

arXiv.org Artificial Intelligence 

Named entity recognition (NER) models generally perform poorly when large training datasets are unavailable for low-resource domains. Recently, pre-training a large-scale language model has become a promising direction for coping with the data scarcity issue. However, the underlying discrepancies between the language modeling objective and the NER task could limit the models' performance, and pre-training for the NER task has rarely been studied since the collected NER datasets are generally either small or, when large, of low quality. In this paper, we construct a massive NER corpus with relatively high quality, and we pre-train a NER-BERT model on the created dataset. Experimental results show that our pre-trained model can significantly outperform BERT (Devlin et al., 2019) as well as other strong baselines in low-resource scenarios across nine diverse domains. Moreover, a visualization of entity representations further indicates the effectiveness of NER-BERT for categorizing a variety of entities.
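For orientation, the sketch below illustrates the kind of low-resource fine-tuning setup the abstract describes: a BERT-style encoder with a token-classification head is adapted to a small in-domain NER set. This is not the authors' released code; the checkpoint name, label set, and example sentence are illustrative assumptions, and in the paper's setting the pre-trained NER-BERT checkpoint would replace the vanilla BERT one.

```python
# Minimal sketch of fine-tuning a BERT-style encoder for NER.
# Assumptions: Hugging Face transformers API, "bert-base-cased" as a
# stand-in checkpoint, and a toy tag set / sentence for illustration.
import torch
from transformers import AutoTokenizer, AutoModelForTokenClassification

labels = ["O", "B-PER", "I-PER", "B-ORG", "I-ORG"]  # example tag set
tokenizer = AutoTokenizer.from_pretrained("bert-base-cased")
model = AutoModelForTokenClassification.from_pretrained(
    "bert-base-cased", num_labels=len(labels)
)

# One toy training example; real low-resource fine-tuning would loop
# over a few hundred or a few thousand such sentences.
words = ["Pascale", "Fung", "works", "at", "HKUST", "."]
word_tags = ["B-PER", "I-PER", "O", "O", "B-ORG", "O"]

enc = tokenizer(words, is_split_into_words=True, return_tensors="pt")
# Align word-level tags to subword tokens (-100 is ignored by the loss).
aligned = [
    -100 if wid is None else labels.index(word_tags[wid])
    for wid in enc.word_ids(batch_index=0)
]
enc["labels"] = torch.tensor([aligned])

optimizer = torch.optim.AdamW(model.parameters(), lr=5e-5)
model.train()
loss = model(**enc).loss  # token-level cross-entropy
loss.backward()
optimizer.step()
```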