Gazetteer Generation for Neural Named Entity Recognition

Song, Chan Hee (University of Notre Dame) | Lawrie, Dawn (Johns Hopkins University) | Finin, Tim (University of Maryland Baltimore County) | Mayfield, James (Johns Hopkins University)

AAAI Conferences 

We present a way to generate gazetteers from the Wikidata knowledge graph and use the resulting lists to improve a neural NER system by adding an input feature indicating that a word is part of a name in the gazetteer. We empirically show that the approach yields performance gains in two distinct languages: English, a high-resource, word-based language, and Chinese, a high-resource, character-based language. We also apply the approach to a low-resource language, Russian, using a new annotated Russian NER corpus from Reddit tagged with four core and eleven extended types, and report a baseline score.
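The gazetteer input feature described above can be illustrated with a minimal sketch. It is not the authors' implementation: the function names `build_gazetteer` and `gazetteer_features`, the one-binary-indicator-per-type encoding, the lowercase whitespace tokenization, and the `max_len` span limit are all assumptions made for illustration, and the gazetteer entries are hard-coded here rather than drawn from Wikidata.

```python
from typing import Dict, List, Set, Tuple

Gazetteer = Dict[str, Set[Tuple[str, ...]]]


def build_gazetteer(names_by_type: Dict[str, List[str]]) -> Gazetteer:
    """Index names by entity type as lowercase token tuples (assumed encoding)."""
    return {
        etype: {tuple(name.lower().split()) for name in names}
        for etype, names in names_by_type.items()
    }


def gazetteer_features(
    tokens: List[str], gazetteer: Gazetteer, max_len: int = 5
) -> Tuple[List[List[int]], List[str]]:
    """Return one binary indicator per entity type for every token.

    A token's indicator for a type is 1 if the token lies inside any span
    of up to `max_len` tokens that exactly matches a gazetteer name of
    that type. `max_len` is a hypothetical cap on name length.
    """
    types = sorted(gazetteer)
    lowered = [t.lower() for t in tokens]
    features = [[0] * len(types) for _ in tokens]
    for col, etype in enumerate(types):
        entries = gazetteer[etype]
        for start in range(len(lowered)):
            last_end = min(start + max_len, len(lowered))
            for end in range(start + 1, last_end + 1):
                if tuple(lowered[start:end]) in entries:
                    # Mark every token covered by the matched name.
                    for i in range(start, end):
                        features[i][col] = 1
    return features, types


if __name__ == "__main__":
    gaz = build_gazetteer({
        "PER": ["Tim Finin"],
        "ORG": ["University of Notre Dame"],
    })
    sentence = "Tim Finin visited the University of Notre Dame".split()
    feats, types = gazetteer_features(sentence, gaz)
    for token, row in zip(sentence, feats):
        print(token, dict(zip(types, row)))
```

In a neural tagger, a vector like this would typically be concatenated with each token's word or character embedding before the encoder. A character-based language such as Chinese would need character-level rather than whitespace tokenization, which this sketch does not cover.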
