A dynamic programming algorithm for span-based nested named-entity recognition in O(n^2)
–arXiv.org Artificial Intelligence
Named entity recognition (NER) is a fundamental problem in information retrieval that aims to identify mentions of entities and their associated types in natural language documents. As such, the problem can be reduced to the identification and classification of segments of texts. In particular, we focus on mentions that have the following properties:
1. continuous, i.e. a mention corresponds to a contiguous sequence of words;
2. potentially nested, i.e. one mention can be inside another, but they can never partially overlap.

Our main contributions can be summarized as follows:
- We present the semi-Markov and CYK-like models for non-nested and nested NER, respectively. Although we do not claim that these approaches for NER are new, our presentation of the CYK-like algorithm differs from previous work as it is tailored to the NER problem and guarantees uniqueness of derivations;
- We introduce a novel search space for nested NER that has no significant loss in coverage
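The two mention properties stated in the excerpt (contiguous, and nested without partial overlap) can be checked mechanically. The following is a minimal sketch, not code from the paper: it assumes each mention is represented as a hypothetical `(start, end)` pair of word indices with `end` exclusive, and the names `partially_overlap` and `is_valid_nested` are illustrative only.

```python
from itertools import combinations
from typing import List, Tuple

# Assumed representation (not from the paper): a mention is a contiguous
# span of words [start, end), with end exclusive.
Span = Tuple[int, int]


def partially_overlap(a: Span, b: Span) -> bool:
    """Return True if the spans cross: they share at least one word,
    but neither span contains the other."""
    (s1, e1), (s2, e2) = sorted([a, b])
    # After sorting, s1 <= s2. The spans cross only when the second one
    # starts strictly inside the first and ends strictly after it.
    return s1 < s2 < e1 < e2


def is_valid_nested(spans: List[Span]) -> bool:
    """Return True if no pair of spans partially overlaps, i.e. every
    pair is either disjoint or nested."""
    return not any(partially_overlap(a, b) for a, b in combinations(spans, 2))
```

For example, `is_valid_nested([(0, 5), (1, 3), (3, 5)])` holds because the two shorter spans sit inside the longer one and are disjoint from each other, whereas `[(0, 3), (2, 5)]` is rejected because the spans cross. The pairwise check is quadratic in the number of mentions, which is acceptable for a sanity check on annotated data.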
May-26-2023
- Country:
  - Asia
    - China > Hong Kong (0.04)
    - Middle East > Jordan (0.04)
    - Singapore (0.04)
    - South Korea (0.04)
  - Europe
    - Belgium > Brussels-Capital Region
      - Brussels (0.04)
    - Czechia > Prague (0.04)
    - Denmark > Capital Region
      - Copenhagen (0.04)
    - France (0.04)
    - Germany > North Rhine-Westphalia
      - Cologne Region > Bonn (0.05)
    - Ireland > Leinster
      - County Dublin > Dublin (0.04)
    - Italy > Tuscany
      - Florence (0.04)
    - Portugal > Lisbon
      - Lisbon (0.04)
  - North America
    - Canada > British Columbia
    - United States
      - California > Los Angeles County
        - Los Angeles (0.14)
      - Colorado > Boulder County
        - Boulder (0.04)
      - Louisiana > Orleans Parish
        - New Orleans (0.04)
      - Maryland > Baltimore (0.04)
      - Massachusetts > Middlesex County
        - Cambridge (0.04)
      - Minnesota > Hennepin County
        - Minneapolis (0.14)
- Genre:
  - Research Report (0.82)
- Technology: