cner
CNER: A tool Classifier of Named-Entity Relationships
Torres, Jefferson A. Peña, De Piñerez, Raúl E. Gutiérrez
However, Spanish is occasionally adopted as the focus language for research endeavors and as result multiple projects are conducted in Spanish to explore language-specific nuances and challenges in NLP applications. Named-Entity recognition [1], Machine Translation [2], Semantic Relation Extraction [3] among others tasks have been conducted with a focus on Spanish language data, allowing for a more nuanced understanding of the intricacies involved. In this paper we present Classifier for Named Entities Recognized (CNER) a linguistically-aware online service that offers the possibility to test two main tasks of NLP, Named Entity Recognition (NER) and Relation Extraction (RE) for Spanish language. This together with other projects on Spanish language have been evaluated and adapted as a web service. In this context, language technologies and natural language processing (NLP) tools can support the identification of useful information in text and to promote its understanding. Specifically, CNER i) identifies the mentions follow the ACE standard with entity types include Person (PER), Organisation (ORG), Facility (FAC), Location (LOC), Geographical/Political (GPE), Vehicle (VEH), Vehicle (VEH) and Weapon (WEA) [4], [5]; ii) displays three different NER tools as previous step to RE task and iii) offers entity relationship information through tags GPE-AFF, PHYS, DISC, EMP-ORG, ART, NON-REL representing the relations between two entities [6] .
- South America > Colombia > Valle del Cauca Department > Cali (0.05)
- North America > United States > California > Santa Clara County > Palo Alto (0.05)
- Europe > Spain > Catalonia > Barcelona Province > Barcelona (0.05)
- Europe > Portugal > Lisbon > Lisbon (0.05)
Neural Chinese Named Entity Recognition via CNN-LSTM-CRF and Joint Training with Word Segmentation
Wu, Fangzhao, Liu, Junxin, Wu, Chuhan, Huang, Yongfeng, Xie, Xing
Chinese named entity recognition (CNER) is an important task in Chinese natural language processing field. However, CNER is very challenging since Chinese entity names are highly context-dependent. In addition, Chinese texts lack delimiters to separate words, making it difficult to identify the boundary of entities. Besides, the training data for CNER in many domains is usually insufficient, and annotating enough training data for CNER is very expensive and time-consuming. In this paper, we propose a neural approach for CNER. First, we introduce a CNN-LSTM-CRF neural architecture to capture both local and long-distance contexts for CNER. Second, we propose a unified framework to jointly train CNER and word segmentation models in order to enhance the ability of CNER model in identifying entity boundaries. Third, we introduce an automatic method to generate pseudo labeled samples from existing labeled data which can enrich the training data. Experiments on two benchmark datasets show that our approach can effectively improve the performance of Chinese named entity recognition, especially when training data is insufficient.
- North America > United States > California > San Francisco County > San Francisco (0.14)
- Asia > China > Beijing > Beijing (0.05)
- North America > United States > New York > New York County > New York City (0.04)