Fine-Grained Named Entities for Corona News

Apr-20-2024–arXiv.org Artificial Intelligence

Information resources such as newspapers have produced unstructured text data in various languages related to the corona outbreak since December 2019. Analyzing these unstructured texts is time-consuming without representing them in a structured format; therefore, representing them in a structured format is crucial. An information extraction pipeline with essential tasks -- named entity tagging and relation extraction -- to accomplish this goal might be applied to these texts. This study proposes a data annotation pipeline to generate training data from corona news articles, including generic and domain-specific entities. Named entity recognition models are trained on this annotated corpus and then evaluated on test sentences manually annotated by domain experts evaluating the performance of a trained model. The code base and demonstration are available at https://github.com/sefeoglu/coronanews-ner.git.

corona news article, ner model, news article, (14 more...)

arXiv.org Artificial Intelligence

Apr-20-2024

arXiv.org PDF

Add feedback

Country:
- Europe > Germany
  - Berlin (0.05)
- Asia > China
  - Hubei Province > Wuhan (0.04)

Genre:
- Research Report (0.66)

Industry:
- Health & Medicine > Therapeutic Area
  - Infections and Infectious Diseases (1.00)
  - Immunology (1.00)

Technology:
- Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found