NERsocial: Efficient Named Entity Recognition Dataset Construction for Human-Robot Interaction Utilizing RapidNER
Atuhurra, Jesse; Kamigaito, Hidetaka; Ouchi, Hiroki; Shindo, Hiroyuki; Watanabe, Taro
arXiv.org Artificial Intelligence
Adapting named entity recognition (NER) methods to new domains poses significant challenges. We introduce RapidNER, a framework designed for the rapid deployment of NER systems through efficient dataset construction. RapidNER operates in three key steps: (1) extracting domain-specific sub-graphs and triples from a general knowledge graph, (2) collecting texts from various sources and using them to build the NERsocial dataset, which focuses on entities typical of human-robot interaction, and (3) implementing an annotation scheme based on Elasticsearch (ES) to improve annotation efficiency. NERsocial, validated by human annotators, includes six entity types, 153K tokens, and 99.4K sentences, demonstrating RapidNER's capability to expedite dataset creation.
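As a rough illustration of steps (1) and (3), the sketch below filters a toy triple list down to domain entities and then tags a sentence with BIO labels by exact phrase lookup. It is only a minimal sketch under stated assumptions. The triples, the `instance_of` relation, the class-to-type mapping, and the function names are hypothetical, and the plain in-memory lookup is a stand-in for the Elasticsearch-based annotation scheme, not the authors' implementation.

```python
# Hypothetical toy knowledge graph as (subject, relation, object) triples.
TRIPLES = [
    ("Ramen", "instance_of", "Food"),
    ("Green tea", "instance_of", "Drink"),
    ("Judo", "instance_of", "Sport"),
    ("Paris", "instance_of", "City"),
]

# Hypothetical mapping from knowledge-graph classes to entity types.
DOMAIN_CLASSES = {"Food": "FOOD", "Drink": "DRINK", "Sport": "SPORT"}


def extract_domain_entities(triples, domain_classes):
    """Step (1), sketched: keep subjects whose class belongs to the domain."""
    entities = {}
    for subj, rel, obj in triples:
        if rel == "instance_of" and obj in domain_classes:
            entities[subj] = domain_classes[obj]
    return entities


def annotate(sentence, entities):
    """Step (3), sketched: assign BIO labels by exact phrase lookup.

    A simple in-memory match stands in for an Elasticsearch query here.
    """
    tokens = sentence.split()
    labels = ["O"] * len(tokens)
    # Normalise tokens: lowercase and drop trailing/leading punctuation.
    lowered = [t.lower().strip(".,!?") for t in tokens]
    # Try longer names first so multi-word entities take precedence.
    ordered = sorted(entities.items(), key=lambda kv: -len(kv[0].split()))
    for name, etype in ordered:
        parts = name.lower().split()
        n = len(parts)
        for i in range(len(tokens) - n + 1):
            span_free = all(label == "O" for label in labels[i:i + n])
            if lowered[i:i + n] == parts and span_free:
                labels[i] = "B-" + etype
                for j in range(i + 1, i + n):
                    labels[j] = "I-" + etype
    return list(zip(tokens, labels))


if __name__ == "__main__":
    domain_entities = extract_domain_entities(TRIPLES, DOMAIN_CLASSES)
    for token, label in annotate("I love green tea and ramen.", domain_entities):
        print(token, label)
```

In this sketch the out-of-domain entry ("Paris") is dropped at the filtering stage, so it can never produce a label during annotation.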
Nov-27-2024
- Country:
- Asia > Indonesia
- Sumatra (0.14)
- Europe > Russia
- Central Federal District > Moscow Oblast (0.14)
- North America > United States
- Minnesota (0.14)
- Genre:
- Research Report > New Finding (0.46)
- Industry:
- Media
- Retail (0.67)
- Government > Military (0.67)
- Transportation
- Banking & Finance (1.00)
- Health & Medicine
- Consumer Health (1.00)
- Therapeutic Area (1.00)
- Information Technology (1.00)
- Leisure & Entertainment
- Games (1.00)
- Sports
- Martial Arts (1.00)
- Motorsports (1.00)
- Consumer Products & Services
- Food, Beverage, Tobacco & Cannabis > Beverages (1.00)
- Restaurants (0.67)
- Law Enforcement & Public Safety (0.67)
- Technology: