Neural Approaches to Entity-Centric Information Extraction
–arXiv.org Artificial Intelligence
Artificial Intelligence (AI) has huge impact on our daily lives with applications such as voice assistants, facial recognition, chatbots, autonomously driving cars, etc. Natural Language Processing (NLP) is a cross-discipline of AI and Linguistics, dedicated to study the understanding of the text. This is a very challenging area due to unstructured nature of the language, with many ambiguous and corner cases. In this thesis we address a very specific area of NLP that involves the understanding of entities (e.g., names of people, organizations, locations) in text. First, we introduce a radically different, entity-centric view of the information in text. We argue that instead of using individual mentions in text to understand their meaning, we should build applications that would work in terms of entity concepts. Next, we present a more detailed model on how the entity-centric approach can be used for the entity linking task. In our work, we show that this task can be improved by considering performing entity linking at the coreference cluster level rather than each of the mentions individually. In our next work, we further study how information from Knowledge Base entities can be integrated into text. Finally, we analyze the evolution of the entities from the evolving temporal perspective.
arXiv.org Artificial Intelligence
Apr-15-2023
- Country:
- South America
- North America > United States
- Illinois (0.04)
- Massachusetts (0.04)
- Maryland (0.04)
- Pennsylvania > Philadelphia County
- Philadelphia (0.04)
- Oregon > Multnomah County
- Portland (0.04)
- Ohio > Stark County
- Alliance (0.04)
- New York > New York County
- New York City (0.04)
- California
- Santa Clara County > Palo Alto (0.04)
- San Diego County > San Diego (0.04)
- Europe
- Poland (0.04)
- Netherlands (0.04)
- Belgium > Flanders (0.04)
- Ukraine (0.04)
- Sweden (0.04)
- Denmark > Capital Region
- Copenhagen (0.04)
- United Kingdom > England
- Greater London > London > Kensington and Chelsea (0.04)
- Greece > Attica
- Athens (0.04)
- Portugal > Lisbon
- Lisbon (0.04)
- Germany > North Rhine-Westphalia
- Cologne Region > Cologne (0.04)
- Asia
- Turkmenistan (0.04)
- India (0.04)
- Middle East
- Jordan (0.04)
- Republic of Türkiye > Batman Province
- Batman (0.04)
- Africa
- North Africa (0.04)
- Middle East > Libya (0.04)
- Genre:
- Overview (1.00)
- Research Report
- New Finding (1.00)
- Experimental Study (0.92)
- Industry:
- Law (1.00)
- Information Technology (1.00)
- Education (0.92)
- Media
- Leisure & Entertainment > Sports
- Soccer (0.67)
- Health & Medicine
- Pharmaceuticals & Biotechnology (0.92)
- Epidemiology (0.67)
- Therapeutic Area
- Psychiatry/Psychology (1.00)
- Immunology (0.67)
- Government
- Technology:
- Information Technology
- Data Science > Data Mining (1.00)
- Artificial Intelligence
- Representation & Reasoning > Expert Systems (1.00)
- Cognitive Science > Problem Solving (1.00)
- Natural Language
- Text Processing (1.00)
- Information Retrieval (1.00)
- Information Extraction (1.00)
- Grammars & Parsing (1.00)
- Machine Learning
- Statistical Learning (1.00)
- Performance Analysis > Accuracy (1.00)
- Neural Networks > Deep Learning (1.00)
- Information Technology