Distantly Supervised Morpho-Syntactic Model for Relation Extraction
Gutehrlé, Nicolas, Atanassova, Iana
–arXiv.org Artificial Intelligence
The task of Information Extraction (IE) involves automatically converting unstructured textual content into structured data. Most research in this field concentrates on extracting all facts or a specific set of relationships from documents. In this paper, we present a method for the extraction and categorisation of an unrestricted set of relationships from text. Our method relies on morpho-syntactic extraction patterns obtained by a distant supervision method, and creates Syntactic and Semantic Indices to extract and classify candidate graphs. We evaluate our approach on six datasets built on Wikidata and Wikipedia. The evaluation shows that our approach can achieve Precision scores of up to 0.85, but with lower Recall and F1 scores. Our approach allows to quickly create rule-based systems for Information Extraction and to build annotated datasets to train machine-learning and deep-learning based classifiers.
arXiv.org Artificial Intelligence
Jan-18-2024
- Country:
- Oceania > Australia
- North America
- United States
- Washington > King County
- Seattle (0.14)
- Texas > Travis County
- Austin (0.04)
- Oregon > Multnomah County
- Portland (0.04)
- New York
- New York County > New York City (0.04)
- Monroe County > Rochester (0.04)
- New Mexico > Santa Fe County
- Santa Fe (0.04)
- Minnesota > Hennepin County
- Minneapolis (0.14)
- Massachusetts
- Suffolk County > Boston (0.04)
- Middlesex County > Cambridge (0.04)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- Washington > King County
- Canada
- Quebec > Montreal (0.04)
- British Columbia > Metro Vancouver Regional District
- Vancouver (0.14)
- United States
- Europe
- Bulgaria (0.04)
- Czechia > Prague (0.04)
- Italy > Tuscany
- Florence (0.04)
- Spain
- Valencian Community > Valencia Province
- Valencia (0.04)
- Catalonia > Barcelona Province
- Barcelona (0.04)
- Valencian Community > Valencia Province
- Denmark > Capital Region
- Copenhagen (0.04)
- Sweden > Uppsala County
- Uppsala (0.04)
- France > Bourgogne-Franche-Comté
- Portugal > Lisbon
- Lisbon (0.04)
- Belgium > Brussels-Capital Region
- Brussels (0.04)
- United Kingdom > Scotland
- City of Edinburgh > Edinburgh (0.04)
- Asia
- South Korea (0.04)
- Singapore (0.04)
- Middle East > Qatar
- Genre:
- Research Report (0.64)
- Technology: