3D-EX: A Unified Dataset of Definitions and Dictionary Examples
Almeman, Fatemah; Sheikhi, Hadi; Espinosa-Anke, Luis
arXiv.org Artificial Intelligence
Definitions are a fundamental building block in lexicography, linguistics and computational semantics. In NLP, they have been used for retrofitting word embeddings or augmenting contextual representations in language models. However, lexical resources containing definitions exhibit a wide range of properties, which has implications for the behaviour of models trained and evaluated on them. In this paper, we introduce 3D-EX, a dataset that aims to fill this gap by combining well-known English resources into one centralized knowledge repository in the form of (term, definition, example) triples. 3D-EX is a unified evaluation framework with carefully pre-computed train/validation/test splits to prevent memorization. We report experimental results that suggest that this dataset could be effectively leveraged in downstream NLP tasks. Code and data are available at https://github.com/F-Almeman/3D-EX.
Aug-11-2023
- Country:
  - Asia
    - China (0.04)
    - Japan > Honshū
      - Kansai > Osaka Prefecture > Osaka (0.04)
    - Middle East > Iran (0.04)
    - South Korea > Busan
      - Busan (0.04)
    - Taiwan > Taiwan Province
      - Taipei (0.04)
  - Europe
    - France > Provence-Alpes-Côte d'Azur
      - Bouches-du-Rhône > Marseille (0.04)
    - Ireland > Leinster
      - County Dublin > Dublin (0.04)
    - Italy (0.04)
    - Spain > Catalonia
      - Barcelona Province > Barcelona (0.04)
    - Sweden
      - Uppsala County > Uppsala (0.04)
      - Vaestra Goetaland > Gothenburg (0.04)
    - United Kingdom > England
      - Greater London > London > City of Westminster (0.04)
  - North America
    - Dominican Republic (0.04)
    - United States
      - Minnesota > Hennepin County
        - Minneapolis (0.14)
      - Washington > King County
        - Seattle (0.04)
  - Oceania > Australia
- Genre:
  - Research Report (1.00)
- Technology: