Goto

Collaborating Authors

 reverse dictionary


Advancing Arabic Reverse Dictionary Systems: A Transformer-Based Approach with Dataset Construction Guidelines

Sibaee, Serry, Ahmed, Samar, Harbi, Abdullah Al, Nacar, Omer, Ammar, Adel, Habashi, Yasser, Boulila, Wadii

arXiv.org Artificial Intelligence

This study addresses the critical gap in Arabic natural language processing by developing an effective Arabic Reverse Dictionary (RD) system that enables users to find words based on their descriptions or meanings. We present a novel transformer-based approach with a semi-encoder neural network architecture featuring geometrically decreasing layers that achieves state-of-the-art results for Arabic RD tasks. Our methodology incorporates a comprehensive dataset construction process and establishes formal quality standards for Arabic lexicographic definitions. Experiments with various pre-trained models demonstrate that Arabic-specific models significantly outperform general multilingual embeddings, with ARBERTv2 achieving the best ranking score (0.0644). Additionally, we provide a formal abstraction of the reverse dictionary task that enhances theoretical understanding and develop a modular, extensible Python library (RDTL) with configurable training pipelines. Our analysis of dataset quality reveals important insights for improving Arabic definition construction, leading to eight specific standards for building high-quality reverse dictionary resources. This work contributes significantly to Arabic computational linguistics and provides valuable tools for language learning, academic writing, and professional communication in Arabic.


Rosetta Stone at KSAA-RD Shared Task: A Hop From Language Modeling To Word--Definition Alignment

ElBakry, Ahmed, Gabr, Mohamed, ElNokrashy, Muhammad, AlKhamissi, Badr

arXiv.org Artificial Intelligence

A Reverse Dictionary is a tool enabling users to discover a word based on its provided definition, meaning, or description. Such a technique proves valuable in various scenarios, aiding language learners who possess a description of a word without its identity, and benefiting writers seeking precise terminology. These scenarios often encapsulate what is referred to as the "Tip-of-the-Tongue" (TOT) phenomena. In this work, we present our winning solution for the Arabic Reverse Dictionary shared task. This task focuses on deriving a vector representation of an Arabic word from its accompanying description. The shared task encompasses two distinct subtasks: the first involves an Arabic definition as input, while the second employs an English definition. For the first subtask, our approach relies on an ensemble of finetuned Arabic BERT-based models, predicting the word embedding for a given definition. The final representation is obtained through averaging the output embeddings from each model within the ensemble. In contrast, the most effective solution for the second subtask involves translating the English test definitions into Arabic and applying them to the finetuned models originally trained for the first subtask. This straightforward method achieves the highest score across both subtasks.


A Unified Model for Reverse Dictionary and Definition Modelling

Chen, Pinzhen, Zhao, Zheng

arXiv.org Artificial Intelligence

We build a dual-way neural dictionary to retrieve words given definitions, and produce definitions for queried words. The model learns the two tasks simultaneously and handles unknown words via embeddings. It casts a word or a definition to the same representation space through a shared layer, then generates the other form in a multi-task fashion. Our method achieves promising automatic scores on previous benchmarks without extra resources. Human annotators prefer the model's outputs in both reference-less and reference-based evaluation, indicating its practicality. Analysis suggests that multiple objectives benefit learning.


Creating Reverse Bilingual Dictionaries

Lam, Khang Nhut, Kalita, Jugal

arXiv.org Artificial Intelligence

Bilingual dictionaries are expensive resources and not many are available when one of the languages is resource-poor. In this paper, we propose algorithms for creation of new reverse bilingual dictionaries from existing bilingual dictionaries in which English is one of the two languages. Our algorithms exploit the similarity between word-concept pairs using the English Wordnet to produce reverse dictionary entries. Since our algorithms rely on available bilingual dictionaries, they are applicable to any bilingual dictionary as long as one of the two languages has Wordnet type lexical ontology.


WordAlchemy: A transformer-based Reverse Dictionary

Mane, Sunil B., Patil, Harshal, Madaswar, Kanhaiya, Sadavarte, Pranav

arXiv.org Artificial Intelligence

Abstract--A reverse dictionary takes a target word's description This difficulty is handled by the'reverse dictionary'. Currently, there does not exist any Reverse Dictionary provider with support for any Indian Language. Dictionaries have many practical usages e.g. This architecture uses the Translation Language Modeling (TLM) technique, rather than the conventional BERT's Masked Sometimes, new language learners can describe a word in a particular language but fail to retrieve the exact word I. Anomia patients, people who are able to recognize and describe an object but are not able to name it due to a neurological disorder, can also be assisted by using a reverse dictionary. To address all these issues more accurately, we propose and develop a novel open-source Reverse Dictionary named "WordAlchemy", mainly based on the proposed transformerbased mT5 [2] model. In this section, we will focus on the previous work in the domain of Reverse Dictionary.