ArNLI: Arabic Natural Language Inference for Entailment and Contradiction Detection
Jallad, Khloud Al, Ghneim, Nada
–arXiv.org Artificial Intelligence
Natural Language Inference (NLI) is a hot topic research in natural language processing, contradiction detection between sentences is a special case of NLI. This is considered a difficult NLP task which has a big influence when added as a component in many NLP applications, such as Question Answering Systems, text Summarization. Arabic Language is one of the most challenging low-resources languages in detecting contradictions due to its rich lexical, semantics ambiguity. We have created a data set of more than 12k sentences and named ArNLI, that will be publicly available. Moreover, we have applied a new model inspired by Stanford contradiction detection proposed solutions on English language. We proposed an approach to detect contradictions between pairs of sentences in Arabic language using contradiction vector combined with language model vector as an input to machine learning model. We analyzed results of different traditional machine learning classifiers and compared their results on our created data set (ArNLI) and on an automatic translation of both PHEME, SICK English data sets. Best results achieved using Random Forest classifier with an accuracy of 99%, 60%, 75% on PHEME, SICK and ArNLI respectively.
arXiv.org Artificial Intelligence
Sep-28-2022
- Country:
- North America
- Dominican Republic (0.04)
- United States
- Illinois (0.04)
- Ohio > Franklin County
- Columbus (0.04)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- Hawaii > Honolulu County
- Honolulu (0.04)
- California > Santa Clara County
- Palo Alto (0.04)
- Europe
- Portugal (0.04)
- Bulgaria (0.04)
- Slovenia (0.04)
- France > Provence-Alpes-Côte d'Azur
- Bouches-du-Rhône > Marseille (0.04)
- Denmark > Capital Region
- Copenhagen (0.04)
- Italy > Lazio
- Rome (0.04)
- Spain > Catalonia
- Barcelona Province > Barcelona (0.04)
- Ireland > Leinster
- County Dublin > Dublin (0.04)
- Belgium > Brussels-Capital Region
- Brussels (0.04)
- Asia
- Japan (0.05)
- Singapore (0.04)
- Middle East > Syria (0.04)
- North America
- Genre:
- Research Report (0.82)
- Technology: