Transformer-based Language Models for Reasoning in the Description Logic ALCQ
Poulis, Angelos; Tsalapati, Eleni; Koubarakis, Manolis
arXiv.org Artificial Intelligence
Recent advancements in transformer-based language models have sparked research into their logical reasoning capabilities. Most benchmarks used to evaluate these models are simple: they are generated from short (fragments of) first-order logic sentences with only a few logical operators and quantifiers. We construct a natural language dataset, DELTA$_D$, using the expressive description logic $\mathcal{ALCQ}$. DELTA$_D$ comprises 384K examples and scales in difficulty along two dimensions: (i) reasoning depth and (ii) linguistic complexity. With it, we systematically investigate the logical reasoning capabilities of a supervised fine-tuned DeBERTa-based model and two large language models (GPT-3.5, GPT-4) with few-shot prompting. We show that the DeBERTa-based model fine-tuned on our dataset can master the entailment checking task. Moreover, the performance of the GPT models can improve significantly even with a small number of in-context samples (9 shots). We open-source our code and datasets.
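To make the few-shot setting concrete, the following is a minimal sketch of how a 9-shot prompt for entailment checking might be assembled. It is not the authors' code: the `Example` class, the `Context/Question/Answer` template, the label set, and the toy sentences are all assumptions made for illustration, and none of them are taken from DELTA$_D$.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Example:
    context: str   # natural-language sentences standing in for a knowledge base
    question: str  # statement whose entailment is being checked
    label: str     # assumed label set: "True", "False", "Unknown"


def format_example(ex: Example, with_label: bool = True) -> str:
    """Render one example; this template is hypothetical, not the paper's."""
    text = f"Context: {ex.context}\nQuestion: {ex.question}\nAnswer:"
    return f"{text} {ex.label}" if with_label else text


def build_few_shot_prompt(shots: List[Example], query: Example) -> str:
    """Concatenate the solved examples, then the unanswered query."""
    blocks = [format_example(s) for s in shots]
    blocks.append(format_example(query, with_label=False))
    return "\n\n".join(blocks)


# Toy, hand-written examples (not drawn from the dataset).
shots = [
    Example(
        context=f"Person{i} admires at least two people. "
                "Everyone who admires at least two people is kind.",
        question=f"Person{i} is kind.",
        label="True",
    )
    for i in range(9)
]
query = Example(
    context="Anne admires only Bob. "
            "Everyone who admires at least two people is kind.",
    question="Anne is kind.",
    label="",  # not known at prompt time
)
prompt = build_few_shot_prompt(shots, query)
```

The resulting `prompt` string ends with an open `Answer:` slot for the model to complete. Sending it to a particular model is deliberately left out, since the abstract does not describe that step.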
Oct-12-2024
- Country:
  - North America
    - Dominican Republic (0.04)
    - United States > Georgia
      - Clarke County > Athens (0.04)
    - Puerto Rico > San Juan
      - San Juan (0.04)
    - Canada > Ontario
      - Toronto (0.04)
  - Europe
    - Greece (0.04)
    - United Kingdom > England
      - Cambridgeshire > Cambridge (0.04)
    - Germany > Baden-Württemberg
      - Karlsruhe Region > Karlsruhe (0.04)
  - Asia
    - Singapore (0.04)
    - Indonesia > Bali (0.04)
    - Middle East > UAE
      - Abu Dhabi Emirate > Abu Dhabi (0.04)
  - Africa > Rwanda
- Genre:
  - Research Report > New Finding (0.46)
- Technology: