Implementing Derivations of Definite Logic Programs with Self-Attention Networks

Oct-15-2024–arXiv.org Artificial Intelligence

In this paper we propose that a restricted version of logical inference can be implemented with self-attention networks. We are aiming at showing that LLMs (Large Language Models) constructed with transformer networks can make logical inferences. We would reveal the potential of LLMs by analyzing self-attention networks, which are main components of transformer networks. Our approach is not based on semantics of natural languages but operations of logical inference. %point of view. We show that hierarchical constructions of self-attention networks with feed forward networks (FFNs) can implement top-down derivations for a class of logical formulae. We also show bottom-up derivations are also implemented for the same class. We believe that our results show that LLMs implicitly have the power of logical inference.

derivation, large language model, logic & formal reasoning, (18 more...)

arXiv.org Artificial Intelligence

Oct-15-2024

arXiv.org PDF

Add feedback

Country:
- Europe > Italy (0.04)
- Oceania > Australia
  - Victoria > Melbourne (0.04)
- North America > United States
  - California > Los Angeles County > Long Beach (0.04)
- Asia
  - Vietnam > Hanoi
    - Hanoi (0.04)
  - Japan > Honshū
    - Kansai > Kyoto Prefecture > Kyoto (0.05)

Genre:
- Research Report > New Finding (0.55)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning > Logic & Formal Reasoning (1.00)
  - Natural Language > Large Language Model (0.97)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found