A Logic for Expressing Log-Precision Transformers
Neural Information Processing Systems (NeurIPS 2023)
One way to interpret the reasoning power of transformer-based language models is to describe the types of logical rules they can resolve over some input text. Recently, Chiang et al. (2023) showed that finite-precision transformer classifiers can be equivalently expressed in a generalization of first-order logic. However, finite-precision transformers are a weak transformer variant because, as we show, a single head can only attend to a constant number of tokens and, in particular, cannot represent uniform attention. Since attending broadly is a core capability for transformers, we ask whether a minimally more expressive model that can attend universally can also be characterized in logic. To this end, we analyze transformers whose forward pass is computed in log n precision on contexts of length n. We prove any log-precision transformer classifier can be equivalently expressed as a first-order logic sentence that, in addition to standard universal and existential quantifiers, may also contain majority-vote quantifiers. This is the tightest known upper bound and first logical characterization of log-precision transformers.

Figure 1: Any log-precision transformer can be re-expressed as a sentence in FO(M) logic.
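As a concrete illustration of what such a sentence looks like (a sketch using standard FO(M) conventions, not the example from the paper's figure): write Q_a(i) for "position i of the input carries token a", and let the majority quantifier M i. φ(i) be true iff φ(i) holds for more than half of the input positions.

```latex
% An illustrative FO(M) sentence (a sketch under standard conventions,
% not the example from the paper's figure). Q_a(i) holds iff position
% i of the input carries token a; the majority quantifier M i. phi(i)
% is true iff phi(i) holds for more than half of all positions i.
\documentclass{article}
\usepackage{amsmath}
\begin{document}
Over the alphabet $\{a,b\}$, the FO(M) sentence
\[
  \mathsf{M}\,i.\; Q_a(i)
\]
accepts exactly the strings in which the $a$'s form a strict majority,
i.e., the strings with more $a$'s than $b$'s. Majority quantifiers
compose with the standard ones, e.g.,
\[
  \mathsf{M}\,i.\; \exists j.\, \bigl(j < i \wedge Q_a(j)\bigr)
\]
holds iff a majority of positions have some $a$ strictly before them.
\end{document}
```

The first sentence is a standard witness that the majority quantifier strictly extends first-order logic: the more-a's-than-b's language is not definable with universal and existential quantifiers alone, which is why characterizing log-precision transformers requires the extra quantifier.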