The Power of Hard Attention Transformers on Data Sequences: A Formal Language Theoretic Perspective

Chris Köcher
RPTU Kaiserslautern-Landau

Neural Information Processing Systems 

Formal language theory has recently been employed with great success to characterize the power of transformer encoders. This setting is primarily applicable to Natural Language Processing (NLP), where a token embedding function over a bounded vocabulary is first applied to the input before it is fed to the transformer.
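
As a minimal illustration of this standard NLP setup (the notation below is assumed for exposition, not taken from the paper): given a finite vocabulary $\Sigma$ and an embedding $e \colon \Sigma \to \mathbb{R}^d$, an input word $w = w_1 w_2 \cdots w_n \in \Sigma^*$ is mapped to the vector sequence
\[
  e(w_1),\, e(w_2),\, \ldots,\, e(w_n) \in \mathbb{R}^d,
\]
which is then processed by the transformer encoder. The key consequence of the bounded vocabulary is that only finitely many distinct input vectors can ever occur at any position.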