The Power of Hard Attention Transformers on Data Sequences: A Formal Language Theoretic Perspective
–Neural Information Processing Systems
Formal language theory has recently been employed successfully to characterize the power of transformer encoders. This setting is primarily applicable to Natural Language Processing (NLP), since a token embedding function (admitting only a bounded number of tokens) is applied before the input is fed to the transformer.