The Expressive Capacity of State Space Models: A Formal Language Perspective

May-27-2025, 00:02:33 GMT–Neural Information Processing Systems

Recently, recurrent models based on linear state space models (SSMs) have shown promising performance in language modeling (LM), competititve with transformers. However, there is little understanding of the in-principle abilities of such models, which could provide useful guidance to the search for better LM architectures. We present a comprehensive theoretical study of the capacity of such SSMs as it compares to that of transformers and traditional RNNs. We find that SSMs and transformers have overlapping but distinct strengths. In star-free state tracking, SSMs implement straightforward and exact solutions to problems that transformers struggle to represent exactly.

artificial intelligence, formal language perspective, logic & formal reasoning, (5 more...)

Neural Information Processing Systems

May-27-2025, 00:02:33 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence > Representation & Reasoning > Logic & Formal Reasoning (0.40)