Goto

Collaborating Authors

 expressive


Kimi Linear: An Expressive, Efficient Attention Architecture

arXiv.org Artificial Intelligence

We introduce Kimi Linear, a hybrid linear attention architecture that, for the first time, outperforms full attention under fair comparisons across various scenarios -- including short-context, long-context, and reinforcement learning (RL) scaling regimes. At its core lies Kimi Delta Attention (KDA), an expressive linear attention module that extends Gated DeltaNet with a finer-grained gating mechanism, enabling more effective use of limited finite-state RNN memory. Our bespoke chunkwise algorithm achieves high hardware efficiency through a specialized variant of the Diagonal-Plus-Low-Rank (DPLR) transition matrices, which substantially reduces computation compared to the general DPLR formulation while remaining more consistent with the classical delta rule. We pretrain a Kimi Linear model with 3B activated parameters and 48B total parameters, based on a layerwise hybrid of KDA and Multi-Head Latent Attention (MLA). Our experiments show that with an identical training recipe, Kimi Linear outperforms full MLA with a sizeable margin across all evaluated tasks, while reducing KV cache usage by up to 75% and achieving up to 6 times decoding throughput for a 1M context. These results demonstrate that Kimi Linear can be a drop-in replacement for full attention architectures with superior performance and efficiency, including tasks with longer input and output lengths. To support further research, we open-source the KDA kernel and vLLM implementations, and release the pre-trained and instruction-tuned model checkpoints.


Google I/O 2025: What to expect over the next two weeks on Android 16, Android XR and Gemini

Engadget

In about two weeks, Google's annual developer conference will kick off on May 20. The event is probably the most important on the company's calendar, offering a glimpse at everything it has been working on over the past year. Judging from rumors and information Google has trickled out, I/O 2025 should be one of the more exciting tech keynotes in recent memory. Plus, for the first time, Google has spun out a dedicated Android showcase planned a whole week earlier. If you want to know what to expect from the company later this month, read on.


How Expressive are Knowledge Graph Foundation Models?

arXiv.org Artificial Intelligence

Knowledge Graph Foundation Models (KGFMs) are at the frontier for deep learning on knowledge graphs (KGs), as they can generalize to completely novel knowledge graphs with different relational vocabularies. Despite their empirical success, our theoretical understanding of KGFMs remains very limited. In this paper, we conduct a rigorous study of the expressive power of KGFMs. Specifically, we show that the expressive power of KGFMs directly depends on the motifs that are used to learn the relation representations. We then observe that the most typical motifs used in the existing literature are binary, as the representations are learned based on how pairs of relations interact, which limits the model's expressiveness. As part of our study, we design more expressive KGFMs using richer motifs, which necessitate learning relation representations based on, e.g., how triples of relations interact with each other. Finally, we empirically validate our theoretical findings, showing that the use of richer motifs results in better performance on a wide range of datasets drawn from different domains.


Capturing Knowledge Graphs and Rules with Octagon Embeddings

arXiv.org Artificial Intelligence

Region based knowledge graph embeddings represent relations as geometric regions. This has the advantage that the rules which are captured by the model are made explicit, making it straightforward to incorporate prior knowledge and to inspect learned models. Unfortunately, existing approaches are severely restricted in their ability to model relational composition, and hence also their ability to model rules, thus failing to deliver on the main promise of region based models. With the aim of addressing these limitations, we investigate regions which are composed of axis-aligned octagons. Such octagons are particularly easy to work with, as intersections and compositions can be straightforwardly computed, while they are still sufficiently expressive to model arbitrary knowledge graphs. Among others, we also show that our octagon embeddings can properly capture a non-trivial class of rule bases. Finally, we show that our model achieves competitive experimental results.


A dynamic state-based model of crowds

arXiv.org Artificial Intelligence

As a discipline, crowd science has acknowledged the need to understand the nature of human collective phenomena before trying to explain them, and a number of attempts have been made to specify and classify different crowd types and behaviours. However, these typologies are often partial, over-fitted to a specific crowd type, or use arbitrary and/or subjective labels for behaviours of complex origin (for example, "panic"). Moreover, they tend to be relatively inflexible, and do not reflect the fluid nature of crowd behaviour (and how this might influence the crowd's structure and impact over time). For example, a static typology might not capture a situation in which a peaceful demonstration can quickly turn into a riot, or how a physical crowd moving around a shopping mall can suddenly become united into a psychological crowd in response to a shared grievance or an external threat. In this paper, we present an alternative to the typology approach; a dynamic, state-based model of crowds, structured around an existing assembly-action-dispersal framework. Our model draws on the statechart formalism from computer science. This approach is relatively objective, can capture the dynamic evolution of a crowd over time, and (unlike existing typologies, which are relatively static) allows for the natural description of how sub-groups emerge within a crowd. This new model may be useful for describing the evolution of incidents such as riots or emergencies, but it is equally well-suited to the study of expected, "normal" crowds.


ExpressivE: A Spatio-Functional Embedding For Knowledge Graph Completion

arXiv.org Artificial Intelligence

Knowledge graphs are inherently incomplete. Therefore substantial research has been directed toward knowledge graph completion (KGC), i.e., predicting missing triples from the information represented in the knowledge graph (KG). KG embedding models (KGEs) have yielded promising results for KGC, yet any current KGE is incapable of: (1) fully capturing vital inference patterns (e.g., composition), (2) capturing prominent patterns jointly (e.g., hierarchy and composition), and (3) providing an intuitive interpretation of captured patterns. In this work, we propose ExpressivE, a fully expressive spatio-functional KGE that solves all these challenges simultaneously. ExpressivE embeds pairs of entities as points and relations as hyper-parallelograms in the virtual triple space $\mathbb{R}^{2d}$. This model design allows ExpressivE not only to capture a rich set of inference patterns jointly but additionally to display any supported inference pattern through the spatial relation of hyper-parallelograms, offering an intuitive and consistent geometric interpretation of ExpressivE embeddings and their captured patterns. Experimental results on standard KGC benchmarks reveal that ExpressivE is competitive with state-of-the-art KGEs and even significantly outperforms them on WN18RR.


How Expressive are Transformers in Spectral Domain for Graphs?

arXiv.org Artificial Intelligence

The recent works proposing transformer-based models for graphs have proven the inadequacy of Vanilla Transformer for graph representation learning. To understand this inadequacy, there is a need to investigate if spectral analysis of the transformer will reveal insights into its expressive power. Similar studies already established that spectral analysis of Graph neural networks (GNNs) provides extra perspectives on their expressiveness. In this work, we systematically study and establish the link between the spatial and spectral domain in the realm of the transformer. We further provide a theoretical analysis and prove that the spatial attention mechanism in the transformer cannot effectively capture the desired frequency response, thus, inherently limiting its expressiveness in spectral space. Therefore, we propose FeTA, a framework that aims to perform attention over the entire graph spectrum (i.e., actual frequency components of the graphs) analogous to the attention in spatial space. Empirical results suggest that FeTA provides homogeneous performance gain against vanilla transformer across all tasks on standard benchmarks and can easily be extended to GNN-based models with low-pass characteristics (e.g., GAT).


How AI understands emotion

#artificialintelligence

Lia's creator Soul Machines is developing digital humans, complete with digital brains, who are portrayed by actual humans. Verizon's Labs showcases innovators like Soul Machines to explore how 5G networks support cutting edge technology that contributes to the betterment of society. Having the speed and bandwidth of a 5G connection is critical to ensuring that digital interactions feel humanized. In human-to-human engagement, the brain rapidly identifies and processes data points such as tone and non-verbal cues. In digital-to-human engagement, mimicking human-like interactions requires 5G's bandwidth and speed.