AITopics | Distributed Systems

Graph Coarsening with Message-Passing Guarantees

Neural Information Processing Systems, Apr-30-2026, 10:37:42 GMT

Graph coarsening aims to reduce the size of a large graph while preserving some of its key properties, and it has been used in many applications to reduce computational load and memory footprint. For instance, in graph machine learning, training Graph Neural Networks (GNNs) on coarsened graphs leads to drastic savings in time and memory. However, GNNs rely on the Message-Passing (MP) paradigm, and classical spectral preservation guarantees for graph coarsening do not directly lead to theoretical guarantees when performing naive message-passing on the coarsened graph. In this work, we propose a new message-passing operation specific to coarsened graphs, which exhibits theoretical guarantees on the preservation of the propagated signal. Interestingly, and in a sharp departure from previous proposals, this operation on coarsened graphs is often oriented, even when the original graph is undirected. We conduct node classification tasks on synthetic and real data and observe improved results compared to performing naive message-passing on the coarsened graph.
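
To make the notion of naive message-passing on a coarsened graph concrete, here is a minimal sketch. It is not the operation proposed in the paper. It assumes a partition-based coarsening in which edge weights between clusters are summed and node signals are averaged per cluster; the helper names (`coarsen`, `coarsen_signal`, `lift`) are hypothetical. One propagation step on the coarse graph, lifted back to the original nodes, is compared with one step on the original graph.

```python
def matvec(matrix, vector):
    """One propagation step: multiply an adjacency matrix by a node signal."""
    return [sum(a * x for a, x in zip(row, vector)) for row in matrix]

def coarsen(adj, clusters):
    """Assumed coarsening: A_c[p][q] sums all edge weights from cluster p to q."""
    n_c = len(clusters)
    return [[sum(adj[i][j] for i in clusters[p] for j in clusters[q])
             for q in range(n_c)] for p in range(n_c)]

def coarsen_signal(x, clusters):
    """Assumed signal reduction: average the signal inside each cluster."""
    return [sum(x[i] for i in c) / len(c) for c in clusters]

def lift(x_c, clusters, n):
    """Copy each cluster value back to every node of that cluster."""
    x = [0.0] * n
    for value, c in zip(x_c, clusters):
        for i in c:
            x[i] = value
    return x

# Path graph 0 - 1 - 2 - 3, coarsened into two clusters {0, 1} and {2, 3}.
adj = [[0, 1, 0, 0],
       [1, 0, 1, 0],
       [0, 1, 0, 1],
       [0, 0, 1, 0]]
clusters = [[0, 1], [2, 3]]
x = [1.0, 1.0, 3.0, 3.0]

coarse_adj = coarsen(adj, clusters)
fine = matvec(adj, x)                                 # propagation on the original graph
naive = lift(matvec(coarse_adj, coarsen_signal(x, clusters)), clusters, len(x))

print(fine)    # propagated signal on the original graph
print(naive)   # naive coarse propagation, lifted back
```

In this toy case the two signals differ, which illustrates why a guarantee on the coarsened graph's structure does not by itself say anything about the propagated signal.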

artificial intelligence, graph, machine learning, (18 more...)

Neural Information Processing Systems

Genre: Research Report > Experimental Study (0.93)

Industry: Information Technology (0.46)

Technology:

Information Technology > Communications (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Architecture > Distributed Systems (1.00)

[Figure: input graph and the rooted subtree of a node under 2-layer 1-hop message passing; node labels lost in extraction]

Neural Information Processing Systems, Apr-25-2026, 00:29:38 GMT

artificial intelligence, graph, node, (13 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence (1.00)
Information Technology > Architecture > Distributed Systems (1.00)

How Powerful are K-hop Message Passing Graph Neural Networks

Neural Information Processing Systems, Apr-25-2026, 00:29:34 GMT

The most popular design paradigm for Graph Neural Networks (GNNs) is 1-hop message passing--aggregating information from 1-hop neighbors repeatedly. However, the expressive power of 1-hop message passing is bounded by the Weisfeiler-Lehman (1-WL) test. Recently, researchers extended 1-hop message passing to K-hop message passing by aggregating information from K-hop neighbors of nodes simultaneously. However, there is no work on analyzing the expressive power of K-hop message passing. In this work, we theoretically characterize the expressive power of K-hop message passing.
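
As an illustration of aggregating from K-hop neighbors simultaneously, the sketch below runs a single K-hop aggregation step. It assumes that the k-hop neighbors of a node are the nodes at shortest-path distance exactly k, which is one possible definition and is not fixed by the abstract. It also uses a plain sum as the aggregator and keeps one aggregate per hop. The function names are hypothetical.

```python
from collections import deque

def hop_neighbors(adj, source, max_hops):
    """Return {k: nodes at shortest-path distance exactly k} for k = 1..max_hops."""
    dist = {source: 0}
    queue = deque([source])
    while queue:
        u = queue.popleft()
        if dist[u] == max_hops:
            continue  # do not expand beyond the requested radius
        for v in adj[u]:
            if v not in dist:
                dist[v] = dist[u] + 1
                queue.append(v)
    hops = {k: set() for k in range(1, max_hops + 1)}
    for node, d in dist.items():
        if d >= 1:
            hops[d].add(node)
    return hops

def k_hop_aggregate(adj, features, max_hops):
    """One layer: for every node, sum neighbor features separately for each hop."""
    out = {}
    for node in adj:
        hops = hop_neighbors(adj, node, max_hops)
        out[node] = [sum(features[v] for v in hops[k])
                     for k in range(1, max_hops + 1)]
    return out

# Path graph 0 - 1 - 2 - 3 with scalar node features.
adj = {0: [1], 1: [0, 2], 2: [1, 3], 3: [2]}
features = {0: 1.0, 1: 2.0, 2: 3.0, 3: 4.0}
result = k_hop_aggregate(adj, features, max_hops=2)
print(result[0])  # [sum over 1-hop neighbors, sum over 2-hop neighbors] of node 0
```

With `max_hops=1` this reduces to ordinary 1-hop aggregation; a learned model would replace the plain sums with trainable update functions.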

artificial intelligence, expressive power, machine learning, (16 more...)

Neural Information Processing Systems

Genre: Research Report (0.46)

Industry: Health & Medicine (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Architecture > Distributed Systems (1.00)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (0.79)

LibAMM: Empirical Insights into Approximate Computing for Accelerating Matrix Multiplication

Neural Information Processing Systems, Mar-21-2026, 02:03:24 GMT

Matrix multiplication (MM) is pivotal in fields from deep learning to scientific computing, driving the quest for improved computational efficiency. Accelerating MM encompasses strategies like complexity reduction, parallel and distributed computing, hardware acceleration, and approximate computing techniques, namely AMM algorithms. Amidst growing concerns over the resource demands of large language models (LLMs), AMM has garnered renewed focus. However, understanding the nuances that govern AMM's effectiveness remains incomplete. This study delves into AMM by examining algorithmic strategies, operational specifics, dataset characteristics, and their application in real-world tasks.
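
For readers unfamiliar with approximate matrix multiplication, the sketch below shows one well-known sampling-based scheme, column-row sampling. It is a generic illustration and is not taken from LibAMM. The product is estimated from a few randomly chosen column-row outer products, each rescaled so that the estimate is unbiased. The function name and parameters are hypothetical.

```python
import math
import random

def approx_matmul(a, b, samples, rng):
    """Estimate a @ b by sampling `samples` column-row outer products."""
    m, n, p = len(a), len(b), len(b[0])
    # Importance weight of index k: ||a[:, k]|| * ||b[k, :]||.
    weights = [
        math.sqrt(sum(a[i][k] ** 2 for i in range(m)))
        * math.sqrt(sum(v * v for v in b[k]))
        for k in range(n)
    ]
    total = sum(weights)
    result = [[0.0] * p for _ in range(m)]
    if total == 0.0:
        return result  # every outer product is zero, so the exact product is zero
    probs = [w / total for w in weights]
    for k in rng.choices(range(n), weights=probs, k=samples):
        # Dividing by samples * probs[k] keeps the estimator unbiased.
        scale = 1.0 / (samples * probs[k])
        for i in range(m):
            for j in range(p):
                result[i][j] += a[i][k] * b[k][j] * scale
    return result

a = [[1.0, 2.0, 3.0], [4.0, 5.0, 6.0]]
b = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
estimate = approx_matmul(a, b, samples=50, rng=random.Random(0))
print(estimate)  # a random estimate of the 2 x 2 product a @ b
```

The cost scales with the number of samples rather than with the inner dimension, which is the accuracy-for-speed trade-off that approximate methods of this kind make.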

artificial intelligence, large language model, natural language, (9 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.59)
Information Technology > Architecture > Distributed Systems (0.59)

A Appendix: A.1 Message Passing in SyncTREE

Neural Information Processing Systems, Feb-11-2026, 02:21:45 GMT

Note that we made only a minor modification to the GraphTrans model. For NTREE, we set GAT as its basic block, with a dropout probability of 0.2 between layers.

artificial intelligence, dataset, machine learning, (17 more...)

Neural Information Processing Systems

Technology:

Information Technology > Architecture > Distributed Systems (0.45)
Information Technology > Artificial Intelligence > Machine Learning (0.31)

Orthogonal Approximate Message Passing Algorithms for Rectangular Spiked Matrix Models with Rotationally Invariant Noise

Chen, Haohua, Liu, Songbin, Ma, Junjie

arXiv.org Machine Learning, Feb-4-2026

We propose an orthogonal approximate message passing (OAMP) algorithm for signal estimation in the rectangular spiked matrix model with general rotationally invariant (RI) noise. We establish a rigorous state evolution that exactly characterizes the high-dimensional dynamics of the algorithm. Building on this framework, we derive an optimal variant of OAMP that minimizes the predicted mean-squared error at each iteration. For the special case of i.i.d. Gaussian noise, the fixed point of the proposed OAMP algorithm coincides with that of the standard AMP algorithm. For general RI noise models, we conjecture that the optimal OAMP algorithm is statistically optimal within a broad class of iterative methods, and achieves Bayes-optimal performance in certain regimes.

algorithm, artificial intelligence, oamp algorithm, (11 more...)

arXiv.org Machine Learning

2602.03283

Country:

North America > United States (0.14)
Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)
Asia > China > Beijing > Beijing (0.04)

Genre: Research Report (0.82)

Technology:

Information Technology > Architecture > Distributed Systems (0.62)
Information Technology > Artificial Intelligence (0.47)

Learning Multi-Order Block Structure in Higher-Order Networks

Nakajima, Kazuki, Sasaki, Yuya, Uno, Takeaki, Aida, Masaki

arXiv.org Artificial Intelligence, Nov-27-2025

Higher-order networks, naturally described as hypergraphs, are essential for modeling real-world systems involving interactions among three or more entities. Stochastic block models offer a principled framework for characterizing mesoscale organization, yet their extension to hypergraphs involves a trade-off between expressive power and computational complexity. A recent simplification, a single-order model, mitigates this complexity by assuming a single affinity pattern governs interactions of all orders. This universal assumption, however, may overlook order-dependent structural details. Here, we propose a framework that relaxes this assumption by introducing a multi-order block structure, in which different affinity patterns govern distinct subsets of interaction orders. Our framework is based on a multi-order stochastic block model and searches for the optimal partition of the set of interaction orders that maximizes out-of-sample hyperlink prediction performance. Analyzing a diverse range of real-world networks, we find that multi-order block structures are prevalent. Accounting for them not only yields better predictive performance over the single-order model but also uncovers sharper, more interpretable mesoscale organization. Our findings reveal that order-dependent mechanisms are a key feature of the mesoscale organization of real-world higher-order networks.

data mining, information retrieval, machine learning, (21 more...)

arXiv.org Artificial Intelligence

2511.2135

Country: North America > United States (1.00)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (0.93)

Industry:

Information Technology > Security & Privacy (1.00)
Government > Regional Government > North America Government > United States Government (0.93)
Health & Medicine > Therapeutic Area (0.93)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Information Management > Search (1.00)
Information Technology > Data Science > Data Mining (1.00)
(10 more...)

Communication Efficient Parallel Algorithms for Optimization on Manifolds

Neural Information Processing Systems, Nov-20-2025, 21:13:50 GMT

However, the existing literature on parallel inference almost exclusively focuses on Euclidean data and parameters.

algorithm, inference, manifold, (12 more...)

Neural Information Processing Systems

Country:

North America > United States > Indiana > St. Joseph County > Notre Dame (0.04)
Asia > Middle East > Jordan (0.04)
North America > United States > New Jersey > Mercer County > Princeton (0.04)
(2 more...)

Industry: Health & Medicine (0.68)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Architecture > Distributed Systems (0.65)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.47)

Plexus: Taming Billion-edge Graphs with 3D Parallel Full-graph GNN Training

Ranjan, Aditya K., Singh, Siddharth, Wei, Cunyang, Bhatele, Abhinav

arXiv.org Artificial Intelligence, Oct-30-2025

Graph neural networks (GNNs) leverage the connectivity and structure of real-world graphs to learn intricate properties and relationships between nodes. Many real-world graphs exceed the memory capacity of a GPU due to their sheer size, and training GNNs on such graphs requires techniques such as mini-batch sampling to scale. The alternative approach of distributed full-graph training suffers from high communication overheads and load imbalance due to the irregular structure of graphs. We propose a three-dimensional (3D) parallel approach for full-graph training that tackles these issues and scales to billion-edge graphs. In addition, we introduce optimizations such as a double permutation scheme for load balancing, and a performance model to predict the optimal 3D configuration of our parallel implementation -- Plexus. We evaluate Plexus on six different graph datasets and show scaling results on up to 2048 GPUs of Perlmutter, and 1024 GPUs of Frontier. Plexus achieves unprecedented speedups of 2.3-12.5x over prior state of the art, and a reduction in time-to-solution by 5.2-8.7x on Perlmutter and 7.0-54.2x on Frontier.

data mining, machine learning, natural language, (20 more...)

arXiv.org Artificial Intelligence

2505.04083

Country: North America > United States > Maryland > Prince George's County > College Park (0.14)

Genre:

Overview (0.67)
Research Report (0.51)

Industry:

Energy (1.00)
Government > Regional Government > North America Government > United States Government (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.94)
Information Technology > Architecture > Distributed Systems (0.93)
Information Technology > Artificial Intelligence > Natural Language (0.93)
Information Technology > Data Science > Data Mining (0.93)

What Expressivity Theory Misses: Message Passing Complexity for GNNs

Kemper, Niklas, Wollschläger, Tom, Günnemann, Stephan

arXiv.org Artificial Intelligence, Oct-23-2025

Expressivity theory, characterizing which graphs a GNN can distinguish, has become the predominant framework for analyzing GNNs, with new models striving for higher expressivity. However, we argue that this focus is misguided: First, higher expressivity is not necessary for most real-world tasks as these tasks rarely require expressivity beyond the basic WL test. Second, expressivity theory's binary characterization and idealized assumptions fail to reflect GNNs' practical capabilities. To overcome these limitations, we propose Message Passing Complexity (MPC): a continuous measure that quantifies the difficulty for a GNN architecture to solve a given task through message passing. MPC captures practical limitations like over-squashing while preserving the theoretical impossibility results from expressivity theory, effectively narrowing the gap between theory and practice. Through extensive validation on fundamental GNN tasks, we show that MPC's theoretical predictions correlate with empirical performance, successfully explaining architectural successes and failures. Thereby, MPC advances beyond expressivity theory to provide a more powerful and nuanced framework for understanding and improving GNN architectures.

artificial intelligence, complexity, machine learning, (16 more...)

arXiv.org Artificial Intelligence

2509.01254

Genre: Research Report > New Finding (0.93)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Architecture > Distributed Systems (0.82)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.47)
