Gaussian Process Limit Reveals Structural Benefits of Graph Transformers
Nil Ayday, Lingchu Yang, Debarghya Ghoshdastidar
Graph transformers are the state-of-the-art for learning from graph-structured data and are empirically known to avoid several pitfalls of message-passing architectures. However, there is limited theoretical analysis of why these models perform well in practice. In this work, we prove that attention-based architectures have structural benefits over graph convolutional networks in the context of node-level prediction tasks. Specifically, we study the neural network Gaussian process limits of graph transformers (GAT, Graphormer, Specformer) with infinite width and infinitely many heads, and derive the node-level and edge-level kernels across the layers. Our results characterise how the node features and the graph structure propagate through the graph attention layers. As a specific example, we prove that graph transformers structurally preserve community information and maintain discriminative node representations even in deep layers, thereby preventing oversmoothing. We provide empirical evidence on synthetic and real-world graphs that validates our theoretical insights, for instance that integrating informative priors and positional encodings can improve the performance of deep graph transformers.
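The kernel recursions derived in the paper are not reproduced here; as a rough illustration of the oversmoothing phenomenon the abstract refers to, the following sketch propagates a node-level NNGP kernel through generic row-normalised GCN-style layers (using the standard ReLU arc-cosine kernel map) on a toy two-block stochastic block model. The function names, SBM parameters, and depth are illustrative assumptions, not the paper's setup.

```python
import numpy as np

def relu_arccos(K):
    """Order-1 arc-cosine kernel map: the NNGP update for a ReLU nonlinearity."""
    d = np.sqrt(np.clip(np.diag(K), 1e-12, None))
    C = np.clip(K / np.outer(d, d), -1.0, 1.0)
    theta = np.arccos(C)
    return np.outer(d, d) * (np.sin(theta) + (np.pi - theta) * C) / (2 * np.pi)

def gcn_kernel_step(K, A_hat):
    """One GCN-GP layer: nonlinearity kernel map followed by neighbourhood averaging."""
    return A_hat @ relu_arccos(K) @ A_hat.T

# Toy two-block stochastic block model adjacency (illustrative, not the paper's data).
rng = np.random.default_rng(0)
n, blocks = 40, np.repeat([0, 1], 20)
P = np.where(blocks[:, None] == blocks[None, :], 0.5, 0.05)
A = (rng.random((n, n)) < P).astype(float)
A = np.triu(A, 1); A = A + A.T + np.eye(n)        # symmetric, with self-loops
A_hat = A / A.sum(1, keepdims=True)               # row-normalised propagation

K = np.eye(n)                                     # input kernel from i.i.d. node features
for _ in range(10):
    K = gcn_kernel_step(K, A_hat)

# Under deep GCN-style averaging the within/between-community kernel gap shrinks,
# illustrating the oversmoothing effect the paper contrasts against attention layers.
within = K[np.ix_(blocks == 0, blocks == 0)].mean()
between = K[np.ix_(blocks == 0, blocks == 1)].mean()
print(f"within-community kernel: {within:.4f}  between-community kernel: {between:.4f}")
```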
ReFactorGNNs
Hence, each atomic term of the sum can be seen as a message vector between v and v's neighbouring node. In the paper, we chose DistMult and GD because of their mathematical simplicity, leading to easier-to-read formulas. For example, here we offer a specific derivation for ComplEx [39]. For scalability w.r.t. the number of triplets/edges in the graph, we denote the entity set as E, the relation set as R, and the triplets as T. For inductive knowledge graph completion, we test the model on the new graph, where the relation vocabulary is shared with the training graph, while the entities are novel.
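As a concrete illustration of the "message vector" reading of each atomic term, the minimal sketch below uses a DistMult score and shows how the contribution of the triples incident to an entity v decomposes into one element-wise-product term per neighbouring triple. The toy triples and variable names are assumptions for illustration and do not reproduce the paper's ReFactor derivation.

```python
import numpy as np

rng = np.random.default_rng(0)
dim, n_ent, n_rel = 8, 5, 3
E = rng.normal(size=(n_ent, dim))      # entity embeddings
R = rng.normal(size=(n_rel, dim))      # relation embeddings (DistMult uses a diagonal relation matrix)

def distmult_score(h, r, t):
    """DistMult triple score: <e_h, w_r, e_t> = sum_d e_h[d] * w_r[d] * e_t[d]."""
    return float(np.sum(E[h] * R[r] * E[t]))

# Toy (head, relation, tail) triples incident to entity v = 0.
triples = [(0, 1, 2), (3, 0, 0), (0, 2, 4)]
v = 0
total_score = sum(distmult_score(h, r, t) for h, r, t in triples)

# The gradient of the summed score w.r.t. E[v] splits into one term per incident triple:
# an element-wise product of the relation vector and the neighbour's embedding,
# i.e. a "message" sent to v from that neighbouring node.
message_sum = np.zeros(dim)
for h, r, t in triples:
    if h == v:
        message_sum += R[r] * E[t]     # message from tail neighbour t
    elif t == v:
        message_sum += R[r] * E[h]     # message from head neighbour h

print(total_score, message_sum)
```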
What Makes Graph Neural Networks Miscalibrated?
Given the importance of getting calibrated predictions and reliable uncertainty estimations, various post-hoc calibration methods have been developed for neural networks on standard multi-class classification tasks. However, these methods are not well suited for calibrating graph neural networks (GNNs), which presents unique challenges such as accounting for the graph structure and the graph-induced correlations between the nodes. In this work, we conduct a systematic study on the calibration qualities of GNN node predictions. In particular, we identify five factors which influence the calibration of GNNs: general under-confident tendency, diversity of nodewise predictive distributions, distance to training nodes, relative confidence level, and neighborhood similarity. Furthermore, based on the insights from this study, we design a novel calibration method named Graph Attention Temperature Scaling (GATS), which is tailored for calibrating graph neural networks. GATS incorporates designs that address all the identified influential factors and produces nodewise temperature scaling using an attention-based architecture. GATS is accuracy-preserving, data-efficient, and expressive at the same time. Our experiments empirically verify the effectiveness of GATS, demonstrating that it can consistently achieve state-of-the-art calibration results on various graph datasets for different GNN backbones.
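A minimal sketch of what nodewise temperature scaling looks like: each node receives its own positive temperature, and its logits are divided by it before the softmax, which leaves the argmax (and hence accuracy) unchanged. The `node_temperatures` function below is a hypothetical stand-in based on nodewise confidence and a neighbourhood average; GATS instead learns these temperatures with an attention-based architecture that is not reproduced here.

```python
import numpy as np

def softmax(z, axis=-1):
    z = z - z.max(axis=axis, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=axis, keepdims=True)

def node_temperatures(logits, A_hat, bias=1.0):
    """Hypothetical stand-in for GATS: a per-node temperature driven by the node's own
    confidence and a neighbourhood average (GATS learns this mapping with attention)."""
    conf = softmax(logits).max(axis=1)            # nodewise confidence
    neigh_conf = A_hat @ conf                     # smoothed over neighbours
    return bias + 0.5 * (conf + neigh_conf)       # strictly positive temperatures

# Toy graph with 4 nodes and 3 classes.
logits = np.array([[2.0, 0.5, 0.1],
                   [0.2, 1.5, 0.3],
                   [3.0, 0.1, 0.1],
                   [0.4, 0.4, 1.2]])
A = np.array([[1, 1, 0, 0],
              [1, 1, 1, 0],
              [0, 1, 1, 1],
              [0, 0, 1, 1]], dtype=float)
A_hat = A / A.sum(1, keepdims=True)

T = node_temperatures(logits, A_hat)              # one temperature per node
calibrated = softmax(logits / T[:, None])         # accuracy-preserving: argmax is unchanged
print(T, calibrated.max(axis=1))
```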
Are GATs Out of Balance?
While the expressive power and computational capabilities of graph neural networks (GNNs) have been theoretically studied, their optimization and learning dynamics, in general, remain largely unexplored. Our study focuses on the Graph Attention Network (GAT), a popular GNN architecture in which a node's neighborhood aggregation is weighted by parameterized attention coefficients. We derive a conservation law of GAT gradient flow dynamics, which explains why a large fraction of parameters in GATs with standard initialization struggle to change during training. This effect is amplified in deeper GATs, which perform significantly worse than their shallow counterparts. To alleviate this problem, we devise an initialization scheme that balances the GAT network. Our approach i) allows more effective propagation of gradients and in turn enables trainability of deeper networks, and ii) attains a considerable speedup in training and convergence time in comparison to the standard initialization. Our main theorem serves as a stepping stone to studying the learning dynamics of positively homogeneous models with attention mechanisms.
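The paper's balanced initialization is not reproduced here; as a rough sketch of the underlying idea, the snippet below rescales the layers of a bias-free, positively homogeneous network so that all layers share the same Frobenius norm while the end-to-end function is preserved (the per-layer scale factors multiply to one). The layer sizes and scales are illustrative assumptions, and GAT's attention coefficients are not modelled.

```python
import numpy as np

def rebalance(weights):
    """Rescale layer weight matrices so every layer has the same Frobenius norm.
    Because the scale factors multiply to one, the function of a bias-free,
    positively homogeneous (e.g. ReLU) network is unchanged. This is a generic
    balancing trick, not the paper's exact initialisation scheme."""
    norms = np.array([np.linalg.norm(W) for W in weights])
    target = np.exp(np.mean(np.log(norms)))       # geometric mean preserves the product of norms
    return [W * (target / n) for W, n in zip(weights, norms)]

rng = np.random.default_rng(0)
# Deliberately imbalanced layers: early layers tiny, later layers large.
weights = [rng.normal(scale=s, size=(16, 16)) for s in (0.01, 0.1, 1.0, 10.0)]
balanced = rebalance(weights)

print([round(float(np.linalg.norm(W)), 3) for W in weights])
print([round(float(np.linalg.norm(W)), 3) for W in balanced])
```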
We thank the reviewers for lending their expertise and time to provide feedback on our efforts. If accepted, we will make several changes to moderate the claims as R4 suggested, and we will change the title to "Towards Sim-to-Real Transfer: ...". We compare to ANE [20]. It does not represent the maximum and minimum returns possible.