Towards Principled Graph Transformers Luis Müller Daniel Kusuma

Neural Information Processing Systems 

However, such architectures often fail to deliver solid predictive performance on real-world tasks, limiting their practical impact. In contrast, global attention-based models such as graph transformers demonstrate strong performance in practice.