Fast Training of Sparse Graph Neural Networks on Dense Hardware

Balog, Matej, van Merriënboer, Bart, Moitra, Subhodeep, Li, Yujia, Tarlow, Daniel

Jun-27-2019–arXiv.org Machine Learning

Graph neural networks have become increasingly popular in recent years due to their ability to naturally encode relational input data and their ability to scale to large graphs by operating on a sparse representation of graph adjacency matrices. As we look to scale up these models using custom hardware, a natural assumption would be that we need hardware tailored to sparse operations and/or dynamic control flow. In this work, we question this assumption by scaling up sparse graph neural networks using a platform targeted at dense computation on fixed-size data. Drawing inspiration from optimization of numerical algorithms on sparse matrices, we develop techniques that enable training the sparse graph neural network model from Allamanis et al. [2018] in 13 minutes using a 512-core TPUv2 Pod, whereas the original training takes almost a day.
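The contrast between a sparse graph representation and hardware that expects fixed-size data can be illustrated with a small sketch. It is not taken from the paper, and the abstract does not describe the authors' actual techniques; it only shows one generic way to run sparse message passing over arrays of a fixed length, by padding the edge list to a fixed budget and masking out the padding. The names `pad_edges`, `propagate`, and `max_edges` are hypothetical, and plain Python lists stand in for accelerator tensors.

```python
def pad_edges(edges, max_edges):
    """Pad an edge list to a fixed length; returns (sources, targets, mask).

    Padding entries point at node 0 and carry mask 0.0, so they add
    nothing during aggregation. `max_edges` is an assumed fixed budget.
    """
    if len(edges) > max_edges:
        raise ValueError("graph has more edges than the fixed budget")
    pad = max_edges - len(edges)
    sources = [s for s, _ in edges] + [0] * pad
    targets = [t for _, t in edges] + [0] * pad
    mask = [1.0] * len(edges) + [0.0] * pad
    return sources, targets, mask


def propagate(node_states, sources, targets, mask):
    """One sum-aggregation message passing step over fixed-size arrays."""
    dim = len(node_states[0])
    out = [[0.0] * dim for _ in node_states]
    # The loop always runs over max_edges entries, whatever the real
    # edge count, so the shape of the work does not depend on the graph.
    for s, t, m in zip(sources, targets, mask):
        for d in range(dim):
            out[t][d] += m * node_states[s][d]
    return out


# Toy graph: 3 nodes, 3 directed edges, padded to a budget of 5 edges.
edges = [(0, 1), (1, 2), (0, 2)]
states = [[1.0, 0.0], [0.0, 2.0], [3.0, 3.0]]
sources, targets, mask = pad_edges(edges, max_edges=5)
new_states = propagate(states, sources, targets, mask)
print(new_states)
```

The cost of this approach is the wasted work on padded entries, so the fixed budget has to be chosen with the distribution of graph sizes in mind.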

deep learning, graph, neural network

Country:
- Europe (0.14)

Genre:
- Research Report (0.50)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)
