HierarchicalGraphTransformerwithAdaptive NodeSampling

Neural Information Processing Systems 

The Transformer architecture has achieved remarkable success in a number of domains including natural language processing and computer vision.