Lipschitz Normalization for Self-Attention Layers with Application to Graph Neural Networks

Open in new window