Self-AttentionBetweenDatapoints: GoingBeyond IndividualInput-OutputPairsinDeepLearning Appendix TableofContents

Neural Information Processing Systems 

We next give a rough indication of prediction time behavior of NPT and the baselines.