Self-AttentionBetweenDatapoints: GoingBeyond IndividualInput-OutputPairsinDeepLearning Appendix TableofContents