Beyond the Doors of Perception: Vision Transformers Represent Relations Between Objects

Michael A. Lepori

Neural Information Processing Systems 

Though the ability to compute over abstract visual relations is thought to be fundamental to human visual intelligence (Ullman, 1987; Hespos et al., 2021), the ability of neural networks to perform such