Analyzing the Inner Workings of Transformers in Compositional Generalization
