Layer Specialization Underlying Compositional Reasoning in Transformers

Open in new window