How do Transformer Embeddings Represent Compositions? A Functional Analysis
