Why bother with geometry? On the relevance of linear decompositions of Transformer embeddings
