Beyond position: how rotary embeddings shape representations and memory in autoregressive transfomers

Open in new window