Understanding Factual Recall in Transformers via Associative Memories

Open in new window