Towards Understanding Grokking: An Effective Theory of Representation Learning

Open in new window