Explaining grokking through circuit efficiency

Open in new window