All for One: LLMs Solve Mental Math at the Last Token With Information Transferred From Other Tokens

Open in new window