Grokking of Implicit Reasoning in Transformers: A Mechanistic Journey to the Edge of Generalization

Neural Information Processing Systems 

We study whether transformers can learn to implicitly reason over parametric knowledge, a skill that even the most capable language models struggle with.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found