Transformers as Algorithms: Generalization and Stability in In-context Learning

Open in new window