In-Context Learning with Transformers: Softmax Attention Adapts to Function Lipschitzness
