Transformer learns the cross-task prior and regularization for in-context learning

Open in new window