Spectrum-Adaptive Generalization Bounds for Trained Deep Transformers

Open in new window