A Theory of Diversity for Random Matrices with Applications to In-Context Learning of Schrödinger Equations
Cole, Frank, Lu, Yulong, Sehgal, Shaurya
We address the following question: given a collection $\{\mathbf{A}^{(1)}, \dots, \mathbf{A}^{(N)}\}$ of independent $d \times d$ random matrices drawn from a common distribution $\mathbb{P}$, what is the probability that the centralizer of $\{\mathbf{A}^{(1)}, \dots, \mathbf{A}^{(N)}\}$ is trivial? We provide lower bounds on this probability in terms of the sample size $N$ and the dimension $d$ for several families of random matrices which arise from the discretization of linear Schrödinger operators with random potentials. When combined with recent work on machine learning theory, our results provide guarantees on the generalization ability of transformer-based neural networks for in-context learning of Schrödinger equations.
Jan-21-2026
- Country:
- Europe > United Kingdom
- North Sea > Southern North Sea (0.04)
- North America > United States
- Massachusetts (0.04)
- Europe > United Kingdom
- Genre:
- Research Report > New Finding (0.66)
- Technology: