Supplementary Information for: Fast Matrix Square Roots with Applications to Gaussian Processes and Bayesian Optimization
– Neural Information Processing Systems
We note that all methods incur some sampling error, regardless of the subset size (N). In Fig. S6 we plot the learned hyperparameters of the Precipitation SVGP models: 1) o² (the kernel outputscale), which roughly corresponds to variance explained as "signal" in the data; 2) σ²_obs (the observational noise), which roughly corresponds to variance explained away as noise; and 3) ν (the degrees of freedom), which controls the tails of the noise model (lower ν corresponds to heavier tails). As M increases, we find that the observational noise parameter decreases by a factor of 4 (down from 0.19 to 0.05), while the kernel outputscale (left) increases.

Fig. S7 is a histogram displaying the number of msMINRES iterations needed to achieve a relative residual of 10⁻³ when training an M = 5,000 SVGP model on the 3droad dataset (subsampled to 30,000 data points).
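The claim that lower ν yields heavier tails can be seen directly from the Student-t density kernel, (1 + x²/ν)^(−(ν+1)/2), which decays polynomially rather than exponentially. A minimal sketch (not from the paper; the values of ν below are illustrative only) comparing how far the density has fallen at 5 units from the mean, relative to its peak:

```python
import math

def student_t_relative_height(x, nu):
    """Unnormalized Student-t density (1 + x^2/nu)^(-(nu+1)/2).

    Its value at x = 0 is 1, so this directly measures how slowly
    the density decays away from the peak (tail heaviness)."""
    return (1.0 + x * x / nu) ** (-(nu + 1.0) / 2.0)

def gaussian_relative_height(x):
    """Unnormalized Gaussian density exp(-x^2 / 2), peak value 1."""
    return math.exp(-0.5 * x * x)

# Illustrative comparison at x = 5:
heavy = student_t_relative_height(5.0, nu=2.0)    # small nu: heavy tails
light = student_t_relative_height(5.0, nu=30.0)   # large nu: near-Gaussian
gauss = gaussian_relative_height(5.0)

# The heavy-tailed noise model assigns far more relative mass to
# outliers than the near-Gaussian one, which in turn exceeds the
# Gaussian: heavy > light > gauss.
```

This ordering is why a learned small ν lets the likelihood explain outlying precipitation observations without inflating σ²_obs.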
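The stopping rule behind the Fig. S7 histogram is a standard Krylov-method criterion: iterate until the relative residual ||b − Ax|| / ||b|| drops below a tolerance (10⁻³ above). A minimal sketch of that criterion, using plain conjugate gradients on a dense SPD matrix rather than the paper's msMINRES, and pure-Python lists rather than a tensor library:

```python
import math

def cg_with_relative_residual(A, b, tol=1e-3, max_iter=1000):
    """Conjugate gradients on a dense SPD matrix A (list of rows),
    stopping once ||b - A x|| / ||b|| < tol.

    Returns (x, iterations) so the iteration count can be histogrammed,
    as in Fig. S7 (which uses msMINRES, not CG)."""
    n = len(b)
    matvec = lambda v: [sum(A[i][j] * v[j] for j in range(n)) for i in range(n)]
    x = [0.0] * n
    r = b[:]                 # residual of the zero initial guess: b - A*0
    p = r[:]
    rs = sum(ri * ri for ri in r)
    b_norm = math.sqrt(sum(bi * bi for bi in b))
    for k in range(1, max_iter + 1):
        Ap = matvec(p)
        alpha = rs / sum(pi * api for pi, api in zip(p, Ap))
        x = [xi + alpha * pi for xi, pi in zip(x, p)]
        r = [ri - alpha * api for ri, api in zip(r, Ap)]
        rs_new = sum(ri * ri for ri in r)
        if math.sqrt(rs_new) / b_norm < tol:   # relative-residual test
            return x, k
        p = [ri + (rs_new / rs) * pi for ri, pi in zip(r, p)]
        rs = rs_new
    return x, max_iter
```

The same relative-residual test applies unchanged to msMINRES; only the recurrence that produces the iterates differs.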