AITopics | cholesky

01ce84968c6969bdd5d51c5eeaa3946a-AuthorFeedback.pdf

Neural Information Processing Systems | Feb-11-2026, 07:43:49 GMT

engineering effort, gps, sparse gps, (16 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence (0.31)

fcf55a303b71b84d326fb1d06e332a26-Paper.pdf

Neural Information Processing Systems | Feb-11-2026, 05:58:39 GMT

gaussian process, matrix, optimization, (17 more...)

Neural Information Processing Systems

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.14)
South America > Paraguay > Asunción > Asunción (0.04)
North America > United States > Pennsylvania (0.04)
North America > Canada (0.04)

Industry: Government (0.68)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)

Kernel Quadrature with Randomly Pivoted Cholesky

Neural Information Processing Systems | Dec-26-2025, 20:14:21 GMT

This paper presents new quadrature rules for functions in a reproducing kernel Hilbert space using nodes drawn by a sampling algorithm known as randomly pivoted Cholesky. The resulting computational procedure compares favorably to previous kernel quadrature methods, which either achieve low accuracy or require solving a computationally challenging sampling problem. Theoretical and numerical results show that randomly pivoted Cholesky is fast and achieves comparable quadrature error rates to more computationally expensive quadrature schemes based on continuous volume sampling, thinning, and recombination. Randomly pivoted Cholesky is easily adapted to complicated geometries with arbitrary kernels, unlocking new potential for kernel quadrature.
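The node-selection step described above can be sketched as follows. This is a minimal illustration, not the authors' implementation: the function name `rp_cholesky`, the Gaussian kernel, the bandwidth 0.3, the 200 candidate points, and the choice of computing weights by solving a linear system against the empirical kernel mean are all assumptions made for the example.

```python
import numpy as np

def rp_cholesky(A, k, rng):
    """Rank-k partial Cholesky of a PSD matrix A with randomly drawn pivots.

    Returns F (n x k) with A ~= F @ F.T, plus the list of pivot indices.
    """
    n = A.shape[0]
    F = np.zeros((n, k))
    d = np.diag(A).astype(float)           # diagonal of the current residual
    pivots = []
    for j in range(k):
        # Draw a pivot with probability proportional to the residual diagonal.
        i = int(rng.choice(n, p=d / d.sum()))
        g = A[:, i] - F[:, :j] @ F[i, :j]  # column i of the residual
        F[:, j] = g / np.sqrt(g[i])
        d = np.clip(d - F[:, j] ** 2, 0.0, None)
        d[i] = 0.0                         # a chosen pivot is never drawn again
        pivots.append(i)
    return F, pivots

def gaussian_kernel(s, t, bandwidth=0.3):
    """Gaussian kernel matrix between 1-D point sets s and t (assumed kernel)."""
    return np.exp(-(s[:, None] - t[None, :]) ** 2 / (2 * bandwidth ** 2))

rng = np.random.default_rng(0)
x = rng.uniform(0.0, 1.0, size=200)        # hypothetical candidate points
K = gaussian_kernel(x, x)

F, pivots = rp_cholesky(K, k=8, rng=rng)
nodes = x[pivots]

# Assumed weight rule: match the kernel mean of the empirical measure at the nodes.
z = K.mean(axis=1)
weights = np.linalg.solve(K[np.ix_(pivots, pivots)], z[pivots])

def integrand(t):
    """Test integrand: a kernel section centred at 0.4, so it lies in the RKHS."""
    return np.exp(-(t - 0.4) ** 2 / (2 * 0.3 ** 2))

estimate = weights @ integrand(nodes)      # quadrature with 8 nodes
exact = integrand(x).mean()                # integral under the empirical measure
```

Only the pivot columns of the kernel matrix are read inside the loop, which is why the procedure stays cheap when the full matrix is never formed; the dense `K` here is built only to keep the example short.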

cholesky, kernel quadrature, name change, (2 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.43)

the NLLs in the final version of the paper in addition to reporting averages and standard deviations in all of our other 3 tables by running more trials

Neural Information Processing Systems | Oct-1-2025, 22:47:53 GMT

We agree with all three reviewers that evaluating the predictive variances is important. Thank you for your comments and suggestions. Finally, we will clarify that SGPR is due to Titsias (2009) and SVGP is due to Hensman et al. (2013). This has important ramifications, e.g., … We were unaware of Nguyen's paper at submission, and we will add this discussion to the paper. We note that the precomputation, like CG, can be run to a desired tolerance. Hensman et al. (2013) used 1000 inducing points on the massive Airline dataset.

final version, gps, standard deviation, (16 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence (0.31)

Fast Matrix Square Roots with Applications to Gaussian Processes and Bayesian Optimization

Neural Information Processing Systems | Aug-17-2025, 10:09:38 GMT

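Matrix square roots and their inverses arise frequently in machine learning, e.g., when sampling from high-dimensional Gaussians $\mathcal{N}(\mathbf 0, \mathbf K)$ or whitening a vector $\mathbf b$ against covariance matrix $\mathbf K$.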

artificial intelligence, gaussian process, machine learning, (19 more...)

Neural Information Processing Systems

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.14)
South America > Paraguay > Asunción > Asunción (0.04)
North America > United States > Pennsylvania (0.04)
North America > Canada (0.04)

Industry: Government (0.68)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)

Kernel Quadrature with Randomly Pivoted Cholesky

Neural Information Processing Systems | Jan-19-2025, 22:48:37 GMT

cholesky, kernel quadrature, pivoted cholesky

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.50)

Randomly Pivoted Partial Cholesky: Random How?

Steinerberger, Stefan

arXiv.org Machine Learning | Apr-17-2024

We consider the problem of finding good low-rank approximations of symmetric, positive-definite $A \in \mathbb{R}^{n \times n}$. Chen-Epperly-Tropp-Webber showed, among many other things, that the randomly pivoted partial Cholesky algorithm that chooses the $i$-th row with probability proportional to the diagonal entry $A_{ii}$ leads to a universal contraction of the trace norm (the Schatten 1-norm) in expectation for each step. We show that if one chooses the $i$-th row with likelihood proportional to $A_{ii}^2$ one obtains the same result in the Frobenius norm (the Schatten 2-norm). Implications for the greedy pivoting rule and pivot selection strategies are discussed.
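The two pivot rules compared above can be examined on a small matrix by enumerating every possible pivot, so that the expectations are exact rather than sampled. This is a sketch under stated assumptions: the helper names, the `power` parameter, and the random 6 x 6 test matrix are hypothetical, and the code only checks that the expected norms shrink in one step; it does not reproduce the contraction rates proved in the paper.

```python
import numpy as np

def pivot_probabilities(A, power=1):
    """Probability of choosing pivot i, proportional to A_ii ** power."""
    d = np.diag(A) ** power
    return d / d.sum()

def residual_after_pivot(A, i):
    """Residual (Schur complement) after one partial Cholesky step on pivot i."""
    col = A[:, i]
    return A - np.outer(col, col) / A[i, i]

def expected_norm(A, power, norm):
    """Exact expectation of norm(residual) over one randomly chosen pivot."""
    p = pivot_probabilities(A, power)
    return sum(p[i] * norm(residual_after_pivot(A, i)) for i in range(A.shape[0]))

rng = np.random.default_rng(0)
B = rng.standard_normal((6, 6))
A = B @ B.T + 1e-3 * np.eye(6)             # hypothetical SPD test matrix

# Rule 1: probability proportional to A_ii, measured in the trace norm.
# The residual is PSD, so its trace norm equals its trace.
exp_trace = expected_norm(A, power=1, norm=np.trace)

# Rule 2: probability proportional to A_ii^2, measured in the Frobenius norm.
exp_frob = expected_norm(A, power=2, norm=lambda M: np.linalg.norm(M, "fro"))

# For rule 1 the expected residual has a closed form: A - A^2 / tr(A).
p1 = pivot_probabilities(A, power=1)
expected_residual = sum(p1[i] * residual_after_pivot(A, i) for i in range(6))
closed_form = A - A @ A / np.trace(A)
```

The closed form follows directly from averaging the rank-one updates with weights $A_{ii} / \operatorname{tr}(A)$, which cancels the $1/A_{ii}$ factor in each update.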

approximation, frobenius norm, matrix, (15 more...)

arXiv.org Machine Learning

2404.11487

Country: North America > United States > Washington > King County > Seattle (0.14)

Genre: Research Report (0.50)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.68)

Practical and Matching Gradient Variance Bounds for Black-Box Variational Bayesian Inference

Kim, Kyurae, Wu, Kaiwen, Oh, Jisu, Gardner, Jacob R.

arXiv.org Artificial Intelligence | Jun-3-2023

Understanding the gradient variance of black-box variational inference (BBVI) is a crucial step for establishing its convergence and developing algorithmic improvements. However, existing studies have yet to show that the gradient variance of BBVI satisfies the conditions used to study the convergence of stochastic gradient descent (SGD), the workhorse of BBVI. In this work, we show that BBVI satisfies a matching bound corresponding to the condition used in the SGD literature when applied to smooth and quadratically-growing log-likelihoods. Our results generalize to nonlinear covariance parameterizations widely used in the practice of BBVI.

Despite the advances of BBVI, little is known about its theoretical properties. Even when restricted to the location-scale family (Definition 2), it is unknown whether BBVI is guaranteed to converge without having to modify the algorithms used in practice, for example, by enforcing bounded domains, bounded support, bounded gradients, and such. This theoretical insight is necessary since BBVI methods are known to be less robust (Yao et al., 2018; Dhaka et al., 2020; Welandawe et al., 2022; Dhaka et al., 2021; Domke, 2020) compared to other inference methods such as Markov chain Monte Carlo. Although progress has been made to formalize the theory of BBVI with some generality, the gap between our understanding of BBVI and the …
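To make "gradient variance" concrete, the sketch below estimates the variance of the reparameterized gradient with respect to the location parameter of a Gaussian location-scale family. Everything specific here is an assumption chosen for illustration: the quadratic log-density with matrix `H`, the location `m`, the lower-triangular scale `C`, and the sample size. It is not the bound or the setting analyzed in the paper.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical target: log p(z) = -0.5 * z^T H z, so grad log p(z) = -H z.
H = np.diag([1.0, 2.0, 4.0])
m = np.array([0.5, -1.0, 0.25])            # location parameter
C = np.array([[1.0, 0.0, 0.0],             # lower-triangular (Cholesky-style) scale
              [0.3, 0.8, 0.0],
              [-0.2, 0.1, 0.5]])

def location_gradients(num_samples):
    """Reparameterized gradient samples w.r.t. m, using z = m + C u, u ~ N(0, I)."""
    u = rng.standard_normal((num_samples, 3))
    z = m + u @ C.T
    return -z @ H                           # each row is -H z (H is symmetric)

grads = location_gradients(50_000)

# Total variance of the estimator: sum of per-coordinate variances.
empirical_variance = grads.var(axis=0).sum()

# For this quadratic target, g - E[g] = -H C u, so the variance is ||H C||_F^2.
analytic_variance = np.linalg.norm(H @ C, "fro") ** 2
```

The entropy term of the objective does not depend on the location parameter in a location-scale family, which is why only the log-density gradient appears here.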

artificial intelligence, machine learning, parameterization, (14 more...)

arXiv.org Artificial Intelligence

2303.10472

Country:

Asia > Bangladesh > Dhaka Division > Dhaka District > Dhaka (0.44)
North America > United States > Pennsylvania > Philadelphia County > Philadelphia (0.14)
Asia > Middle East > Jordan (0.04)
(4 more...)

Genre: Research Report > New Finding (0.34)

Industry: Transportation > Air (0.43)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.64)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.54)

Fast Matrix Square Roots with Applications to Gaussian Processes and Bayesian Optimization

Pleiss, Geoff, Jankowiak, Martin, Eriksson, David, Damle, Anil, Gardner, Jacob R.

arXiv.org Machine Learning | Jun-19-2020

Matrix square roots and their inverses arise frequently in machine learning, e.g., when sampling from high-dimensional Gaussians $\mathcal{N}(\mathbf 0, \mathbf K)$ or whitening a vector $\mathbf b$ against covariance matrix $\mathbf K$. While existing methods typically require $O(N^3)$ computation, we introduce a highly efficient quadratic-time algorithm for computing $\mathbf K^{1/2} \mathbf b$, $\mathbf K^{-1/2} \mathbf b$, and their derivatives through matrix-vector multiplications (MVMs). Our method combines Krylov subspace methods with a rational approximation and typically achieves $4$ decimal places of accuracy with fewer than $100$ MVMs. Moreover, the backward pass requires little additional computation. We demonstrate our method's applicability on matrices as large as $50,\!000 \times 50,\!000$ - well beyond traditional methods - with little approximation error. Applying this increased scalability to variational Gaussian processes, Bayesian optimization, and Gibbs sampling results in more powerful models with higher accuracy.
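For reference, the two quantities named above can be computed with a dense eigendecomposition. This is the cubic-cost baseline that the abstract contrasts against, not the Krylov and rational-approximation method of the paper; the function name `matrix_sqrt_apply` and the 5 x 5 test matrix are assumptions for the example.

```python
import numpy as np

def matrix_sqrt_apply(K, b, inverse=False):
    """Return K^{1/2} b, or K^{-1/2} b when inverse=True, for SPD K.

    Uses a full eigendecomposition, which costs O(N^3).
    """
    evals, evecs = np.linalg.eigh(K)
    scale = 1.0 / np.sqrt(evals) if inverse else np.sqrt(evals)
    return evecs @ (scale * (evecs.T @ b))

rng = np.random.default_rng(0)
B = rng.standard_normal((5, 5))
K = B @ B.T + np.eye(5)                    # hypothetical SPD covariance matrix
b = rng.standard_normal(5)

# Sampling: if eps ~ N(0, I), then K^{1/2} eps ~ N(0, K).
eps = rng.standard_normal(5)
sample = matrix_sqrt_apply(K, eps)

# Whitening: K^{-1/2} b has identity covariance when b has covariance K.
whitened = matrix_sqrt_apply(K, b, inverse=True)

# Consistency checks: applying the two maps in sequence recovers b,
# and applying the square root twice gives K b.
roundtrip = matrix_sqrt_apply(K, whitened)
twice = matrix_sqrt_apply(K, matrix_sqrt_apply(K, b))
```

The symmetric square root used here differs from a Cholesky factor, although both can be used to draw samples with covariance $\mathbf K$.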

artificial intelligence, bayesian inference, machine learning, (19 more...)

arXiv.org Machine Learning

2006.11267

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.14)
North America > United States > Pennsylvania (0.04)
South America > Paraguay > Asunción > Asunción (0.04)
North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)

Genre: Research Report (0.50)

Industry: Government (0.67)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.34)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.34)

Optimal whitening and decorrelation

Kessy, Agnan, Lewin, Alex, Strimmer, Korbinian

arXiv.org Machine Learning | Dec-17-2016

Whitening, or sphering, is a common preprocessing step in statistical analysis to transform random variables to orthogonality. However, due to rotational freedom there are infinitely many possible whitening procedures. Consequently, there is a diverse range of sphering methods in use, for example based on principal component analysis (PCA), Cholesky matrix decomposition and zero-phase component analysis (ZCA), among others. Here we provide an overview of the underlying theory and discuss five natural whitening procedures. Subsequently, we demonstrate that investigating the cross-covariance and the cross-correlation matrix between sphered and original variables allows us to break the rotational invariance and to identify optimal whitening transformations. As a result we recommend two particular approaches: ZCA-cor whitening to produce sphered variables that are maximally similar to the original variables, and PCA-cor whitening to obtain sphered variables that maximally compress the original variables.
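The rotational freedom mentioned above can be seen by building three of the named whitening matrices and checking that each maps the covariance to the identity. This is a sketch only: it covers the ZCA, PCA, and Cholesky variants rather than all five procedures, takes the Cholesky variant as the transposed factor of the inverse covariance (an assumed convention), and uses a hypothetical mixing matrix to generate correlated data.

```python
import numpy as np

def whitening_matrices(sigma):
    """Return ZCA, PCA, and Cholesky whitening matrices W with W sigma W^T = I."""
    evals, U = np.linalg.eigh(sigma)
    inv_sqrt = 1.0 / np.sqrt(evals)
    W_pca = inv_sqrt[:, None] * U.T                 # Lambda^{-1/2} U^T
    W_zca = U @ W_pca                               # U Lambda^{-1/2} U^T = sigma^{-1/2}
    L = np.linalg.cholesky(np.linalg.inv(sigma))    # L L^T = sigma^{-1}
    W_chol = L.T
    return {"ZCA": W_zca, "PCA": W_pca, "Cholesky": W_chol}

rng = np.random.default_rng(0)
mixing = np.array([[2.0, 0.0, 0.0, 0.0],            # hypothetical mixing matrix
                   [1.0, 1.0, 0.0, 0.0],
                   [0.0, 0.5, 1.0, 0.0],
                   [0.0, 0.0, 0.3, 0.5]])
X = rng.standard_normal((500, 4)) @ mixing.T         # correlated sample data
sigma = np.cov(X, rowvar=False)

W = whitening_matrices(sigma)
whitened = {name: (X - X.mean(axis=0)) @ Wn.T for name, Wn in W.items()}

# Any two whitening matrices differ by an orthogonal factor Q = W_a W_b^{-1}.
Q = W["PCA"] @ np.linalg.inv(W["ZCA"])
```

All three matrices satisfy $W^\top W = \Sigma^{-1}$, so the choice among them is exactly the choice of rotation that the abstract proposes to resolve.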

artificial intelligence, machine learning, survey article, (17 more...)

arXiv.org Machine Learning

doi: 10.1080/00031305.2016.1277159

1512.00809

Genre:

Overview (0.54)
Research Report (0.50)

Industry: Health & Medicine (0.93)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.34)
