AITopics | conjugate kernel

Collaborating Authors

conjugate kernel

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Spectra of the Conjugate Kernel and Neural Tangent Kernel for linear-width neural networks

Neural Information Processing SystemsDec-24-2025, 01:47:00 GMT

We study the eigenvalue distributions of the Conjugate Kernel and Neural Tangent Kernel associated to multi-layer feedforward neural networks. In an asymptotic regime where network width is increasing linearly in sample size, under random initialization of the weights, and for input samples satisfying a notion of approximate pairwise orthogonality, we show that the eigenvalue distributions of the CK and NTK converge to deterministic limits. The limit for the CK is described by iterating the Marcenko-Pastur map across the hidden layers. The limit for the NTK is equivalent to that of a linear combination of the CK matrices across layers, and may be described by recursive fixed-point equations that extend this Marcenko-Pastur map. We demonstrate the agreement of these asymptotic predictions with the observed spectra for both synthetic and CIFAR-10 training data, and we perform a small simulation to investigate the evolutions of these spectra over training.

conjugate kernel, kernel and neural tangent kernel, linear-width neural network, (6 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

Concentration of measure for non-linear random matrices with applications to neural networks and non-commutative polynomials

Adamczak, Radosław

arXiv.org Artificial IntelligenceJul-15-2025

We prove concentration inequalities for several models of non-linear random matrices. As corollaries we obtain estimates for linear spectral statistics of the conjugate kernel of neural networks and non-commutative polynomials in (possibly dependent) random matrices.

artificial intelligence, inequality, machine learning, (20 more...)

arXiv.org Artificial Intelligence

2507.07625

Country:

Europe (0.92)
North America > United States (0.46)

Genre: Research Report (0.82)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

Thompson Sampling in Function Spaces via Neural Operators

Oliveira, Rafael, Wang, Xuesong, Chai, Kian Ming A., Bonilla, Edwin V.

arXiv.org Machine LearningJun-30-2025

We propose an extension of Thompson sampling to optimization problems over function spaces where the objective is a known functional of an unknown operator's output. We assume that functional evaluations are inexpensive, while queries to the operator (such as running a high-fidelity simulator) are costly. Our algorithm employs a sample-then-optimize approach using neural operator surrogates. This strategy avoids explicit uncertainty quantification by treating trained neural operators as approximate samples from a Gaussian process. We provide novel theoretical convergence guarantees, based on Gaussian processes in the infinite-dimensional setting, under minimal assumptions. We benchmark our method against existing baselines on functional optimization tasks involving partial differential equations and other nonlinear operator-driven phenomena, demonstrating improved sample efficiency and competitive performance.

artificial intelligence, machine learning, modeling & simulation, (19 more...)

arXiv.org Machine Learning

2506.21894

Country:

Europe > Austria > Vienna (0.14)
Oceania > Australia > New South Wales > Sydney (0.04)
North America > United States > Louisiana > Orleans Parish > New Orleans (0.04)
(7 more...)

Genre: Research Report (0.83)

Technology:

Information Technology > Modeling & Simulation (1.00)
Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

Review for NeurIPS paper: Spectra of the Conjugate Kernel and Neural Tangent Kernel for linear-width neural networks

Neural Information Processing SystemsJan-24-2025, 14:13:50 GMT

This deserves to be clarified (for instance, by clearly stating which are the actual contributions of the work).

conjugate kernel, kernel and neural tangent kernel, linear-width neural network, (10 more...)

Neural Information Processing Systems

Country:

North America > Guadeloupe (0.06)
Europe > Sweden > Stockholm > Stockholm (0.06)
Europe > France (0.06)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.40)

Add feedback

Review for NeurIPS paper: Spectra of the Conjugate Kernel and Neural Tangent Kernel for linear-width neural networks

Neural Information Processing SystemsJan-24-2025, 14:13:43 GMT

The reviewers and I are all confident that this paper will be interesting to the NeurIPS community and should be accepted. In addition to the improvements suggested by the reviewers, I would encourage the authors to expand the description of how to unfold the recursion in Theorem 3.7. The discussion in Appendix A helps, but it is insufficient as it is missing crucial details that would clarify how to interpret some of the ambiguous notation. I think including a detailed worked example would be an important addition.

conjugate kernel, kernel and neural tangent kernel, linear-width neural network, (2 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.40)

Add feedback

Spectra of the Conjugate Kernel and Neural Tangent Kernel for linear-width neural networks

Neural Information Processing SystemsOct-10-2024, 06:22:19 GMT

conjugate kernel, kernel and neural tangent kernel, linear-width neural network, (3 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

Double-descent curves in neural networks: a new perspective using Gaussian processes

Harzli, Ouns El, Valle-Pérez, Guillermo, Louis, Ard A.

arXiv.org Machine LearningFeb-16-2021

Double-descent curves in neural networks describe the phenomenon that the generalisation error initially descends with increasing parameters, then grows after reaching an optimal number of parameters which is less than the number of data points, but then descends again in the overparameterised regime. Here we use a neural network Gaussian process (NNGP) which maps exactly to a fully connected network (FCN) in the infinite width limit, combined with techniques from random matrix theory, to calculate this generalisation behaviour, with a particular focus on the overparameterised regime. We verify our predictions with numerical simulations of the corresponding Gaussian process regressions. An advantage of our NNGP approach is that the analytical calculations are easier to interpret. We argue that neural network generalization performance improves in the overparameterised regime precisely because that is where they converge to their equivalent Gaussian process.

matrix, neural network, spectral distribution, (14 more...)

arXiv.org Machine Learning

2102.07238

Country:

North America > United States (0.14)
North America > Canada > Ontario > Toronto (0.14)
Europe > United Kingdom > England > Oxfordshire > Oxford (0.14)
(2 more...)

Genre: Research Report (0.64)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

Characteristic Kernels and Infinitely Divisible Distributions

Nishiyama, Yu, Fukumizu, Kenji

arXiv.org Machine LearningOct-24-2016

We connect shift-invariant characteristic kernels to infinitely divisible distributions on $\mathbb{R}^{d}$. Characteristic kernels play an important role in machine learning applications with their kernel means to distinguish any two probability measures. The contribution of this paper is two-fold. First, we show, using the L\'evy-Khintchine formula, that any shift-invariant kernel given by a bounded, continuous and symmetric probability density function (pdf) of an infinitely divisible distribution on $\mathbb{R}^d$ is characteristic. We also present some closure property of such characteristic kernels under addition, pointwise product, and convolution. Second, in developing various kernel mean algorithms, it is fundamental to compute the following values: (i) kernel mean values $m_P(x)$, $x \in \mathcal{X}$, and (ii) kernel mean RKHS inner products ${\left\langle m_P, m_Q \right\rangle_{\mathcal{H}}}$, for probability measures $P, Q$. If $P, Q$, and kernel $k$ are Gaussians, then computation (i) and (ii) results in Gaussian pdfs that is tractable. We generalize this Gaussian combination to more general cases in the class of infinitely divisible distributions. We then introduce a {\it conjugate} kernel and {\it convolution trick}, so that the above (i) and (ii) have the same pdf form, expecting tractable computation at least in some cases. As specific instances, we explore $\alpha$-stable distributions and a rich class of generalized hyperbolic distributions, where the Laplace, Cauchy and Student-t distributions are included.

artificial intelligence, kernel, machine learning, (14 more...)

arXiv.org Machine Learning

1403.7304

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.14)
Asia > Japan > Honshū > Kantō > Kanagawa Prefecture (0.04)
Europe > Germany > Baden-Württemberg > Freiburg (0.04)
(6 more...)

Genre: Research Report (0.40)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (0.67)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.46)

Add feedback