AITopics | dim eff

Collaborating Authors

dim eff

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Provably Strict Generalisation Benefit for Invariance in Kernel Methods

Neural Information Processing SystemsFeb-9-2026, 21:31:30 GMT

It is a commonly held belief that enforcing invariance improves generalisation.

artificial intelligence, invariance, machine learning, (12 more...)

Neural Information Processing Systems

Country: Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)

Genre: Research Report (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.46)
Information Technology > Artificial Intelligence > Machine Learning > Kernel Methods (0.41)

Add feedback

Provably Strict Generalisation Benefit for Invariance in Kernel Methods

Elesedy, Bryn

arXiv.org Machine LearningJun-4-2021

It is a commonly held belief that enforcing invariance improves generalisation. Although this approach enjoys widespread popularity, it is only very recently that a rigorous theoretical demonstration of this benefit has been established. In this work we build on the function space perspective of Elesedy and Zaidi arXiv:2102.10333 to derive a strictly non-zero generalisation benefit of incorporating invariance in kernel ridge regression when the target is invariant to the action of a compact group. We study invariance enforced by feature averaging and find that generalisation is governed by a notion of effective dimension that arises from the interplay between the kernel and the group. In building towards this result, we find that the action of the group induces an orthogonal decomposition of both the reproducing kernel Hilbert space and its kernel, which may be of interest in its own right.

generalisation, invariance, kernel ridge regression, (12 more...)

arXiv.org Machine Learning

2106.02346

Country: Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.46)
Information Technology > Artificial Intelligence > Machine Learning > Kernel Methods (0.41)

Add feedback

A scale-dependent notion of effective dimension

Berezniuk, Oksana, Figalli, Alessio, Ghigliazza, Raffaele, Musaelian, Kharen

arXiv.org Machine LearningJan-29-2020

Email: kharen@dualitygroup.com January 30, 2020 Abstract We introduce a notion of "effective dimension" of a statistical model based on the number of cubes of size 1 / n needed to cover the model space when endowed with the Fisher Information Matrix as metric, n being the number of observations. The effective dimension is then measured via the spectrum of the Fisher Information Matrix regularized using this natural scale. A very important and challenging question in statistics and machine learning is the "real" dimension of a statistical model, such as a neural network. Many definitions of effective dimension have been proposed in the literature, either based on the so-called VC dimension (see for instance [13]), or on Gardner phase-space approach [6], or also on some effective dimension based on the rank of the Jacobian matrix of the transformation between the parameters of the network and the parameters of the observable variables [2, 15] (see also [14, 1, 4, 7]). Although these notions of dimension are all very natural when the number of observations go to infinity, they do not take into account the fact that only a finite-size sample of data is available.

dim eff, dimension, effective dimension, (15 more...)

arXiv.org Machine Learning

2001.10872

Country:

Europe > Switzerland > Zürich > Zürich (0.15)
North America > United States > California > San Francisco County > San Francisco (0.14)
North America > United States > New York > New York County > New York City (0.05)
Europe > Poland > Masovia Province > Warsaw (0.04)

Genre: Research Report (0.40)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Computational Learning Theory (0.49)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.37)

Add feedback