





Supplementary Material: Misspecified GP Bandit Optimization. Ilija Bogunovic and Andreas Krause (NeurIPS 2021). Appendix A, GP bandits: useful definitions and auxiliary results (realizable setting).

Neural Information Processing Systems

Such assumptions on the noise variables are frequently used in bandit optimization. The appendix works with a Gaussian process (supported on D) with the corresponding kernel function, whose posterior mean and variance correspond to Eq. (8). While the first two terms in this bound can be effectively controlled and bounded as in the proof of Theorem 1, the last term requires a separate argument; such a function can easily be constructed, e.g., via the approach outlined in [36].
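The GP posterior mean and variance referred to in the excerpt above have a standard closed form. A minimal sketch, assuming a squared-exponential kernel and Gaussian observation noise (the function names and hyperparameter values here are illustrative, not taken from the paper):

```python
import numpy as np

def rbf_kernel(A, B, lengthscale=1.0):
    # Squared-exponential kernel k(x, x') = exp(-||x - x'||^2 / (2 l^2)).
    d2 = np.sum(A**2, 1)[:, None] + np.sum(B**2, 1)[None, :] - 2 * A @ B.T
    return np.exp(-np.maximum(d2, 0) / (2 * lengthscale**2))

def gp_posterior(X, y, Xs, noise_var=0.01, lengthscale=1.0):
    # Standard GP posterior at test points Xs given noisy observations (X, y):
    #   mean(x) = k(x, X) (K + s^2 I)^{-1} y
    #   var(x)  = k(x, x) - k(x, X) (K + s^2 I)^{-1} k(X, x)
    K = rbf_kernel(X, X, lengthscale) + noise_var * np.eye(len(X))
    Ks = rbf_kernel(X, Xs, lengthscale)
    Kss = rbf_kernel(Xs, Xs, lengthscale)
    mean = Ks.T @ np.linalg.solve(K, y)
    var = np.diag(Kss - Ks.T @ np.linalg.solve(K, Ks))
    return mean, var
```

At an observed input the posterior mean is pulled close to the observed value and the posterior variance shrinks toward the noise level, which is what confidence-bound arguments in this literature exploit.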


Consequences of Kernel Regularity for Bandit Optimization

Madison Lee, Tara Javidi

arXiv.org Machine Learning

In this work we investigate the relationship between kernel regularity and algorithmic performance in the bandit optimization of RKHS functions. While reproducing kernel Hilbert space (RKHS) methods traditionally rely on global kernel regressors, it is also common to use a smoothness-based approach that exploits local approximations. We show that these perspectives are deeply connected through the spectral properties of isotropic kernels. In particular, we characterize the Fourier spectra of the Matérn, square-exponential, rational-quadratic, $\gamma$-exponential, piecewise-polynomial, and Dirichlet kernels, and show that the decay rate determines asymptotic regret from both viewpoints. For kernelized bandit algorithms, spectral decay yields upper bounds on the maximum information gain, governing worst-case regret, while for smoothness-based methods, the same decay rates establish Hölder space embeddings and Besov space norm-equivalences, enabling local continuity analysis. These connections show that kernel-based and locally adaptive algorithms can be analyzed within a unified framework. This allows us to derive explicit regret bounds for each kernel family, obtaining novel results in several cases and providing improved analysis for others. Furthermore, we analyze LP-GP-UCB, an algorithm that combines both approaches, augmenting global Gaussian process surrogates with local polynomial estimators. While the hybrid approach does not uniformly dominate specialized methods, it achieves order-optimality across multiple kernel families.
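The kernelized bandit algorithms discussed in this abstract follow the GP-UCB template: fit a GP surrogate and query the point maximizing an upper confidence bound. A minimal sketch with a Matérn-5/2 kernel, assuming a finite candidate domain and a fixed exploration parameter beta (all hyperparameter values here are illustrative):

```python
import numpy as np

def matern52(A, B, lengthscale=0.2):
    # Matern-5/2 kernel; the polynomial decay of its Fourier spectrum is
    # what drives the information-gain and regret bounds for this family.
    sq = np.sum(A**2, 1)[:, None] + np.sum(B**2, 1)[None, :] - 2 * A @ B.T
    r = np.sqrt(5) * np.sqrt(np.maximum(sq, 0)) / lengthscale
    return (1 + r + r**2 / 3) * np.exp(-r)

def gp_ucb(f, domain, T=30, beta=2.0, noise_var=1e-3, seed=0):
    # GP-UCB over a finite domain: each round, query the candidate that
    # maximizes mu(x) + sqrt(beta) * sigma(x) under the current posterior.
    rng = np.random.default_rng(seed)
    X = [domain[rng.integers(len(domain))]]          # random first query
    y = [f(X[0]) + rng.normal(0, np.sqrt(noise_var))]
    for _ in range(T - 1):
        Xa, ya = np.array(X), np.array(y)
        K = matern52(Xa, Xa) + noise_var * np.eye(len(Xa))
        Ks = matern52(Xa, domain)                    # (n, |domain|)
        mean = Ks.T @ np.linalg.solve(K, ya)
        var = np.maximum(1.0 - np.sum(Ks * np.linalg.solve(K, Ks), 0), 0)
        x = domain[np.argmax(mean + np.sqrt(beta * var))]
        X.append(x)
        y.append(f(x) + rng.normal(0, np.sqrt(noise_var)))
    return X[int(np.argmax(y))]                      # best observed point
```

The cumulative-regret guarantees for this scheme scale with the maximum information gain of the kernel, which is exactly where the spectral-decay characterizations in the abstract enter.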



Graph Neural Network Bandits

Neural Information Processing Systems

The key challenges in this setting are scaling to large domains and to graphs with many nodes. We resolve these challenges by embedding permutation invariance into our model.