AITopics | model space

Variance-Aware Feel-Good Thompson Sampling for Contextual Bandits

Neural Information Processing Systems, Jun-11-2026, 03:10:01 GMT

Variance-dependent regret bounds have received increasing attention in recent studies on contextual bandits. However, most of these studies focus on upper confidence bound (UCB)-based bandit algorithms, while sampling-based bandit algorithms such as Thompson sampling remain understudied. The only exception is the `LinVDTS` algorithm (Xu et al., 2023), which is limited to linear reward functions and whose regret bound is not optimal with respect to the model dimension. In this paper, we present `FGTSVA`, a variance-aware Thompson sampling algorithm for contextual bandits with general reward functions that attains an optimal regret bound. At the core of our analysis is an extension of the decoupling coefficient, a technique commonly used in the analysis of Feel-good Thompson sampling (FGTS) that reflects the complexity of the model space.

artificial intelligence, machine learning, proceedings, (12 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Observable Geometry of Singular Statistical Models

Plummer, Sean

arXiv.org Machine Learning, Apr-3-2026

Singular statistical models arise whenever different parameter values induce the same distribution, leading to non-identifiability and a breakdown of classical asymptotic theory. While existing approaches analyze these phenomena in parameter space, the resulting descriptions depend heavily on parameterization and obscure the intrinsic statistical structure of the model. In this paper, we introduce an invariant framework based on observable charts: collections of functionals of the data distribution that distinguish probability measures. These charts define local coordinate systems directly on the model space, independent of parameterization. We formalize observable completeness as the ability of such charts to detect identifiable directions, and introduce observable order to quantify higher-order distinguishability along analytic perturbations. Our main result establishes that, under mild regularity conditions, observable order provides a lower bound on the rate at which Kullback-Leibler divergence vanishes along analytic paths. This connects intrinsic geometric structure in model space to statistical distinguishability and recovers classical behavior in regular models while extending naturally to singular settings. We illustrate the framework in reduced-rank regression and Gaussian mixture models, where observable coordinates reveal both identifiable structure and singular degeneracies. These results suggest that observable charts provide a unified and parameterization-invariant language for studying singular models and offer a pathway toward intrinsic formulations of invariants such as learning coefficients.
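
As a hedged illustration of a Kullback-Leibler rate that departs from the classical one, consider a toy one-parameter model chosen here for convenience: p_θ = N(θ², 1). This example is an assumption of this note, not taken from the paper, and it does not compute the paper's observable order. The parameters θ and -θ induce the same distribution, and the Fisher information vanishes at θ = 0:

```latex
\[
\mathrm{KL}\bigl(p_0 \,\|\, p_t\bigr)
  = \frac{(0 - t^2)^2}{2}
  = \frac{t^4}{2},
\qquad
I(\theta) = \Bigl(\frac{d}{d\theta}\,\theta^2\Bigr)^{2} = 4\theta^2,
\quad I(0) = 0 .
\]
```

Along the path θ = t the divergence vanishes at order t⁴, whereas a regular model with nonzero Fisher information would give order t².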

artificial intelligence, machine learning, observable chart, (17 more...)

arXiv:2604.01267

Country: Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Genre: Research Report > New Finding (0.49)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Bayesian optimization for automated model selection

Gustavo Malkomes, Charles Schaff, Roman Garnett

Neural Information Processing Systems, Mar-23-2026, 06:54:06 GMT

Neural Information Processing Systems http://nips.cc/

artificial intelligence, kernel, machine learning, (14 more...)

Genre: Research Report (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.46)

Neural Latent Geometry Search: Product Manifold Inference via Gromov-Hausdorff-Informed Bayesian Optimization

Haitz Sáez de Ocáriz Borde (Oxford Robotics Institute, University of Oxford), Álvaro Arroyo

Neural Information Processing Systems, Feb-15-2026, 01:51:46 GMT

Recent research indicates that the performance of machine learning models can be improved by aligning the geometry of the latent space with the underlying data structure.

artificial intelligence, machine learning, manifold, (18 more...)

Country:

Europe > United Kingdom > England > Oxfordshire > Oxford (0.40)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
North America > Canada > Ontario > Toronto (0.04)

Genre: Research Report > New Finding (0.66)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.93)
Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.73)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.67)

Deep Model Transferability from Attribution Maps

Jie Song, Yixin Chen, Xinchao Wang, Chengchao Shen, Mingli Song

Neural Information Processing Systems, Feb-14-2026, 21:53:28 GMT

Neural Information Processing Systems http://nips.cc/

attribution map, taskonomy, transferability, (13 more...)

Country:

North America > Canada (0.04)
Europe > Italy > Marche > Ancona Province > Ancona (0.04)
Asia > China > Zhejiang Province (0.04)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)

Learning Others' Intentional Models in Multi-Agent Settings Using Interactive POMDPs

Yanlin Han, Piotr Gmytrasiewicz

Neural Information Processing Systems, Feb-12-2026, 23:54:09 GMT

The interactive POMDP (I-POMDP) framework extends POMDPs to multi-agent settings by including models of other agents in the state space and forming a hierarchical belief structure.

agent, artificial intelligence, machine learning, (16 more...)

Country:

North America > United States > Illinois > Cook County > Chicago (0.04)
North America > Canada > Quebec > Montreal (0.04)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.76)

c62fe1daeb10814d33e5a33ba466ecaf-Supplemental-Conference.pdf

Neural Information Processing Systems, Feb-11-2026, 20:41:05 GMT

fpr, roc curve, tpr, (15 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)

c62fe1daeb10814d33e5a33ba466ecaf-Paper-Conference.pdf

Neural Information Processing Systems, Feb-11-2026, 20:41:01 GMT

classifier, optimal roc curve, roc curve, (16 more...)

Country:

North America > United States > Ohio > Franklin County > Columbus (0.04)
North America > United States > New Mexico > Los Alamos County > Los Alamos (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Research Report (0.68)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)

Optimal Transport Model Distributional Robustness

Van-Anh Nguyen

Neural Information Processing Systems, Feb-11-2026, 12:17:03 GMT

Sharpness-aware minimization (SAM) aims to find a perturbed model within the vicinity of a current model that maximizes the loss over a training set.
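
The perturbation step described above can be sketched as follows. This is a minimal illustration of the standard first-order approximation used in sharpness-aware minimization, not the optimal-transport method of the listed paper. The function names `sam_perturbation`, `sam_step`, and `quad_grad`, the radius `rho`, the learning rate `lr`, and the quadratic toy loss are all assumptions made for this example.

```python
import math
from typing import Callable, List

Vector = List[float]


def sam_perturbation(grad: Vector, rho: float) -> Vector:
    """First-order estimate of the loss-maximizing perturbation
    inside an L2 ball of radius rho: rho * grad / ||grad||."""
    norm = math.sqrt(sum(g * g for g in grad))
    if norm == 0.0:
        # At a stationary point the first-order estimate is zero.
        return [0.0 for _ in grad]
    return [rho * g / norm for g in grad]


def sam_step(w: Vector, grad_fn: Callable[[Vector], Vector],
             lr: float, rho: float) -> Vector:
    """One update: move to the perturbed model, take the gradient there,
    and apply that gradient to the original weights."""
    eps = sam_perturbation(grad_fn(w), rho)
    w_perturbed = [wi + ei for wi, ei in zip(w, eps)]
    g_perturbed = grad_fn(w_perturbed)
    return [wi - lr * gi for wi, gi in zip(w, g_perturbed)]


def quad_grad(w: Vector) -> Vector:
    """Gradient of the toy loss L(w) = sum_i w_i^2 (hypothetical)."""
    return [2.0 * wi for wi in w]


w_next = sam_step([3.0, 4.0], quad_grad, lr=0.1, rho=0.5)
print(w_next)
```

With the toy loss, the gradient at (3, 4) has norm 10, so the perturbation is (0.3, 0.4) and the update uses the gradient evaluated at (3.3, 4.4).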

artificial intelligence, distributional robustness, machine learning, (14 more...)

Country:

Oceania > Australia (0.04)
Europe > Ireland > Leinster > County Dublin > Dublin (0.04)
Asia > Vietnam (0.04)

Genre: Research Report > New Finding (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)

On the consistent estimation of optimal Receiver Operating Characteristic (ROC) curve

Neural Information Processing Systems, Dec-25-2025, 05:51:36 GMT

Under a standard binary classification setting with possible model misspecification, we study the problem of estimating a general Receiver Operating Characteristic (ROC) curve, which is an arbitrary set of false positive rate (FPR) and true positive rate (TPR) pairs. We formally introduce the notion of an optimal ROC curve over a general model space. It is argued that any ROC curve estimation method implemented over the given model space should target the optimal ROC curve over that space. Three popular ROC curve estimation methods are then analyzed at the population level (i.e., when there is an infinite number of samples) under both correct and incorrect model specification. Based on our analysis, they are all consistent when the surrogate loss function satisfies certain conditions and the given model space includes all measurable classifiers. Interestingly, some of these conditions are similar to those that are required to ensure classification consistency. When the model space is incorrectly specified, however, we show that only one method leads to consistent estimation of the ROC curve over the chosen model space. We present some numerical results to demonstrate the effects of model misspecification on the performance of various methods in terms of their ROC curve estimates.
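
To make the notion of an ROC curve as a set of (FPR, TPR) pairs concrete, here is a minimal sketch of the usual empirical construction, which thresholds a classifier's scores at each distinct value. It is not one of the three estimation methods analyzed in the paper. The function name `roc_points` and the toy scores and labels are assumptions made for this example.

```python
from typing import List, Tuple


def roc_points(scores: List[float], labels: List[int]) -> List[Tuple[float, float]]:
    """Return (FPR, TPR) pairs from thresholding scores at each distinct value.

    labels are 1 for positive and 0 for negative; a sample is predicted
    positive when its score is at or above the threshold.
    """
    n_pos = sum(labels)
    n_neg = len(labels) - n_pos
    if n_pos == 0 or n_neg == 0:
        raise ValueError("need at least one positive and one negative label")

    # A threshold above every score predicts nothing positive.
    points = [(0.0, 0.0)]
    for threshold in sorted(set(scores), reverse=True):
        tp = sum(1 for s, y in zip(scores, labels) if s >= threshold and y == 1)
        fp = sum(1 for s, y in zip(scores, labels) if s >= threshold and y == 0)
        points.append((fp / n_neg, tp / n_pos))
    return points


points = roc_points([0.9, 0.8, 0.4, 0.3], [1, 0, 1, 0])
print(points)
# [(0.0, 0.0), (0.0, 0.5), (0.5, 0.5), (0.5, 1.0), (1.0, 1.0)]
```

Each threshold contributes one point, so lowering the threshold moves the curve from (0, 0) toward (1, 1).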

consistent estimation, optimal receiver operating characteristic, receiver operating characteristic, (9 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)
