AITopics | ast

In this paper we introduce the pure exploration transductive linear bandit problem: given a set of measurement vectors $\mathcal{X}\subset \mathbb{R}^d$, a set of items $\mathcal{Z}\subset \mathbb{R}^d$, a fixed confidence $\delta$, and an unknown vector $\theta^{\ast}\in \mathbb{R}^d$, the goal is to infer $\arg\max_{z\in \mathcal{Z}} z^\top\theta^\ast$ with probability $1-\delta$ by making as few sequentially chosen noisy measurements of the form $x^\top\theta^{\ast}$ as possible. When $\mathcal{X}=\mathcal{Z}$, this setting generalizes linear bandits, and when $\mathcal{X}$ is the standard basis vectors and $\mathcal{Z}\subset \{0,1\}^d$, combinatorial bandits. The transductive setting naturally arises when the set of measurement vectors is limited due to factors such as availability or cost. As an example, in drug discovery the compounds and dosages $\mathcal{X}$ a practitioner may be willing to evaluate in the lab in vitro due to cost or safety reasons may differ vastly from those compounds and dosages $\mathcal{Z}$ that can be safely administered to patients in vivo. Alternatively, in recommender systems for books, the set of books $\mathcal{X}$ a user is queried about may be restricted to known best-sellers even though the goal might be to recommend more esoteric titles $\mathcal{Z}$. In this paper, we provide instance-dependent lower bounds for the transductive setting, an algorithm that matches these up to logarithmic factors, and an evaluation. In particular, we present the first non-asymptotic algorithm for linear bandits that nearly achieves the information-theoretic lower bound.

mathcal, sequential experimental design, transductive linear bandit, (8 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence (0.38)

Add feedback

Minimax Optimal Rate for Parameter Estimation in Multivariate Deviated Models

Neural Information Processing SystemsDec-25-2025, 14:48:00 GMT

The main challenges in deriving the convergence rate of the MLE mainly come from two issues: (1) The interaction between the function $h_{0}$ and the density function $f$; (2) The deviated proportion $\lambda^{\ast}$ can go to the extreme points of $[0,1]$ as the sample size tends to infinity. To address these challenges, we develop the \emph{distinguishability condition} to capture the linear independent relation between the function $h_{0}$ and the density function $f$. We then provide comprehensive convergence rates of the MLE via the vanishing rate of $\lambda^{\ast}$ to zero as well as the distinguishability of two functions $h_{0}$ and $f$.

ast, minimax optimal rate, parameter estimation, (6 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Machine Learning (0.59)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.37)

Add feedback

Filters

Collaborating Authors

ast

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

0aa800df4298539770b57824afc77a89-Paper-Conference.pdf

f19e6e04ed32735cb0e52bdfe6282673-Paper-Conference.pdf

e6b2b48b5ed90d07c305932729927781-Supplemental-Conference.pdf

e6b2b48b5ed90d07c305932729927781-Paper-Conference.pdf

77c87a15bbf0aad017c53995b832cf84-Paper-Conference.pdf

c7207c38b6e809a83d0688936a91c3b5-Paper-Conference.pdf

0aa800df4298539770b57824afc77a89-Supplemental-Conference.pdf

0aa800df4298539770b57824afc77a89-Paper-Conference.pdf

Sequential Experimental Design for Transductive Linear Bandits

Minimax Optimal Rate for Parameter Estimation in Multivariate Deviated Models