AITopics | contextual linear bandit

Collaborating Authors

contextual linear bandit

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

ASimple and Provably Efficient Algorithm for Asynchronous Federated Contextual Linear Bandits

Neural Information Processing SystemsApr-25-2026, 00:28:57 GMT

We study federated contextual linear bandits, where M agents cooperate with each other to solve a global contextual linear bandit problem with the help of a central server. We consider the asynchronous setting, where all agents work independently and the communication between one agent and the server will not trigger other agents' communication. We propose a simple algorithm named FedLinUCBbased on the principle of optimism.

bandit, data mining, machine learning, (18 more...)

Neural Information Processing Systems

Country: North America > United States > California > Los Angeles County > Los Angeles (0.28)

Genre: Research Report (0.68)

Industry: Health & Medicine (0.46)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Communications (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

b1bdb0f22c9748203c62f29aa297ac57-Paper-Conference.pdf

Neural Information Processing SystemsFeb-16-2026, 15:03:36 GMT

artificial intelligence, data mining, machine learning, (17 more...)

Neural Information Processing Systems

Country:

North America > United States > California > Los Angeles County > Los Angeles (0.14)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Asia > Middle East > Qatar > Ad-Dawhah > Doha (0.04)

Genre: Research Report (0.46)

Industry: Health & Medicine (0.68)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.68)
Information Technology > Data Science > Data Mining > Big Data (0.47)

Add feedback

On the Minimax Regret for Contextual Linear Bandits and Multi-Armed Bandits with Expert Advice

Neural Information Processing SystemsFeb-15-2026, 19:20:19 GMT

This paper considers problems of multi-armed bandits with expert advice (MwE) [Auer et al.,

artificial intelligence, data mining, machine learning, (19 more...)

Neural Information Processing Systems

Country:

Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Genre: Research Report > Experimental Study (0.93)

Technology:

Information Technology > Data Science > Data Mining > Big Data (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Efficient Batched Algorithm for Contextual Linear Bandits with Large Action Space via Soft Elimination

Neural Information Processing SystemsDec-26-2025, 14:03:45 GMT

In this paper, we provide the first efficient batched algorithm for contextual linear bandits with large action spaces. Unlike existing batched algorithms that rely on action elimination, which are not implementable for large action sets, our algorithm only uses a linear optimization oracle over the action set to design the policy. The proposed algorithm achieves a regret upper bound $\tilde{O}(\sqrt{T})$ with high probability, and uses $O(\log\log T)$ batches, matching the lower bound on the number of batches (Gao et al., 2019). When specialized to linear bandits, our algorithm can achieve a high probability gap-dependent regret bound of $\tilde{O}(1/\Delta_{\min})$ with the optimal $\log T$ number of batches, where $\Delta_{\min}$ is the minimum reward gap between a suboptimal arm and the optimal. Our result is achieved via a novel soft elimination approach, that entails $\text{``}$shaping$\text{}$ the action sets at each batch so that we can efficiently identify (near) optimal actions.

action space, contextual linear bandit, efficient batched algorithm, (5 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.40)

Add feedback

Learning from Distributed Users in Contextual Linear Bandits Without Sharing the Context

Neural Information Processing SystemsDec-24-2025, 03:28:38 GMT

Contextual linear bandits is a rich and theoretically important model that has many practical applications. Recently, this setup gained a lot of interest in applications over wireless where communication constraints can be a performance bottleneck, especially when the contexts come from a large $d$-dimensional space. In this paper, we consider the distributed contextual linear bandit learning problem, where the agents who observe the contexts and take actions are geographically separated from the learner who performs the learning while not seeing the contexts. We assume that contexts are generated from a distribution and propose a method that uses $\approx 5d$ bits per context for the case of unknown context distribution and $0$ bits per context if the context distribution is known, while achieving nearly the same regret bound as if the contexts were directly observable. The former bound improves upon existing bounds by a $\log(T)$ factor, where $T$ is the length of the horizon, while the latter achieves information theoretical tightness.

contextual linear bandit, learning, name change, (5 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.78)

Add feedback

An Asymptotically Optimal Primal-Dual Incremental Algorithm for Contextual Linear Bandits

Neural Information Processing SystemsDec-23-2025, 18:28:19 GMT

In the contextual linear bandit setting, algorithms built on the optimism principle fail to exploit the structure of the problem and have been shown to be asymptotically suboptimal. In this paper, we follow recent approaches of deriving asymptotically optimal algorithms from problem-dependent regret lower bounds and we introduce a novel algorithm improving over the state-of-the-art along multiple dimensions. We build on a reformulation of the lower bound, where context distribution and exploration policy are decoupled, and we obtain an algorithm robust to unbalanced context distributions. Then, using an incremental primal-dual approach to solve the Lagrangian relaxation of the lower bound, we obtain a scalable and computationally efficient algorithm. Finally, we remove forced exploration and build on confidence intervals of the optimization problem to encourage a minimum level of exploration that is better adapted to the problem structure. We demonstrate the asymptotic optimality of our algorithm, while providing both problem-dependent and worst-case finite-time regret guarantees. Our bounds scale with the logarithm of the number of arms, thus avoiding the linear dependence common in all related prior works. Notably, we establish minimax optimality for any learning horizon in the special case of non-contextual linear bandits. Finally, we verify that our algorithm obtains better empirical performance than state-of-the-art baselines.

asymptotically optimal primal-dual incremental algorithm, contextual linear bandit, name change, (5 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning (0.59)

Add feedback

On the Minimax Regret for Contextual Linear Bandits and Multi-Armed Bandits with Expert Advice

Neural Information Processing SystemsOct-10-2025, 05:53:00 GMT

This paper considers problems of multi-armed bandits with expert advice (MwE) [Auer et al.,

algorithm, bandit, expert advice, (15 more...)

Neural Information Processing Systems

Country:

Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Genre: Research Report > Experimental Study (0.93)

Technology:

Information Technology > Data Science > Data Mining > Big Data (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

b1bdb0f22c9748203c62f29aa297ac57-Paper-Conference.pdf

Neural Information Processing SystemsOct-9-2025, 05:06:32 GMT

artificial intelligence, data mining, machine learning, (17 more...)

Neural Information Processing Systems

Country:

North America > United States > California > Los Angeles County > Los Angeles (0.14)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Asia > Middle East > Qatar > Ad-Dawhah > Doha (0.04)

Genre: Research Report (0.46)

Industry: Health & Medicine (0.68)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.68)
Information Technology > Data Science > Data Mining > Big Data (0.47)

Add feedback

Learning from Distributed Users in Contextual Linear Bandits Without Sharing the Context

Neural Information Processing SystemsAug-14-2025, 14:50:01 GMT

In this paper, we develop algorithms that support the deployment of contextual linear bandits in distributed settings.

agent, algorithm, central learner, (13 more...)

Neural Information Processing Systems

Country:

North America > United States > California > Los Angeles County > Los Angeles (0.14)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Research Report (0.46)

Industry:

Education (0.46)
Health & Medicine (0.46)

Technology:

Information Technology > Communications > Networks (0.93)
Information Technology > Data Science > Data Mining > Big Data (0.47)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.46)
Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (0.46)

Add feedback

Filters

Collaborating Authors

contextual linear bandit

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

ASimple and Provably Efficient Algorithm for Asynchronous Federated Contextual Linear Bandits

b1bdb0f22c9748203c62f29aa297ac57-Paper-Conference.pdf

On the Minimax Regret for Contextual Linear Bandits and Multi-Armed Bandits with Expert Advice

Learningfrom Distributed Usersin Contextual Linear Bandits Without Sharingthe Context

Efficient Batched Algorithm for Contextual Linear Bandits with Large Action Space via Soft Elimination

Learning from Distributed Users in Contextual Linear Bandits Without Sharing the Context

An Asymptotically Optimal Primal-Dual Incremental Algorithm for Contextual Linear Bandits

On the Minimax Regret for Contextual Linear Bandits and Multi-Armed Bandits with Expert Advice

b1bdb0f22c9748203c62f29aa297ac57-Paper-Conference.pdf

Learning from Distributed Users in Contextual Linear Bandits Without Sharing the Context