AITopics | bwk problem

Non-stationary Bandits with Knapsacks

Neural Information Processing Systems, Feb-9-2026, 13:56:14 GMT

We employ both non-stationarity measures to derive upper and lower bounds for the problem.

constraint, data mining, machine learning, (20 more...)


Technology:

Information Technology > Artificial Intelligence > Machine Learning (0.70)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.70)
Information Technology > Data Science > Data Mining > Big Data (0.30)


Non-stationary Bandits with Knapsacks

Neural Information Processing Systems, Dec-24-2025, 09:36:31 GMT

In this paper, we study the problem of bandits with knapsacks (BwK) in a non-stationary environment. The BwK problem generalizes the multi-armed bandit (MAB) problem to model the resource consumption associated with playing each arm. At each time step, the decision maker chooses an arm to play, receives a reward, and consumes a certain amount of each of multiple resource types. The objective is to maximize the cumulative reward over a finite horizon subject to knapsack constraints on the resources. Existing work studies the BwK problem in either a stochastic or an adversarial environment.

name change, non-stationarity measure, non-stationary bandit, (6 more...)


Technology: Information Technology > Artificial Intelligence (0.36)


Real-Time Bidding with Side Information

Arthur Flajolet, Patrick Jaillet

Neural Information Processing Systems, Nov-21-2025, 04:27:10 GMT

Neural Information Processing Systems http://nips.cc/

algorithm, artificial intelligence, machine learning, (16 more...)


Country:

North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
North America > United States > California > Los Angeles County > Long Beach (0.04)

Industry: Information Technology > Services (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)


69469da823348084ca8933368ecbf676-Paper-Conference.pdf

Neural Information Processing Systems, Aug-15-2025, 13:33:30 GMT

bwk problem, constraint, resource consumption, (16 more...)


Technology:

Information Technology > Artificial Intelligence > Machine Learning (0.97)
Information Technology > Artificial Intelligence > Representation & Reasoning > Constraint-Based Reasoning (0.47)


Non-stationary Bandits with Knapsacks

Neural Information Processing Systems, Oct-11-2024, 10:58:21 GMT

In this paper, we study the problem of bandits with knapsacks (BwK) in a non-stationary environment. The BwK problem generalizes the multi-armed bandit (MAB) problem to model the resource consumption associated with playing each arm. At each time step, the decision maker chooses an arm to play, receives a reward, and consumes a certain amount of each of multiple resource types. The objective is to maximize the cumulative reward over a finite horizon subject to knapsack constraints on the resources. Existing work studies the BwK problem in either a stochastic or an adversarial environment.

non-stationarity measure, non-stationary bandit, non-stationary environment, (3 more...)


Genre: Play (0.42)

Technology: Information Technology > Artificial Intelligence (0.40)


Real-Time Bidding with Side Information

Arthur Flajolet, Patrick Jaillet

Neural Information Processing Systems, Oct-2-2024, 17:37:03 GMT

Neural Information Processing Systems http://nips.cc/

advertiser, algorithm, auction, (14 more...)


Country:

North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
North America > United States > California > Los Angeles County > Long Beach (0.04)

Industry: Information Technology > Services (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)


High-dimensional Linear Bandits with Knapsacks

Wanteng Ma, Dong Xia, Jiashuo Jiang

arXiv.org Machine Learning, Nov-2-2023

We study the contextual bandits with knapsacks (CBwK) problem in the high-dimensional setting, where the dimension of the feature vector is large. The reward of pulling each arm equals the inner product of a sparse high-dimensional weight vector and the feature of the current arrival, plus random noise. In this paper, we investigate how to exploit this sparsity structure to achieve improved regret for the CBwK problem. To this end, we first develop an online variant of the hard thresholding algorithm that performs sparse estimation in an online manner. We then combine our online estimator with a primal-dual framework, in which we assign a dual variable to each knapsack constraint and use an online learning algorithm to update the dual variables, thereby controlling the consumption of the knapsack capacity. We show that this integrated approach achieves a sublinear regret that depends logarithmically on the feature dimension, improving on the polynomial dependency established in the previous literature. We also apply our framework to the high-dimensional contextual bandit problem without knapsack constraints and achieve optimal regret in both the data-poor and the data-rich regimes. Finally, we conduct numerical experiments to demonstrate the empirical performance of our algorithms in the high-dimensional setting.
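The two ingredients named in the abstract above, hard thresholding for sparse estimation and an online update of one dual variable per knapsack constraint, can be illustrated with a minimal Python sketch. This is not the authors' algorithm: the function names, the projected-gradient form of the dual update, the step size, and the cap `lam_max` are all assumptions made for illustration.

```python
import numpy as np

def hard_threshold(w, s):
    """Keep the s largest-magnitude entries of w and zero out the rest."""
    w = np.asarray(w, dtype=float)
    if s <= 0:
        return np.zeros_like(w)
    if s >= w.size:
        return w.copy()
    # Indices of the s entries with the largest absolute value.
    keep = np.argpartition(np.abs(w), -s)[-s:]
    out = np.zeros_like(w)
    out[keep] = w[keep]
    return out

def dual_step(lam, consumption, budget_rate, step, lam_max):
    """One projected-gradient update of the dual variables (assumed form).

    lam         : current dual variable per resource
    consumption : resource consumed in this round, per resource
    budget_rate : per-round budget (total budget / horizon)
    The dual variable rises when a resource is overspent relative to
    budget_rate and falls otherwise; it is clipped to [0, lam_max].
    """
    lam = np.asarray(lam, dtype=float)
    grad = np.asarray(consumption, dtype=float) - budget_rate
    return np.clip(lam + step * grad, 0.0, lam_max)

# Hypothetical usage: sparsify an estimate, then update two dual variables.
w_sparse = hard_threshold([0.1, -3.0, 0.5, 2.0], s=2)
lam_next = dual_step([0.0, 1.0], [1.0, 0.0], budget_rate=0.5, step=0.1, lam_max=10.0)
```

In a scheme of this kind, the sparsified estimate would feed the reward prediction, and each arm's predicted reward would be penalized by the dual-weighted consumption before an arm is chosen; that selection rule is omitted here because the abstract does not specify it.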

artificial intelligence, data mining, machine learning, (19 more...)


2311.01327

Country:

Asia > China > Hong Kong (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Genre: Research Report (0.82)

Industry: Education > Educational Setting > Online (0.34)

Technology:

Information Technology > Data Science > Data Mining > Big Data (0.67)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.46)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.46)


Non-stationary Bandits with Knapsacks

Shang Liu, Jiashuo Jiang, Xiaocheng Li

arXiv.org Artificial Intelligence, Oct-12-2022

In this paper, we study the problem of bandits with knapsacks (BwK) in a non-stationary environment. The BwK problem generalizes the multi-armed bandit (MAB) problem to model the resource consumption associated with playing each arm. At each time step, the decision maker chooses an arm to play, receives a reward, and consumes a certain amount of each of multiple resource types. The objective is to maximize the cumulative reward over a finite horizon subject to knapsack constraints on the resources. Existing work studies the BwK problem in either a stochastic or an adversarial environment. Our paper considers a non-stationary environment that continuously interpolates between these two extremes. We first show that, because of the constraints, the traditional notion of variation budget is insufficient to characterize the non-stationarity of the BwK problem in a way that permits sublinear regret, and we then propose a new notion of global non-stationarity measure. We employ both non-stationarity measures to derive upper and lower bounds for the problem. Our results are based on a primal-dual analysis of the underlying linear programs and highlight the interplay between the constraints and the non-stationarity. Finally, we extend the non-stationarity measure to the problem of online convex optimization with constraints and obtain new regret bounds accordingly.
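The interaction protocol described in the abstract above (choose an arm, collect a reward, consume resources, respect the budgets over a finite horizon) can be sketched in a few lines of Python. This is only an illustrative simulator, not code from the paper: the name `run_bwk`, the callback signatures, and the convention of stopping before any pull that would overdraw a resource are assumptions. Non-stationarity is represented simply by passing the time step `t` to the reward and cost callbacks.

```python
import random

def run_bwk(policy, reward_fn, cost_fn, budgets, horizon, seed=0):
    """Simulate one BwK episode under an assumed stopping convention.

    policy(t, remaining, rng) -> arm index
    reward_fn(arm, t, rng)    -> reward (may depend on t: non-stationary)
    cost_fn(arm, t, rng)      -> list of consumptions, one per resource
    The episode ends at the horizon or before the first pull that would
    overdraw any resource.
    """
    rng = random.Random(seed)
    remaining = list(budgets)
    total_reward = 0.0
    for t in range(horizon):
        arm = policy(t, remaining, rng)
        costs = cost_fn(arm, t, rng)
        # Knapsack constraint: refuse a pull that exceeds a remaining budget.
        if any(c > r for c, r in zip(costs, remaining)):
            break
        remaining = [r - c for r, c in zip(remaining, costs)]
        total_reward += reward_fn(arm, t, rng)
    return total_reward, remaining

# Hypothetical usage: one resource with budget 3, and a single arm that
# always costs 1 and pays 1, so the episode ends after three pulls.
reward, left = run_bwk(
    policy=lambda t, remaining, rng: 0,
    reward_fn=lambda arm, t, rng: 1.0,
    cost_fn=lambda arm, t, rng: [1.0],
    budgets=[3.0],
    horizon=10,
)
```

A learning algorithm would replace the fixed `policy` callback; the simulator itself makes no claim about which policy achieves the regret bounds discussed in the abstract.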

artificial intelligence, constraint, machine learning, (18 more...)


2205.12427

Genre: Research Report (0.70)

Industry: Education (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Constraint-Based Reasoning (0.68)

