AITopics | open problem

Hardness of Online Sleeping Combinatorial Optimization Problems

Neural Information Processing SystemsMar-17-2026, 06:58:27 GMT

We show that several online combinatorial optimization problems that admit efficient no-regret algorithms become computationally hard in the sleeping setting where a subset of actions becomes unavailable in each round. Specifically, we show that the sleeping versions of these problems are at least as hard as PAC learning DNF expressions, a long standing open problem. We show hardness for the sleeping versions of Online Shortest Paths, Online Minimum Spanning Tree, Online k-Subsets, Online k-Truncated Permutations, Online Minimum Cut, and Online Bipartite Matching. The hardness result for the sleeping version of the Online Shortest Paths problem resolves an open problem presented at COLT 2015 [Koolen et al., 2015].

artificial intelligence, name change, proceedings, (3 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning (0.71)

Add feedback

91a5742235f70ae846436d9780e9f1d4-Paper-Conference.pdf

Neural Information Processing SystemsFeb-16-2026, 16:11:59 GMT

algorithm, experiment, oost, (17 more...)

Neural Information Processing Systems

Country:

North America > United States (0.14)
Asia > China > Hubei Province > Wuhan (0.04)

Genre: Research Report > Experimental Study (0.93)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language (0.68)

Add feedback

On the Optimality of Perturbations in Stochastic and Adversarial Multi-armed Bandit Problems

Baekjin Kim, Ambuj Tewari

Neural Information Processing SystemsFeb-13-2026, 15:16:31 GMT

Neural Information Processing Systems http://nips.cc/

algorithm, bandit, perturbation, (10 more...)

Neural Information Processing Systems

Country:

North America > United States > Michigan > Washtenaw County > Ann Arbor (0.14)
North America > Canada (0.04)

Technology:

Information Technology > Data Science > Data Mining > Big Data (1.00)
Information Technology > Artificial Intelligence > Machine Learning (0.97)

Add feedback

Country:

North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

Pareto_frontier_for_model_selection_in_Contextual_bandits_full

Neural Information Processing SystemsFeb-10-2026, 01:33:52 GMT

algorithm, bandit, bandit problem, (15 more...)

Neural Information Processing Systems

Technology:

Information Technology > Data Science > Data Mining > Big Data (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.65)

Add feedback

768e78024aa8fdb9b8fe87be86f64745-AuthorFeedback.pdf

Neural Information Processing SystemsFeb-9-2026, 00:07:58 GMT

artificial intelligence, machine learning, predictor, (18 more...)

Neural Information Processing Systems

Genre: Research Report (0.31)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.31)

Add feedback

2172fde49301047270b2897085e4319d-AuthorFeedback.pdf

Neural Information Processing SystemsFeb-7-2026, 19:03:52 GMT

finite size effect, threshold state, transition, (10 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.32)

Add feedback

Performance Bounds for Policy-Based Average Reward Reinforcement Learning Algorithms

Neural Information Processing SystemsFeb-5-2026, 16:00:52 GMT

Many policy-based reinforcement learning (RL) algorithms can be viewed as instantiations of approximate policy iteration (PI), i.e., where policy improvement and policy evaluation are both performed approximately. In applications where the average reward objective is the meaningful performance metric, often discounted reward formulations are used with the discount factor being close to $1,$ which is equivalent to making the expected horizon very large.

artificial intelligence, average reward reinforcement learning algorithm, machine learning, (5 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Add feedback

Collaborating Authors

open problem

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

Hardness of Online Sleeping Combinatorial Optimization Problems

91a5742235f70ae846436d9780e9f1d4-Paper-Conference.pdf

On the Optimality of Perturbations in Stochastic and Adversarial Multi-armed Bandit Problems

635440afdfc39fe37995fed127d7df4f-AuthorFeedback.pdf

60ad83801910ec976590f69f638e0d6d-AuthorFeedback.pdf

e1fe6165cad3f7f3f57d409f78e4415f-Paper.pdf

Pareto_frontier_for_model_selection_in_Contextual_bandits_full

768e78024aa8fdb9b8fe87be86f64745-AuthorFeedback.pdf

2172fde49301047270b2897085e4319d-AuthorFeedback.pdf

Performance Bounds for Policy-Based Average Reward Reinforcement Learning Algorithms