AITopics | Pas-de-Calais

Best of both worlds: Stochastic & adversarial best-arm identification

Abbasi-Yadkori, Yasin, Bartlett, Peter L., Gabillon, Victor, Malek, Alan, Valko, Michal

arXiv.org Machine Learning, Apr-17-2026

We study bandit best-arm identification with arbitrary and potentially adversarial rewards. A simple random uniform learner obtains the optimal rate of error in the adversarial scenario. However, this type of strategy is suboptimal when the rewards are sampled stochastically. Therefore, we ask: Can we design a learner that performs optimally in both the stochastic and adversarial problems while not being aware of the nature of the rewards? First, we show that designing such a learner is impossible in general. In particular, to be robust to adversarial rewards, we can only guarantee optimal rates of error on a subset of the stochastic problems. We give a lower bound that characterizes the optimal rate in stochastic problems if the strategy is constrained to be robust to adversarial rewards. Finally, we design a simple parameter-free algorithm and show that its probability of error matches (up to log factors) the lower bound in stochastic problems, and it is also robust to adversarial ones.
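The "simple random uniform learner" mentioned in the abstract can be illustrated with a minimal sketch. It assumes one plausible reading: pull an arm uniformly at random each round, keep an importance-weighted running total per arm, and recommend the arm with the largest total. The function name `uniform_best_arm`, the `reward_fn(t, arm)` callback, and the weighting by the number of arms are illustrative assumptions, not details taken from the paper.

```python
import random


def uniform_best_arm(reward_fn, n_arms, horizon, seed=0):
    """Recommend an arm after `horizon` rounds of uniform random pulls.

    reward_fn(t, arm) is a hypothetical callback returning the reward of
    `arm` at round `t`; it may be stochastic or chosen adversarially.
    """
    rng = random.Random(seed)
    totals = [0.0] * n_arms
    for t in range(horizon):
        arm = rng.randrange(n_arms)  # each arm is pulled with probability 1/n_arms
        reward = reward_fn(t, arm)
        # Dividing by the pull probability 1/n_arms (multiplying by n_arms)
        # makes each total an unbiased estimate of the arm's cumulative reward.
        totals[arm] += n_arms * reward
    # Ties are broken in favour of the lowest arm index.
    return max(range(n_arms), key=lambda k: totals[k])


if __name__ == "__main__":
    # Toy problem (assumed for illustration): arm 2 always pays 1, the others 0.
    best = uniform_best_arm(lambda t, arm: 1.0 if arm == 2 else 0.0,
                            n_arms=3, horizon=200)
    print("recommended arm:", best)
```

Because the sampling distribution never depends on past rewards, an adversary cannot steer the learner away from any arm. That same property is why the strategy cannot concentrate its pulls on the promising arms when the rewards are stochastic.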

bestofbothworld, data mining, machine learning, (19 more...)

2604.1486

Country:

Oceania > Australia (0.04)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
Europe > Germany > Saxony-Anhalt > Magdeburg (0.04)
Europe > France > Hauts-de-France > Pas-de-Calais (0.04)

Genre: Research Report (0.40)

Technology:

Information Technology > Data Science > Data Mining (0.68)
Information Technology > Artificial Intelligence > Machine Learning (0.46)

Covariance-adapting algorithm for semi-bandits with application to sparse rewards

Perrault, Pierre, Perchet, Vianney, Valko, Michal

arXiv.org Machine Learning, Apr-16-2026

We investigate stochastic combinatorial semi-bandits, where the entire joint distribution of outcomes impacts the complexity of the problem instance (unlike in the standard bandits). Typical distributions considered depend on specific parameter values, whose prior knowledge is required in theory but quite difficult to estimate in practice; an example is the commonly assumed sub-Gaussian family. We alleviate this issue by instead considering a new general family of sub-exponential distributions, which contains bounded and Gaussian ones. We prove a new lower bound on the expected regret on this family, that is parameterized by the unknown covariance matrix of outcomes, a tighter quantity than the sub-Gaussian matrix. We then construct an algorithm that uses covariance estimates, and provide a tight asymptotic analysis of the regret. Finally, we apply and extend our results to the family of sparse outcomes, which has applications in many recommender systems.
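As a rough illustration of what "covariance estimates" can mean under semi-bandit feedback, the sketch below keeps running sums for every pair of items observed in the same round and returns the plain empirical covariance (with a 1/n normalization). The class name `PairwiseCovariance` and its methods are hypothetical, and this is not the estimator or the confidence construction analyzed in the paper.

```python
from collections import defaultdict


class PairwiseCovariance:
    """Empirical covariance of item outcomes from partially observed rounds."""

    def __init__(self):
        self.count = defaultdict(int)      # rounds in which both items were observed
        self.sum_i = defaultdict(float)    # sum of outcomes of the smaller-indexed item
        self.sum_j = defaultdict(float)    # sum of outcomes of the larger-indexed item
        self.sum_ij = defaultdict(float)   # sum of products of the two outcomes

    def update(self, observed):
        """observed: dict mapping each played item to its outcome this round."""
        items = sorted(observed)
        for a, i in enumerate(items):
            for j in items[a:]:            # includes (i, i), which gives the variance
                key = (i, j)
                self.count[key] += 1
                self.sum_i[key] += observed[i]
                self.sum_j[key] += observed[j]
                self.sum_ij[key] += observed[i] * observed[j]

    def estimate(self, i, j):
        """Return the empirical covariance, or None if the pair was never seen."""
        key = (min(i, j), max(i, j))
        n = self.count.get(key, 0)
        if n == 0:
            return None
        mean_i = self.sum_i[key] / n
        mean_j = self.sum_j[key] / n
        return self.sum_ij[key] / n - mean_i * mean_j


if __name__ == "__main__":
    cov = PairwiseCovariance()
    cov.update({0: 1.0, 1: 2.0})
    cov.update({0: 3.0, 1: 6.0})
    print(cov.estimate(0, 1))
```

Only pairs that are played together contribute to an entry, so entries for rarely co-played items stay poorly estimated. An actual algorithm would need to account for that uncertainty when choosing actions.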

artificial intelligence, machine learning, (17 more...)

2604.13738

Country:

Europe > Spain > Canary Islands (0.04)
North America > United States > California (0.04)
Europe > Iceland > Capital Region > Reykjavik (0.04)
Europe > France > Hauts-de-France > Pas-de-Calais (0.04)

Genre: Research Report (0.71)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Optimistic optimization of a Brownian

Jean-Bastien Grill, Michal Valko, Remi Munos

Neural Information Processing Systems, Feb-14-2026, 04:59:00 GMT

Neural Information Processing Systems http://nips.cc/

algorithm, brownian motion, sample complexity, (15 more...)

Country:

North America > Canada (0.04)
Europe > Germany > Saxony-Anhalt > Magdeburg (0.04)
Europe > France > Île-de-France > Paris > Paris (0.04)
Europe > France > Hauts-de-France > Pas-de-Calais (0.04)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (0.46)
Information Technology > Artificial Intelligence > Machine Learning (0.46)

Regret Bounds for Learning State Representations in Reinforcement Learning

Ronald Ortner, Matteo Pirotta, Alessandro Lazaric, Ronan Fruit, Odalric-Ambrym Maillard

Neural Information Processing Systems, Feb-13-2026, 03:37:14 GMT

algorithm, markov model, representation, (16 more...)

Country:

North America > United States (0.04)
North America > Canada (0.04)
Europe > France > Hauts-de-France > Pas-de-Calais (0.04)
Europe > Austria > Styria > Leoben (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.66)

Learning Multiple Markov Chains via Adaptive Allocation

Mohammad Sadegh Talebi, Odalric-Ambrym Maillard

Neural Information Processing Systems, Feb-12-2026, 19:15:32 GMT

algorithm, ba-mc, markov chain, (14 more...)

Country:

North America > United States > Rhode Island > Providence County > Providence (0.04)
North America > Canada (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
(3 more...)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.71)

Planning in entropy-regularized Markov decision processes and games

Jean-Bastien Grill, Omar Darwiche Domingues, Pierre Menard, Remi Munos, Michal Valko

Neural Information Processing Systems, Feb-12-2026, 03:52:48 GMT

algorithm, sample complexity, value function, (14 more...)

Country:

North America > Canada (0.04)
Europe > Germany > Saxony-Anhalt > Magdeburg (0.04)
Europe > France > Hauts-de-France > Pas-de-Calais (0.04)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Planning & Scheduling (0.95)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.70)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.42)

4fe5149039b52765bde64beb9f674940-Paper.pdf

Neural Information Processing Systems, Feb-12-2026, 03:41:50 GMT

algorithm, learning, reinforcement learning, (16 more...)

Country:

North America > Canada > Quebec > Montreal (0.04)
Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)
Europe > France > Hauts-de-France > Pas-de-Calais (0.04)
Europe > France > Hauts-de-France > Nord > Lille (0.04)

Industry: Automobiles & Trucks (0.68)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.47)

3a4496776767aaa99f9804d0905fe584-Supplemental.pdf

Neural Information Processing Systems, Feb-8-2026, 03:24:44 GMT

clip ct, gaussian, log 2, (15 more...)

Country:

North America > United States > Georgia > Fulton County > Atlanta (0.04)
North America > United States > California > Los Angeles County > Long Beach (0.04)
North America > Canada (0.04)
(2 more...)

Genre: Research Report (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Communications (0.67)
Information Technology > Data Science > Data Mining > Big Data (0.48)

3a4496776767aaa99f9804d0905fe584-Paper.pdf

Neural Information Processing Systems, Feb-8-2026, 03:17:45 GMT

clip ct, gaussian, log 2, (14 more...)

Country:

North America > United States > New York > New York County > New York City (0.04)
North America > United States > Georgia > Fulton County > Atlanta (0.04)
North America > United States > California > Los Angeles County > Long Beach (0.04)
(3 more...)

Genre: Research Report (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Data Science (0.69)
Information Technology > Communications (0.68)

211b39255232ab59ce78f2e28cd0292b-Supplemental.pdf

Neural Information Processing Systems, Feb-7-2026, 18:54:09 GMT

objective, sequence, vehicle, (16 more...)

Country:

Europe > France > Hauts-de-France > Nord > Lille (0.04)
North America > Canada (0.04)
Europe > Sweden > Stockholm > Stockholm (0.04)
(4 more...)

Industry:

Energy (0.47)
Health & Medicine (0.46)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Planning & Scheduling (0.93)