Fast Asymptotically Optimal Algorithms for Non-Parametric Stochastic Bandits

Feb-9-2026, 02:49:37 GMT–Neural Information Processing Systems

We consider the problem of regret minimization in non-parametric stochastic bandits. When the rewards are known to be bounded from above, there exist asymptotically optimal algorithms, whose asymptotic regret depends on an infimum of Kullback-Leibler (KL) divergences.
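
For context, the KL infimum mentioned in the abstract is usually written as below. This is the standard Burnetas-Katehakis-style formulation from the bandit literature, not text from this listing. The symbols are notation assumed for illustration: $\mathcal{F}$ is the class of reward distributions bounded from above, $\nu_a$ is the reward distribution of arm $a$ with mean $\mu_a$, $\mu^\star$ is the largest mean, and $R_T$ is the regret after $T$ rounds.

```latex
% Assumed notation (not from the listing): \mathcal{F}, \nu_a, \mu_a, \mu^\star, R_T
\[
  \mathcal{K}_{\inf}(\nu, \mu)
  \;=\;
  \inf\bigl\{ \mathrm{KL}(\nu, \nu') \;:\; \nu' \in \mathcal{F},\ \mathbb{E}_{X \sim \nu'}[X] > \mu \bigr\}
\]
% Lower bound for any uniformly efficient algorithm; an asymptotically
% optimal algorithm attains it with equality in the limit.
\[
  \liminf_{T \to \infty} \frac{R_T}{\log T}
  \;\ge\;
  \sum_{a \,:\, \mu_a < \mu^\star} \frac{\mu^\star - \mu_a}{\mathcal{K}_{\inf}(\nu_a, \mu^\star)}
\]
```

Read this way, "asymptotically optimal" means the regret grows like $\log T$ with exactly the constant on the right-hand side.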

artificial intelligence, data mining, machine learning

Country:
- North America > United States
  - Colorado > Boulder County > Boulder (0.04)
- Europe
  - United Kingdom > England
    - Cambridgeshire > Cambridge (0.04)
  - France > Hauts-de-France
    - Nord > Lille (0.04)

Genre:
- Research Report > New Finding (0.67)

Industry:
- Food & Agriculture > Agriculture (1.00)

Technology:
- Information Technology
  - Data Science > Data Mining
    - Big Data (0.48)
  - Artificial Intelligence
    - Machine Learning > Statistical Learning (0.60)
    - Representation & Reasoning > Optimization (0.46)
