Copeland Dueling Bandits

Masrour Zoghi, Zohar S. Karnin, Shimon Whiteson, Maarten de Rijke

Oct-2-2025, 11:26:35 GMT–Neural Information Processing Systems

A version of the dueling bandit problem is addressed in which a Condorcet winner may not exist. Two algorithms are proposed that instead seek to minimize regret with respect to the Copeland winner, which, unlike the Condorcet winner, is guaranteed to exist. The first, Copeland Confidence Bound (CCB), is designed for small numbers of arms, while the second, Scalable Copeland Bandits (SCB), works better for large-scale problems. We provide theoretical results bounding the regret accumulated by CCB and SCB, both substantially improving existing results.

data mining, information retrieval, machine learning, (20 more...)

Neural Information Processing Systems

Oct-2-2025, 11:26:35 GMT

Conferences PDF

Add feedback

Country:
- Europe > United Kingdom > England (0.28)

Genre:
- Research Report (0.68)

Technology:
- Information Technology
  - Data Science > Data Mining
    - Big Data (0.54)
  - Artificial Intelligence
    - Machine Learning (0.70)
    - Natural Language > Information Retrieval (0.47)
    - Representation & Reasoning (0.46)

Duplicate Docs Excel Report

Title
Copeland Dueling Bandits Zohar Karnin Informatics Institute
Copeland Dueling Bandits

Similar Docs Excel Report more

Title	Similarity	Source
None found