A Further Related Work

Aug-17-2025, 10:40:52 GMT–Neural Information Processing Systems

The "dueling bandits" problem, initially proposed as a model for similar recommendation systems ... A number of works in recent years explore online problems where an agent responds to the decision-maker's actions, influencing their reward. The "revealed preferences" literature involves a similar requirement of learning a mapping ... Some recent work has begun to explore the problem of designing optimal strategies in a repeated game against agents who adapt their strategies over time using a no-regret algorithm. ... As such, the empirical probability of b must be close to 1/2. We make use of a lemma from [2], which we restate here. Lemma 8. Consider two vectors ... We prove local learnability results for each case.

probability, query, vector

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning (0.93)
  - Representation & Reasoning > Personal Assistant Systems (0.48)
