A Smoothed Analysis of the Greedy Algorithm for the Linear Contextual Bandit Problem
Sampath Kannan, Jamie H. Morgenstern, Aaron Roth, Bo Waggoner, Zhiwei Steven Wu
–Neural Information Processing Systems
We give a smoothed analysis, showing that even when contexts may be chosen by an adversary, small perturbations of the adversary's choices suffice for the algorithm to achieve "no regret", perhaps (depending on the specifics of the setting) with a constant amount of initial training data.
Neural Information Processing Systems
Nov-20-2025, 23:23:49 GMT
- Country:
- North America
- Canada > Quebec
- Montreal (0.04)
- United States
- Minnesota (0.04)
- Pennsylvania (0.04)
- Canada > Quebec
- North America
- Industry:
- Technology: