regt
Country:
- North America > United States (0.04)
- North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)
- Europe > Sweden > Stockholm > Stockholm (0.04)
- Asia > Middle East > Israel > Tel Aviv District > Tel Aviv (0.04)
ABest-of-Both-WorldsAlgorithmforBanditswith DelayedFeedback
We present a modified tuning of the algorithm of Zimmert and Seldin [2020] for adversarial multiarmed bandits with delayed feedback, which in addition to the minimax optimal adversarial regret guarantee shown by Zimmert and Seldin simultaneously achieves a near-optimal regret guarantee in the stochastic setting with fixed delays.
Technology:
- Information Technology > Artificial Intelligence (0.67)
- Information Technology > Data Science (0.46)