Reviews: Weighted Linear Bandits for Non-Stationary Environments
–Neural Information Processing Systems
Update (after reading the rebuttals): The authors' rebuttal has addressed my concerns about the novelty of the new self-normalized concentration result, since the key point is that the coefficient of the regularizer is changing. I appreciate this work. The idea of the paper is natural, but it does pose technical challenges, and the authors address them elegantly. So I think it deserves acceptance. Nevertheless, there are still many typos in the current version besides those listed before, for example, in Theorem 2, eq.
Jan-22-2025, 09:44:41 GMT