Federated Linear Bandits with Finite Adversarial Actions
–Neural Information Processing Systems
We study a federated linear bandits model, where M clients communicate with a central server to solve a linear contextual bandits problem with finite adversarial action sets that may be different across clients. To address the unique challenges of **adversarial finite** action sets, we propose the FedSupLinUCB algorithm, which extends the principles of SupLinUCB and OFUL algorithms in linear contextual bandits. We prove that FedSupLinUCB achieves a total regret of \tilde{O}(\sqrt{d T}), where T is the total number of arm pulls from all clients, and d is the ambient dimension of the linear model. This matches the minimax lower bound and thus is order-optimal (up to polylog terms). We study both asynchronous and synchronous cases and show that the communication cost can be controlled as O(d M 2 \log(d)\log(T)) and O(\sqrt{d 3 M 3} \log(d)), respectively.
Neural Information Processing Systems
Jan-19-2025, 21:35:51 GMT
- Technology: