Federated Linear Bandits with Finite Adversarial Actions

Jan-19-2025, 21:35:51 GMT–Neural Information Processing Systems

We study a federated linear bandits model, where M clients communicate with a central server to solve a linear contextual bandits problem with finite adversarial action sets that may be different across clients. To address the unique challenges of **adversarial finite** action sets, we propose the FedSupLinUCB algorithm, which extends the principles of SupLinUCB and OFUL algorithms in linear contextual bandits. We prove that FedSupLinUCB achieves a total regret of \tilde{O}(\sqrt{d T}), where T is the total number of arm pulls from all clients, and d is the ambient dimension of the linear model. This matches the minimax lower bound and thus is order-optimal (up to polylog terms). We study both asynchronous and synchronous cases and show that the communication cost can be controlled as O(d M 2 \log(d)\log(T)) and O(\sqrt{d 3 M 3} \log(d)), respectively.

artificial intelligence, finite adversarial action, machine learning, (5 more...)

Neural Information Processing Systems

Jan-19-2025, 21:35:51 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence > Machine Learning (0.41)