Parallel Contextual Bandits in Wireless Handover Optimization

Colin, Igor, Thomas, Albert, Draief, Moez

Jan-21-2019–arXiv.org Machine Learning

Abstract--As cellular networks become denser, a scalable and dynamic tuning of wireless base station parameters can only be achieved through automated optimization. Although the contextual banditframework arises as a natural candidate for such a task, its extension to a parallel setting is not straightforward: one needs to carefully adapt existing methods to fully leverage the multi-agent structure of this problem. We propose two approaches: one derived from a deterministic UCB-like method and the other relying on Thompson sampling. Thanks to its bayesian nature, the latter is intuited to better preserve the exploration-exploitation balance in the bandit batch. This is verified on toy experiments, where Thompson sampling shows robustness to the variability of the contexts. Finally, we apply both methods on a real base station network dataset and evidence that Thompson sampling outperforms both manual tuning and contextual UCB. I. INTRODUCTION The land area covered by a cellular wireless network, such as a mobile phone network, is divided into small areas called cells, each cell being covered by the antenna of a fixed base station (see Figure 1).

bandit, télécommunications, upstream oil & gas, (20 more...)

arXiv.org Machine Learning

Jan-21-2019

arXiv.org PDF

Add feedback

Genre:
- Research Report (0.64)

Industry:
- Energy > Oil & Gas
  - Upstream (0.34)
- Telecommunications (1.00)

Technology:
- Information Technology
  - Artificial Intelligence
    - Machine Learning > Statistical Learning (0.68)
    - Representation & Reasoning (0.88)
  - Communications > Networks (0.66)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found