Bandit optimisation of functions in the Mat\'ern kernel RKHS
Janz, David, Burt, David R., González, Javier
We consider the problem of optimising functions in the Reproducing kernel Hilbert space (RKHS) of a Mat\'ern family kernel with parameter $\nu$ over the domain $[0,1]^d$ under noisy bandit feedback. Our contribution, the $\pi$-GP-UCB algorithm, is the first practical approach with guaranteed sublinear regret for all $\nu>1$ and $d \geq 1$. Empirical validation suggests better performance and drastically improved computational scalablity compared with its predecessor, Improved GP-UCB.
Jan-28-2020
- Country:
- Europe
- United Kingdom > England
- Cambridgeshire > Cambridge (0.14)
- Italy > Sicily
- Palermo (0.04)
- United Kingdom > England
- Europe
- Genre:
- Research Report (0.82)
- Technology: