On Information Gain and Regret Bounds in Gaussian Process Bandits

Vakili, Sattar, Khezeli, Kia, Picheny, Victor

Oct-9-2020–arXiv.org Machine Learning

Consider the sequential optimization of an expensive to evaluate and possibly non-convex objective function $f$ from noisy feedback, that can be considered as a continuum-armed bandit problem. Upper bounds on the regret performance of several learning algorithms (GP-UCB, GP-TS, and their variants) are known under both a Bayesian (when $f$ is a sample from a Gaussian process (GP)) and a frequentist (when $f$ lives in a reproducing kernel Hilbert space) setting. The regret bounds often rely on the maximal information gain $\gamma_T$ between $T$ observations and the underlying GP (surrogate) model. We provide general bounds on $\gamma_T$ based on the decay rate of the eigenvalues of the GP kernel, whose specialisation for commonly used kernels, improves the existing bounds on $\gamma_T$, and consequently the regret bounds relying on $\gamma_T$ under numerous settings. For the Mat\'ern family of kernels, where the lower bounds on $\gamma_T$, and regret under the frequentist setting, are known, our results close a huge polynomial in $T$ gap between the upper and lower bounds (up to logarithmic in $T$ factors).

artificial intelligence, data mining, machine learning, (19 more...)

arXiv.org Machine Learning

Oct-9-2020

arXiv.org PDF

Add feedback

Country:
- North America > United States
  - Virginia > Arlington County
    - Arlington (0.04)
  - Minnesota > Olmsted County
    - Rochester (0.04)
  - California > Los Angeles County
    - Long Beach (0.04)
- Europe
  - United Kingdom > England
    - Cambridgeshire > Cambridge (0.04)
    - Oxfordshire > Oxford (0.04)
  - Sweden > Stockholm
    - Stockholm (0.04)
  - Netherlands > North Holland
    - Amsterdam (0.04)
- Asia > Japan
  - Honshū > Kantō > Kanagawa Prefecture (0.04)

Genre:
- Research Report (0.84)

Industry:
- Health & Medicine (0.36)

Technology:
- Information Technology
  - Artificial Intelligence > Machine Learning (1.00)
  - Data Science > Data Mining
    - Big Data (0.34)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found