Efficient Prior Selection in Gaussian Process Bandits with Thompson Sampling

Sandberg, Jack, Chehreghani, Morteza Haghir

Feb-3-2025–arXiv.org Machine Learning

Gaussian process (GP) bandits provide a powerful framework for solving blackbox optimization of unknown functions. The characteristics of the unknown function depends heavily on the assumed GP prior. Most work in the literature assume that this prior is known but in practice this seldom holds. Instead, practitioners often rely on maximum likelihood estimation to select the hyperparameters of the prior - which lacks theoretical guarantees. In this work, we propose two algorithms for joint prior selection and regret minimization in GP bandits based on GP Thompson sampling (GP-TS): Prior-Elimination GP-TS (PE-GP-TS) and HyperPrior GP-TS (HP-GP-TS). We theoretically analyze the algorithms and establish upper bounds for their respective regret. In addition, we demonstrate the effectiveness of our algorithms compared to the alternatives through experiments with synthetic and real-world data.

data mining, experiment, machine learning, (19 more...)

arXiv.org Machine Learning

Feb-3-2025

arXiv.org PDF

Add feedback

Country:
- North America > United States
  - California (0.14)
  - New York > New York County
    - New York City (0.04)
  - Massachusetts > Middlesex County
    - Cambridge (0.14)
- Europe > Sweden
  - Vaestra Goetaland > Gothenburg (0.04)
- Asia > Russia
  - Siberian Federal District > Novosibirsk Oblast > Novosibirsk (0.04)

Genre:
- Research Report (0.64)

Industry:
- Transportation (0.46)
- Government (0.46)

Technology:
- Information Technology
  - Data Science > Data Mining
    - Big Data (0.46)
  - Artificial Intelligence
    - Representation & Reasoning > Uncertainty
      - Bayesian Inference (0.68)
    - Machine Learning > Learning Graphical Models
      - Directed Networks > Bayesian Learning (0.68)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found