Gaussian Process Upper Confidence Bound Achieves Nearly-Optimal Regret in Noise-Free Gaussian Process Bandits

Jun-17-2026, 16:19:28 GMT–Neural Information Processing Systems

We study the noise-free Gaussian Process (GP) bandit problem, in which a learner seeks to minimize regret through noise-free observations of a black-box objective function that lies in a known reproducing kernel Hilbert space (RKHS). The Gaussian Process Upper Confidence Bound (GP-UCB) algorithm is a well-known approach for GP bandits, where query points are adaptively selected based on the GP-based upper confidence bound score. While several existing works have reported the practical success of GP-UCB, its theoretical performance remains suboptimal. However, GP-UCB often empirically outperforms other nearly-optimal noise-free algorithms that use non-adaptive sampling schemes. This paper resolves the gap between theoretical and empirical performance by establishing a nearly-optimal regret upper bound for noise-free GP-UCB. Specifically, our analysis provides the first constant cumulative regret bounds in the noise-free setting for both the squared exponential kernel and the Mat ern kernel with some degree of smoothness.

artificial intelligence, data mining, machine learning, (20 more...)

Neural Information Processing Systems

Jun-17-2026, 16:19:28 GMT

Conferences PDF

Add feedback

Country:
- Asia > Japan > Honshū > Kantō (0.28)

Genre:
- Research Report > Experimental Study (1.00)

Technology:
- Information Technology
  - Modeling & Simulation (1.00)
  - Data Science > Data Mining
    - Big Data (0.48)
  - Artificial Intelligence
    - Machine Learning (1.00)
    - Representation & Reasoning > Optimization (0.46)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found