Simplifying Bayesian Optimization Via In-Context Direct Optimum Sampling

Gustavo Sutter Pessurno de Carvalho, Mohammed Abdulrahman, Hao Wang, Sriram Ganapathi Subramanian, Marc St-Aubin, Sharon O'Sullivan, Lawrence Wan, Luis Ricardez-Sandoval, Pascal Poupart, Agustinus Kristiadi

arXiv.org Machine Learning 

The optimization of expensive black-box functions is ubiquitous in science and engineering. A common solution to this problem is Bayesian optimization (BO), which generally comprises two components: (i) a surrogate model and (ii) an acquisition function; the former typically requires expensive re-training and the latter an inner optimization step at each iteration. Although recent work has enabled in-context surrogate models that do not require re-training, virtually all existing BO methods still require acquisition function maximization to select the next observation, which introduces many knobs to tune, such as Monte Carlo samplers and multi-start optimizers. In this work, we propose a completely in-context, zero-shot solution for BO that requires neither surrogate fitting nor acquisition function optimization. This is done by using a pre-trained deep generative model to directly sample from the posterior over the optimum point. We show that this process is equivalent to Thompson sampling, and we demonstrate the capabilities and cost-effectiveness of our foundation model on a suite of real-world benchmarks. We achieve an efficiency gain of more than 35x in terms of wall-clock time compared with Gaussian process-based BO, enabling efficient parallel and distributed BO, e.g., for high-throughput optimization.
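To make the baseline concrete, the classical pipeline the abstract contrasts against can be sketched as Gaussian-process Thompson sampling: fit a GP surrogate to the observations, draw one sample from its posterior over a candidate set, and evaluate the sample's optimizer next. The sketch below is a minimal NumPy implementation of that baseline (not the paper's method); the kernel length-scale, jitter values, and the toy objective are illustrative assumptions.

```python
import numpy as np

def rbf(A, B, ls=0.2):
    """Squared-exponential kernel between row-vector point sets A and B."""
    d2 = np.sum(A**2, 1)[:, None] + np.sum(B**2, 1)[None, :] - 2 * A @ B.T
    return np.exp(-0.5 * d2 / ls**2)

def thompson_step(X, y, candidates, rng):
    """One Thompson-sampling step: sample a GP posterior function over the
    candidate grid and return the candidate that minimizes that sample."""
    n, m = len(X), len(candidates)
    K = rbf(X, X) + 1e-6 * np.eye(n)            # train covariance + jitter
    Ks = rbf(X, candidates)                      # train/candidate cross-cov
    Kss = rbf(candidates, candidates) + 1e-6 * np.eye(m)
    L = np.linalg.cholesky(K)
    alpha = np.linalg.solve(L.T, np.linalg.solve(L, y))
    mu = Ks.T @ alpha                            # posterior mean
    v = np.linalg.solve(L, Ks)
    cov = Kss - v.T @ v                          # posterior covariance
    # Draw one function sample via a Cholesky factor of the posterior cov.
    Lc = np.linalg.cholesky(cov + 1e-6 * np.eye(m))
    f_sample = mu + Lc @ rng.standard_normal(m)
    return candidates[np.argmin(f_sample)]       # minimize the sampled path

# Toy run: minimize f(x) = (x - 0.3)^2 on [0, 1].
rng = np.random.default_rng(0)
f = lambda x: (x - 0.3) ** 2
X = rng.uniform(0, 1, (3, 1))                    # initial random design
y = f(X).ravel()
cands = np.linspace(0, 1, 200)[:, None]
for _ in range(15):
    x_next = thompson_step(X, y, cands, rng)
    X = np.vstack([X, x_next[None, :]])
    y = np.append(y, f(x_next)[0])
best = X[np.argmin(y)][0]
```

Note that every iteration re-fits the GP (a Cholesky factorization) and optimizes over the candidate grid; the paper's in-context approach replaces both steps with a single forward pass of a pre-trained generative model that samples the optimum directly.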