Goto

Collaborating Authors

 Folch, Jose Pablo


BARK: A Fully Bayesian Tree Kernel for Black-box Optimization

arXiv.org Machine Learning

We perform Bayesian optimization using a Gaussian process perspective on Bayesian Additive Regression Trees (BART). Our BART Kernel (BARK) uses tree agreement to define a posterior over piecewise-constant functions, and we explore the space of tree kernels using a Markov chain Monte Carlo approach. Where BART only samples functions, the resulting BARK model obtains samples of Gaussian processes defining distributions over functions, which allow us to build acquisition functions for Bayesian optimization. Our tree-based approach enables global optimization over the surrogate, even for mixed-feature spaces. Moreover, where many previous tree-based kernels provide uncertainty quantification over function values, our sampling scheme captures uncertainty over the tree structure itself. Our experiments show the strong performance of BARK on both synthetic and applied benchmarks, due to the combination of our fully Bayesian surrogate and the optimization procedure.


Transition Constrained Bayesian Optimization via Markov Decision Processes

arXiv.org Artificial Intelligence

Bayesian optimization is a methodology to optimize black-box functions. Traditionally, it focuses on the setting where you can arbitrarily query the search space. However, many real-life problems do not offer this flexibility; in particular, the search space of the next query may depend on previous ones. Example challenges arise in the physical sciences in the form of local movement constraints, required monotonicity in certain variables, and transitions influencing the accuracy of measurements. Altogether, such transition constraints necessitate a form of planning. This work extends Bayesian optimization via the framework of Markov Decision Processes, iteratively solving a tractable linearization of our objective using reinforcement learning to obtain a policy that plans ahead over long horizons. The resulting policy is potentially history-dependent and non-Markovian. We showcase applications in chemical reactor optimization, informative path planning, machine calibration, and other synthetic examples.


Practical Path-based Bayesian Optimization

arXiv.org Artificial Intelligence

There has been a surge in interest in data-driven experimental design with applications to chemical engineering and drug manufacturing. Bayesian optimization (BO) has proven to be adaptable to such cases, since we can model the reactions of interest as expensive black-box functions. Sometimes, the cost of this black-box functions can be separated into two parts: (a) the cost of the experiment itself, and (b) the cost of changing the input parameters. In this short paper, we extend the SnAKe algorithm to deal with both types of costs simultaneously. We further propose extensions to the case of a maximum allowable input change, as well as to the multi-objective setting.


Gaussian Processes for Monitoring Air-Quality in Kampala

arXiv.org Machine Learning

Monitoring air pollution is of vital importance to the overall health of the population. Unfortunately, devices that can measure air quality can be expensive, and many cities in low and middle-income countries have to rely on a sparse allocation of them. In this paper, we investigate the use of Gaussian Processes for both nowcasting the current air-pollution in places where there are no sensors and forecasting the air-pollution in the future at the sensor locations. In particular, we focus on the city of Kampala in Uganda, using data from AirQo's network of sensors. We demonstrate the advantage of removing outliers, compare different kernel functions and additional inputs. We also compare two sparse approximations to allow for the large amounts of temporal data in the dataset.


Combining Multi-Fidelity Modelling and Asynchronous Batch Bayesian Optimization

arXiv.org Artificial Intelligence

The optimal design of many engineering processes can be subject to expensive and time-consuming experimentation. For efficiency, we seek to avoid wasting valuable resources in testing sub-optimal designs. One way to achieve this is by obtaining cheaper approximations of the desired system, which allow us to quickly explore new regimes and avoid areas that are clearly sub-optimal. As an example, consider the case diagrammed in Figure 1 from battery materials research with the goal of designing electrode materials for optimal performance in pouch cells. We can use experiments with cheaper coin cells and shorter test procedures to approximate the behaviour of the material in longer stability tests in pouch cells, which is in turn closer to the expected performance in electric car applications [Chen et al., 2019, Dörfler et al., 2020, Liu et al., 2021]. Similarly, design goals regarding battery life such as discharge capacity retention can be approximated using an early prediction model on the first few charge cycles rather than running aging and stability tests to completion [Attia et al., 2020].