Walz, David
Practical Path-based Bayesian Optimization
Folch, Jose Pablo, Odgers, James, Zhang, Shiqiang, Lee, Robert M, Shafei, Behrang, Walz, David, Tsay, Calvin, van der Wilk, Mark, Misener, Ruth
There has been a surge in interest in data-driven experimental design with applications to chemical engineering and drug manufacturing. Bayesian optimization (BO) has proven to be adaptable to such cases, since we can model the reactions of interest as expensive black-box functions. Sometimes, the cost of this black-box functions can be separated into two parts: (a) the cost of the experiment itself, and (b) the cost of changing the input parameters. In this short paper, we extend the SnAKe algorithm to deal with both types of costs simultaneously. We further propose extensions to the case of a maximum allowable input change, as well as to the multi-objective setting.
Combining Multi-Fidelity Modelling and Asynchronous Batch Bayesian Optimization
Folch, Jose Pablo, Lee, Robert M, Shafei, Behrang, Walz, David, Tsay, Calvin, van der Wilk, Mark, Misener, Ruth
The optimal design of many engineering processes can be subject to expensive and time-consuming experimentation. For efficiency, we seek to avoid wasting valuable resources in testing sub-optimal designs. One way to achieve this is by obtaining cheaper approximations of the desired system, which allow us to quickly explore new regimes and avoid areas that are clearly sub-optimal. As an example, consider the case diagrammed in Figure 1 from battery materials research with the goal of designing electrode materials for optimal performance in pouch cells. We can use experiments with cheaper coin cells and shorter test procedures to approximate the behaviour of the material in longer stability tests in pouch cells, which is in turn closer to the expected performance in electric car applications [Chen et al., 2019, Dörfler et al., 2020, Liu et al., 2021]. Similarly, design goals regarding battery life such as discharge capacity retention can be approximated using an early prediction model on the first few charge cycles rather than running aging and stability tests to completion [Attia et al., 2020].
Tree ensemble kernels for Bayesian optimization with known constraints over mixed-feature spaces
Thebelt, Alexander, Tsay, Calvin, Lee, Robert M., Sudermann-Merx, Nathan, Walz, David, Shafei, Behrang, Misener, Ruth
Tree ensembles can be well-suited for black-box optimization tasks such as algorithm tuning and neural architecture search, as they achieve good predictive performance with little or no manual tuning, naturally handle discrete feature spaces, and are relatively insensitive to outliers in the training data. Two well-known challenges in using tree ensembles for black-box optimization are (i) effectively quantifying model uncertainty for exploration and (ii) optimizing over the piece-wise constant acquisition function. To address both points simultaneously, we propose using the kernel interpretation of tree ensembles as a Gaussian Process prior to obtain model variance estimates, and we develop a compatible optimization formulation for the acquisition function. The latter further allows us to seamlessly integrate known constraints to improve sampling efficiency by considering domain-knowledge in engineering settings and modeling search space symmetries, e.g., hierarchical relationships in neural architecture search. Our framework performs as well as state-of-the-art methods for unconstrained black-box optimization over continuous/discrete features and outperforms competing methods for problems combining mixed-variable feature spaces and known input constraints.
Multi-Objective Constrained Optimization for Energy Applications via Tree Ensembles
Thebelt, Alexander, Tsay, Calvin, Lee, Robert M., Sudermann-Merx, Nathan, Walz, David, Tranter, Tom, Misener, Ruth
Energy systems optimization problems are complex due to strongly non-linear system behavior and multiple competing objectives, e.g. economic gain vs. environmental impact. Moreover, a large number of input variables and different variable types, e.g. continuous and categorical, are challenges commonly present in real-world applications. In some cases, proposed optimal solutions need to obey explicit input constraints related to physical properties or safety-critical operating conditions. This paper proposes a novel data-driven strategy using tree ensembles for constrained multi-objective optimization of black-box problems with heterogeneous variable spaces for which underlying system dynamics are either too complex to model or unknown. In an extensive case study comprised of synthetic benchmarks and relevant energy applications we demonstrate the competitive performance and sampling efficiency of the proposed algorithm compared to other state-of-the-art tools, making it a useful all-in-one solution for real-world applications with limited evaluation budgets.