Gaussian Process Bandit Optimization of the Thermodynamic Variational Objective