Pre-training helps Bayesian optimization too