One-Shot Bayes Opt with Probabilistic Population Based Training
Parker-Holder, Jack, Nguyen, Vu, Roberts, Stephen
Selecting optimal hyperparameters is a key challenge in machine learning. An exciting recent result showed it is possible to learn high-performing hyperparameter schedules on the fly in a single training run through methods inspired by Evolutionary Algorithms. These approaches have been shown to increase performance across a wide variety of machine learning tasks, ranging from supervised (SL) to reinforcement learning (RL). However, since they remain primarily evolutionary, they act in a greedy fashion, thus require a combination of vast computational resources and carefully selected meta-parameters to effectively explore the hyperparameter space. To address these shortcomings we look to Bayesian Optimization (BO), where a Gaussian Process surrogate model is combined with an acquisition function to produce a principled mechanism to trade off exploration vs exploitation. Our approach, which we call Probabilistic Population-Based Training ($\mathrm{P2BT}$), is able to transfer sample efficiency of BO to the online setting, making it possible to achieve these traits in a single training run. We show that $\mathrm{P2BT}$ is able to achieve high performance with only a small population size, making it useful for all researchers regardless of their computational resources.
Feb-6-2020
- Country:
- North America
- United States
- Massachusetts
- Plymouth County > Hanover (0.04)
- Middlesex County > Cambridge (0.04)
- Georgia > Fulton County
- Atlanta (0.04)
- Massachusetts
- Canada > Ontario
- Toronto (0.14)
- United States
- Europe
- United Kingdom > England
- Oxfordshire > Oxford (0.14)
- Sweden > Stockholm
- Stockholm (0.04)
- United Kingdom > England
- Asia > Middle East
- Jordan (0.04)
- North America
- Genre:
- Research Report > New Finding (1.00)
- Technology: