Towards Automated Deep Learning: Efficient Joint Neural Architecture and Hyperparameter Search
Zela, Arber, Klein, Aaron, Falkner, Stefan, Hutter, Frank
–arXiv.org Artificial Intelligence
While existing work on neural architecture search (NAS) tunes hyperparameters in a separate post-processing step, we demonstrate that architectural choices and other hyperparameter settings interact in a way that can render this separation suboptimal. Likewise, we demonstrate that the common practice of using very few epochs during the main NAS and much larger numbers of epochs during a post-processing step is inefficient due to little correlation in the relative rankings for these two training regimes. To combat both of these problems, we propose to use a recent combination of Bayesian optimization and Hyperband for efficient joint neural architecture and hyperparameter search.
arXiv.org Artificial Intelligence
Jul-18-2018
- Country:
- Oceania > Australia
- New South Wales > Sydney (0.04)
- North America > Canada
- Europe > Germany
- Baden-Württemberg
- Freiburg (0.05)
- Tübingen Region > Tübingen (0.04)
- Baden-Württemberg
- Oceania > Australia
- Genre:
- Research Report (0.64)
- Technology: