Towards Automated Deep Learning: Efficient Joint Neural Architecture and Hyperparameter Search

Arber Zela, Aaron Klein, Stefan Falkner, Frank Hutter

arXiv.org Artificial Intelligence 

While existing work on neural architecture search (NAS) tunes hyperparameters in a separate post-processing step, we demonstrate that architectural choices and other hyperparameter settings interact in a way that can render this separation suboptimal. Likewise, we demonstrate that the common practice of using very few epochs during the main NAS phase and many more epochs during a post-processing step is inefficient, because the relative rankings of configurations under these two training regimes correlate only weakly. To address both problems, we propose to use a recent combination of Bayesian optimization and Hyperband for efficient joint neural architecture and hyperparameter search.
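The combination of Bayesian optimization and Hyperband referred to here is BOHB (Falkner et al., 2018). The sketch below is not the authors' code; it is a minimal illustration of treating architectural choices and training hyperparameters as a single joint search space, assuming the open-source ConfigSpace and hpbandster packages, with a synthetic objective standing in for actual network training.

```python
# Minimal sketch of joint architecture + hyperparameter search with BOHB.
# Assumes the `ConfigSpace` and `hpbandster` packages; the search space and
# the toy objective are illustrative only, not the paper's experimental setup.
import ConfigSpace as CS
import ConfigSpace.hyperparameters as CSH
import hpbandster.core.nameserver as hpns
from hpbandster.core.worker import Worker
from hpbandster.optimizers import BOHB


def joint_configspace():
    """One configuration space containing both architectural choices and
    other hyperparameters, so BOHB optimizes them jointly."""
    cs = CS.ConfigurationSpace(seed=1)
    cs.add_hyperparameters([
        # architectural choices (hypothetical example space)
        CSH.UniformIntegerHyperparameter('num_layers', lower=1, upper=5),
        CSH.UniformIntegerHyperparameter('num_filters', lower=16, upper=256, log=True),
        CSH.CategoricalHyperparameter('activation', ['relu', 'elu', 'tanh']),
        # training hyperparameters
        CSH.UniformFloatHyperparameter('learning_rate', lower=1e-4, upper=1e-1, log=True),
        CSH.UniformFloatHyperparameter('weight_decay', lower=1e-6, upper=1e-2, log=True),
    ])
    return cs


class ToyWorker(Worker):
    """Placeholder worker: a real worker would build the network described by
    the sampled architecture, train it for `budget` epochs with the sampled
    learning rate and weight decay, and return the validation error."""

    def compute(self, config, budget, **kwargs):
        # Synthetic loss standing in for validation error after `budget` epochs.
        loss = (config['learning_rate'] - 1e-2) ** 2 + 1.0 / (budget * config['num_layers'])
        return {'loss': loss, 'info': {'budget': budget}}


if __name__ == '__main__':
    ns = hpns.NameServer(run_id='joint_nas_hpo', host='127.0.0.1', port=None)
    ns.start()
    worker = ToyWorker(nameserver='127.0.0.1', run_id='joint_nas_hpo')
    worker.run(background=True)

    # Budgets are epochs: BOHB evaluates many configurations on small budgets
    # and promotes promising ones to larger budgets, as in Hyperband.
    opt = BOHB(configspace=joint_configspace(), run_id='joint_nas_hpo',
               nameserver='127.0.0.1', min_budget=3, max_budget=27)
    result = opt.run(n_iterations=4)
    opt.shutdown(shutdown_workers=True)
    ns.shutdown()

    best = result.get_incumbent_id()
    print(result.get_id2config_mapping()[best]['config'])
```

Evaluating cheap, low-epoch budgets only to pre-select configurations for long training is exactly the practice the abstract cautions against; the point of the joint, multi-fidelity search sketched above is that architecture and hyperparameters are tuned together across budgets rather than in separate stages.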
