Hyperparameter Optimization in Machine Learning
Franceschi, Luca, Donini, Michele, Perrone, Valerio, Klein, Aaron, Archambeau, Cédric, Seeger, Matthias, Pontil, Massimiliano, Frasconi, Paolo
Hyperparameters are configuration variables controlling the behavior of machine learning algorithms. They are ubiquitous in machine learning and artificial intelligence and the choice of their values determine the effectiveness of systems based on these technologies. Manual hyperparameter search is often unsatisfactory and becomes unfeasible when the number of hyperparameters is large. Automating the search is an important step towards automating machine learning, freeing researchers and practitioners alike from the burden of finding a good set of hyperparameters by trial and error. In this survey, we present a unified treatment of hyperparameter optimization, providing the reader with examples and insights into the state-of-the-art. We cover the main families of techniques to automate hyperparameter search, often referred to as hyperparameter optimization or tuning, including random and quasi-random search, bandit-, model- and gradient- based approaches. We further discuss extensions, including online, constrained, and multi-objective formulations, touch upon connections with other fields such as meta-learning and neural architecture search, and conclude with open questions and future research directions.
Oct-30-2024
- Country:
- Oceania > Australia
- New South Wales > Sydney (0.04)
- North America
- United States
- Ohio > Franklin County
- Columbus (0.04)
- New York > New York County
- New York City (0.04)
- Nevada > Clark County
- Las Vegas (0.04)
- Massachusetts > Middlesex County
- Cambridge (0.04)
- Colorado > Denver County
- Denver (0.04)
- California
- San Francisco County > San Francisco (0.14)
- Los Angeles County > Long Beach (0.04)
- Ohio > Franklin County
- Canada
- Quebec > Montreal (0.04)
- Nova Scotia > Halifax Regional Municipality
- Halifax (0.04)
- United States
- Europe
- United Kingdom > England
- Cambridgeshire > Cambridge (0.04)
- Netherlands > South Holland
- Dordrecht (0.04)
- Germany > Saxony
- Leipzig (0.04)
- France > Hauts-de-France
- United Kingdom > England
- Asia > Russia
- Oceania > Australia
- Genre:
- Research Report (1.00)
- Overview (1.00)
- Instructional Material > Course Syllabus & Notes (0.45)
- Industry:
- Information Technology (0.67)
- Education (0.67)
- Leisure & Entertainment (0.46)
- Technology:
- Information Technology > Artificial Intelligence
- Natural Language > Large Language Model (1.00)
- Representation & Reasoning
- Search (1.00)
- Optimization (1.00)
- Mathematical & Statistical Methods (0.92)
- Uncertainty > Bayesian Inference (0.67)
- Machine Learning
- Statistical Learning (1.00)
- Neural Networks > Deep Learning (1.00)
- Evolutionary Systems (1.00)
- Learning Graphical Models > Directed Networks
- Bayesian Learning (0.67)
- Information Technology > Artificial Intelligence