AITopics | hyperband

Selecting Hyperparameters for Tree-Boosting

Koster, Floris Jan, Sigrist, Fabio

arXiv.org Machine Learning, Feb-6-2026

Tree-boosting is a widely used machine learning technique for tabular data. However, its out-of-sample accuracy is critically dependent on multiple hyperparameters. In this article, we empirically compare several popular methods for hyperparameter optimization for tree-boosting, including random grid search, the tree-structured Parzen estimator (TPE), Gaussian-process-based Bayesian optimization (GP-BO), Hyperband, the sequential model-based algorithm configuration (SMAC) method, and deterministic full grid search, using 59 regression and classification data sets. We find that the SMAC method clearly outperforms all the other considered methods. We further observe that (i) a relatively large number of trials, more than 100, is required for accurate tuning, (ii) using default values for hyperparameters yields very inaccurate models, (iii) all considered hyperparameters can have a material effect on the accuracy of tree-boosting, i.e., there is no small set of hyperparameters that is more important than others, and (iv) for regression tasks, choosing the number of boosting iterations using early stopping yields more accurate results than including it in the search space.
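
As a rough illustration of what choosing the number of boosting iterations by early stopping can look like, here is a minimal sketch. It is not the authors' procedure: the function name `early_stopping_iteration`, the `patience` rule, and the validation losses are all hypothetical, and real boosting libraries implement their own variants.

```python
from typing import Sequence


def early_stopping_iteration(val_losses: Sequence[float], patience: int = 3) -> int:
    """Return the 1-based iteration with the best validation loss, scanning
    only until `patience` consecutive iterations fail to improve on it."""
    best_iter, best_loss, stale = 0, float("inf"), 0
    for i, loss in enumerate(val_losses, start=1):
        if loss < best_loss:
            # New best: remember it and reset the no-improvement counter.
            best_iter, best_loss, stale = i, loss, 0
        else:
            stale += 1
            if stale >= patience:
                break  # stop training; later iterations are never evaluated
    return best_iter


# Hypothetical validation losses, one per boosting iteration.
losses = [0.90, 0.70, 0.60, 0.58, 0.59, 0.61, 0.60, 0.50]
chosen = early_stopping_iteration(losses, patience=3)
print(chosen)
```

With this rule the number of iterations is set by the validation curve rather than searched as one more hyperparameter, so the tuner only has to explore the remaining dimensions.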

artificial intelligence, machine learning, optimization problem, (16 more...)

arXiv.org Machine Learning

2602.05786

Country: Europe > Switzerland > Zürich > Zürich (0.14)

Genre: Research Report > New Finding (0.46)

Industry: Health & Medicine (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.89)

POCAII: Parameter Optimization with Conscious Allocation using Iterative Intelligence

Inman, Joshua, Khandait, Tanmay, Sankar, Lalitha, Pedrielli, Giulia

arXiv.org Machine Learning, May-20-2025

In this paper, we propose, for the first time, the hyperparameter optimization (HPO) algorithm POCAII. POCAII differs from the Hyperband and Successive Halving literature by explicitly separating the search and evaluation phases and by using principled approaches to exploration and exploitation during both phases. This separation yields a highly flexible scheme for managing a hyperparameter optimization budget: it focuses on search (i.e., generating competing configurations) towards the start of the HPO process and increases the evaluation effort as the HPO process comes to an end. POCAII was compared to the state-of-the-art approaches SMAC, BOHB, and DEHB. Our algorithm shows superior performance in low-budget hyperparameter optimization regimes. Since many practitioners do not have exhaustive resources to assign to HPO, it has wide applications to real-world problems. Moreover, the empirical evidence shows that POCAII achieves higher robustness and lower variance in its results. This is again very important in realistic scenarios where models are extremely expensive to train.

artificial intelligence, machine learning, natural language, (19 more...)

arXiv.org Machine Learning

2505.11745

Genre: Research Report > Promising Solution (0.48)

Technology:

Information Technology > Artificial Intelligence > Natural Language (0.92)
Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.69)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.67)
(2 more...)

FlexHB: a More Efficient and Flexible Framework for Hyperparameter Optimization

Zhang, Yang, Wu, Haiyang, Yang, Yuekui

arXiv.org Artificial Intelligence, Feb-21-2024

Given a Hyperparameter Optimization (HPO) problem, how can we design an algorithm that finds optimal configurations efficiently? Bayesian Optimization (BO) and the multi-fidelity BO methods employ surrogate models to sample configurations based on the history of evaluations. More recent studies obtain better performance by integrating BO with HyperBand (HB), which accelerates evaluation through an early stopping mechanism. However, these methods ignore the advantage of a suitable evaluation scheme over the default HyperBand, and the capability of BO is still constrained by skewed evaluation results. In this paper, we propose FlexHB, a new method that pushes multi-fidelity BO to the limit and re-designs the framework for early stopping with Successive Halving (SH). A comprehensive study of FlexHB shows that (1) our fine-grained fidelity method considerably enhances the efficiency of searching for optimal configurations, and (2) our FlexBand framework (self-adaptive allocation of SH brackets, and global ranking of configurations in both current and past SH procedures) gives the algorithm more flexibility and improves the anytime performance. Our method achieves superior efficiency and outperforms other methods on various HPO tasks. Empirical results demonstrate that FlexHB can achieve up to 6.9X and 11.1X speedups over the state-of-the-art MFES-HB and BOHB, respectively.

configuration, fidelity level, hyperband, (14 more...)

arXiv.org Artificial Intelligence

2402.13641

Country:

Asia > China > Beijing > Beijing (0.04)
Europe > Spain > Andalusia > Cádiz Province > Cadiz (0.04)

Genre: Research Report > New Finding (0.34)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.70)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.46)

Parameter Optimization with Conscious Allocation (POCA)

Inman, Joshua, Khandait, Tanmay, Pedrielli, Giulia, Sankar, Lalitha

arXiv.org Machine Learning, Dec-28-2023

The performance of modern machine learning algorithms depends upon the selection of a set of hyperparameters. Common examples of hyperparameters are the learning rate and the number of layers in a dense neural network. Auto-ML is a branch of optimization that has produced important contributions in this area. Within Auto-ML, hyperband-based approaches, which eliminate poorly-performing configurations after evaluating them at low budgets, are among the most effective. However, the performance of these algorithms strongly depends on how effectively they allocate the computational budget to various hyperparameter configurations. We present the new Parameter Optimization with Conscious Allocation (POCA), a hyperband-based algorithm that adaptively allocates the input budget to the hyperparameter configurations it generates following a Bayesian sampling scheme. We compare POCA to its nearest competitor at optimizing the hyperparameters of an artificial toy function and a deep neural network, and find that POCA finds strong configurations faster in both settings.
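
The elimination step that hyperband-based approaches share (evaluate at a low budget, drop the poor performers, re-evaluate the rest at a higher budget) can be sketched as plain successive halving. This is a generic illustration, not POCA itself: the names `successive_halving` and `toy_loss`, the reduction factor `eta`, and the toy objective are assumptions made for the example.

```python
from typing import Callable, List, Tuple


def successive_halving(
    configs: List[float],
    evaluate: Callable[[float, int], float],
    min_budget: int = 1,
    eta: int = 3,
) -> Tuple[float, List[Tuple[int, int]]]:
    """Keep the best 1/eta of the configurations at each rung and multiply
    the per-configuration budget by eta. Lower loss is better."""
    survivors, budget = list(configs), min_budget
    history = []  # (number of configurations evaluated, budget each) per rung
    while len(survivors) > 1:
        history.append((len(survivors), budget))
        ranked = sorted(survivors, key=lambda c: evaluate(c, budget))
        survivors = ranked[: max(1, len(ranked) // eta)]
        budget *= eta
    return survivors[0], history


def toy_loss(config: float, budget: int) -> float:
    # Hypothetical objective: best at config = 0.3, improving with more budget.
    return (config - 0.3) ** 2 + 1.0 / budget


candidates = [i / 10 for i in range(9)]  # 0.0, 0.1, ..., 0.8
best, history = successive_halving(candidates, toy_loss, min_budget=1, eta=3)
print(best, history)
```

In this sketch the split of budget across rungs is fixed in advance by `eta`; the abstract's point is that how this budget is allocated strongly affects performance, which is what adaptive schemes aim to improve.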

budget, configuration, hyperband, (14 more...)

arXiv.org Machine Learning

2312.17404

Country:

North America > United States > California > San Francisco County > San Francisco (0.14)
North America > United States > Arizona > Maricopa County > Tempe (0.04)
North America > United States > Texas > Travis County > Austin (0.04)
(6 more...)

Genre: Research Report (0.82)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.34)

Impact of HPO on AutoML Forecasting Ensembles

Hoffmann, David

arXiv.org Artificial Intelligence, Nov-7-2023

Due to this uncertainty over which models will perform best, it is commonplace in the forecasting space that domain experts and data scientists have to experiment with several methods before they find one that works acceptably well on a particular problem. This exploration process can be time- and resource-consuming and is not always practical, due to the plethora of unsolved forecasting problems as well as the scarcity of domain experts and data scientists. In recent years, Automated Machine Learning (AutoML) has become more popular, allowing non-technical users to solve machine learning problems without in-depth knowledge of the underlying methodology, and filling in for the lack of available data scientists through automation [5]. In forecasting there are several approaches to AutoML, one of them being the established method of using ensemble learning and aggregation of forecasts [6]. This has seen a recent increase in attention, with the top-performing models in the M4 Competition [7] being of this nature [8]. Ensembling can be conceptualised as the automation of the previously manual step of exploring the performance of various algorithms on a given problem and selecting the best one or a combination of models. This, however, does not address another important aspect of data science, namely the selection of good hyperparameters, which leads to better performance of a model trained using a particular algorithm. The combination of ensemble learning and hyperparameter tuning in an AutoML forecasting setup will be discussed in this paper.

configuration, experiment, latency, (14 more...)

arXiv.org Artificial Intelligence

2311.04034

Country:

North America > Trinidad and Tobago > Trinidad > Arima > Arima (0.05)
Oceania > Australia (0.04)
North America > United States > Georgia > Fulton County > Atlanta (0.04)
(4 more...)

Genre: Research Report > Experimental Study (0.46)

Industry:

Energy > Renewable > Solar (0.46)
Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (0.45)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Iterative Deepening Hyperband

Brandt, Jasmin, Wever, Marcel, Iliadis, Dimitrios, Bengs, Viktor, Hüllermeier, Eyke

arXiv.org Artificial Intelligence, Feb-6-2023

Hyperparameter optimization (HPO) is concerned with the automated search for the most appropriate hyperparameter configuration (HPC) of a parameterized machine learning algorithm. A state-of-the-art HPO method is Hyperband, which, however, has its own parameters that influence its performance. One of these parameters, the maximal budget, is especially problematic: If chosen too small, the budget needs to be increased in hindsight and, as Hyperband is not incremental by design, the entire algorithm must be re-run. This is not only costly but also comes with a loss of valuable knowledge already accumulated. In this paper, we propose incremental variants of Hyperband that eliminate these drawbacks, and show that these variants satisfy theoretical guarantees qualitatively similar to those for the original Hyperband with the "right" budget. Moreover, we demonstrate their practical utility in experiments with benchmark data sets.
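
To see why the maximal budget is baked into Hyperband's whole schedule, the following sketch computes the bracket layout of the original (non-incremental) Hyperband from a maximal budget and a reduction factor. It follows the standard schedule as commonly stated, not the incremental variants proposed here; the function name `hyperband_brackets` and the parameter names are chosen for this example.

```python
from typing import List, Tuple


def hyperband_brackets(max_budget: int, eta: int = 3) -> List[List[Tuple[int, float]]]:
    """Return, per bracket, the list of (n_configs, budget_per_config) rungs."""
    # s_max = floor(log_eta(max_budget)), computed with integers to avoid float error.
    s_max = 0
    while eta ** (s_max + 1) <= max_budget:
        s_max += 1
    brackets = []
    for s in range(s_max, -1, -1):
        # Initial number of configurations: ceil((s_max + 1) * eta**s / (s + 1)).
        n = -(-(s_max + 1) * eta**s // (s + 1))
        # Rung i keeps floor(n / eta**i) configurations at budget max_budget / eta**(s - i).
        rungs = [(n // eta**i, max_budget / eta ** (s - i)) for i in range(s + 1)]
        brackets.append(rungs)
    return brackets


brackets = hyperband_brackets(max_budget=81, eta=3)
for rungs in brackets:
    print(rungs)
```

Every rung in every bracket is derived from `max_budget`, so raising it afterwards changes the number of brackets and all per-rung budgets. This is the sense in which the schedule has to be recomputed rather than extended.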

artificial intelligence, configuration, machine learning, (15 more...)

arXiv.org Artificial Intelligence

2302.00511

Country:

Europe > Germany > Bavaria > Upper Bavaria > Munich (0.04)
Europe > Belgium > Flanders > East Flanders > Ghent (0.04)

Genre: Research Report > New Finding (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.93)

AC-Band: A Combinatorial Bandit-Based Approach to Algorithm Configuration

Brandt, Jasmin, Schede, Elias, Bengs, Viktor, Haddenhorst, Björn, Hüllermeier, Eyke, Tierney, Kevin

arXiv.org Artificial Intelligence, Dec-1-2022

We study the algorithm configuration (AC) problem, in which one seeks to find an optimal parameter configuration of a given target algorithm in an automated way. Recently, there has been significant progress in designing AC approaches that satisfy strong theoretical guarantees. However, a significant gap still remains between the practical performance of these approaches and state-of-the-art heuristic methods. To this end, we introduce AC-Band, a general approach for the AC problem based on multi-armed bandits that provides theoretical guarantees while exhibiting strong practical performance. We show that AC-Band requires significantly less computation time than other AC approaches providing theoretical guarantees while still yielding high-quality configurations.

artificial intelligence, data mining, machine learning, (19 more...)

arXiv.org Artificial Intelligence

2212.00333

Country:

Europe > Germany > Bavaria > Upper Bavaria > Munich (0.04)
North America > United States (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Genre: Research Report > New Finding (0.67)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Data Science > Data Mining > Big Data (0.66)

Hyperparameter optimization in deep multi-target prediction

Iliadis, Dimitrios, Wever, Marcel, De Baets, Bernard, Waegeman, Willem

arXiv.org Artificial Intelligence, Nov-8-2022

As a result of the ever increasing complexity of configuring and fine-tuning machine learning models, the field of automated machine learning (AutoML) has emerged over the past decade. However, software implementations like Auto-WEKA and Auto-sklearn typically focus on classical machine learning (ML) tasks such as classification and regression. Our work can be seen as the first attempt at offering a single AutoML framework for most problem settings that fall under the umbrella of multi-target prediction, which includes popular ML settings such as multi-label classification, multivariate regression, multi-task learning, dyadic prediction, matrix completion, and zero-shot learning. Automated problem selection and model configuration are achieved by extending DeepMTP, a general deep learning framework for MTP problem settings, with popular hyperparameter optimization (HPO) methods. Our extensive benchmarking across different datasets and MTP problem settings identifies cases where specific HPO methods outperform others.

artificial intelligence, machine learning, optimization, (18 more...)

arXiv.org Artificial Intelligence

2211.04362

Country:

Europe > Germany > Bavaria > Upper Bavaria > Munich (0.04)
North America > United States > Georgia > Fulton County > Atlanta (0.04)
Europe > Belgium > Flanders > East Flanders > Ghent (0.04)

Genre: Research Report (0.82)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

A Deep Neural Networks ensemble workflow from hyperparameter search to inference leveraging GPU clusters

Pochelu, Pierrick, Petiton, Serge G., Conche, Bruno

arXiv.org Artificial Intelligence, Aug-30-2022

Automated Machine Learning with ensembling (or AutoML with ensembling) seeks to automatically build ensembles of Deep Neural Networks (DNNs) to achieve high-quality predictions. Ensembles of DNNs are well known to avoid over-fitting, but they are memory- and time-consuming approaches. Therefore, an ideal AutoML would produce, in a single run, different ensembles with different trade-offs between accuracy and inference speed. While previous works on AutoML focus on searching for the best model to maximize its generalization ability, we instead propose a new AutoML that builds a larger library of accurate and diverse individual models from which to construct ensembles. First, our extensive benchmarks show that asynchronous Hyperband is an efficient and robust way to build a large number of diverse models to combine. Then, a new ensemble selection method based on a multi-objective greedy algorithm is proposed to generate accurate ensembles while controlling their computing cost. Finally, we propose a novel algorithm, based on allocation optimization, to optimize the inference of the DNN ensemble in a GPU cluster. The resulting AutoML with ensembling method shows robust results on two datasets, efficiently using GPU clusters during both the training phase and the inference phase. Deep Neural Networks (DNNs) are notoriously difficult to tune, train, and ensemble to achieve state-of-the-art results. Automatic machine learning with ensembling, or "AutoML+ensembling", tools provide a simple interface to train and evaluate many ensembles of DNNs to achieve high accuracy by reducing overfitting. Nowadays, many researchers and practitioners understand the benefit of ensembling DNNs. Further, several winners and top performers in challenges routinely use ensembles to improve accuracy. However, ensembles of DNNs suffer from three main limitations that hinder their wide deployment in research and industrial applications.

algorithm, ensemble, international conference, (13 more...)

arXiv.org Artificial Intelligence

doi: 10.1145/3492805.3492819

2208.14046

Country:

North America > United States > New York > New York County > New York City (0.04)
Europe > France > Nouvelle-Aquitaine > Pyrénées-Atlantiques > Pau (0.04)
Asia > Middle East > Jordan (0.04)
(13 more...)

Genre: Workflow (1.00)

Industry: Information Technology > Security & Privacy (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

On the Importance of Hyperparameter Optimization for Model-based Reinforcement Learning

Zhang, Baohe, Rajan, Raghu, Pineda, Luis, Lambert, Nathan, Biedenkapp, André, Chua, Kurtland, Hutter, Frank, Calandra, Roberto

arXiv.org Artificial Intelligence, Feb-26-2021

Model-based Reinforcement Learning (MBRL) is a promising framework for learning control in a data-efficient manner. MBRL algorithms can be fairly complex due to the separate dynamics modeling and the subsequent planning algorithm, and as a result, they often possess tens of hyperparameters and architectural choices. For this reason, MBRL typically requires significant human expertise before it can be applied to new problems and domains. To alleviate this problem, we propose to use automatic hyperparameter optimization (HPO). We demonstrate that this problem can be tackled effectively with automated HPO, which yields significantly improved performance compared to human experts. In addition, we show that tuning several MBRL hyperparameters dynamically, i.e., during the training itself, further improves performance compared to using static hyperparameters that are kept fixed for the whole training. Finally, our experiments provide valuable insights into the effects of several hyperparameters, such as plan horizon or learning rate, and their influence on the stability of training and the resulting rewards.

configuration, hyperparameter, log 10, (15 more...)

arXiv.org Artificial Intelligence

2102.13651

Country:

Europe > Germany > Baden-Württemberg > Freiburg (0.04)
North America > United States > California > San Diego County > San Diego (0.04)

Genre: Research Report > Experimental Study (0.48)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
