A Deep Neural Networks ensemble workflow from hyperparameter search to inference leveraging GPU clusters
Pochelu, Pierrick, Petiton, Serge G., Conche, Bruno
–arXiv.org Artificial Intelligence
Automated Machine Learning with ensembling (or AutoML with ensembling) seeks to automatically build ensembles of Deep Neural Networks (DNNs) to achieve qualitative predictions. Ensemble of DNNs are well known to avoid over-fitting but they are memory and time consuming approaches. Therefore, an ideal AutoML would produce in one single run time different ensembles regarding accuracy and inference speed. While previous works on AutoML focus to search for the best model to maximize its generalization ability, we rather propose a new AutoML to build a larger library of accurate and diverse individual models to then construct ensembles. First, our extensive benchmarks show asynchronous Hyperband is an efficient and robust way to build a large number of diverse models to combine them. Then, a new ensemble selection method based on a multi-objective greedy algorithm is proposed to generate accurate ensembles by controlling their computing cost. Finally, we propose a novel algorithm to optimize the inference of the DNNs ensemble in a GPU cluster based on allocation optimization. The produced AutoML with ensemble method shows robust results on two datasets using efficiently GPU clusters during both the training phase and the inference phase. Deep Neural networks (DNNs) are notoriously difficult to tune, train, and ensemble to achieve state-of-the-art results. Automatic machine learning with ensembling or "AutoML+ensembling" tools provide a simple interface to train and evaluate many ensembles of DNNs to achieve high accuracy by reducing overfitting. Nowadays, multiple researchers and practitioners have well understood the benefit of ensembling DNNs. Further, several winners and top performers on challenges routinely use ensembles to improve accuracy. However, ensembles of DNNs suffer from three main limitations to be widely deployed in research and industrial applications.
arXiv.org Artificial Intelligence
Aug-30-2022
- Country:
- Oceania > Australia
- New South Wales > Sydney (0.04)
- North America
- United States
- Virginia > Arlington County
- Arlington (0.04)
- New York > New York County
- New York City (0.04)
- Georgia > Chatham County
- Savannah (0.04)
- Colorado > Denver County
- Denver (0.04)
- California
- San Diego County > Carlsbad (0.04)
- Los Angeles County > Los Angeles (0.04)
- Alaska > Anchorage Municipality
- Anchorage (0.04)
- Virginia > Arlington County
- Puerto Rico > San Juan
- San Juan (0.04)
- Canada > Alberta
- United States
- Europe
- Spain > Catalonia
- Barcelona Province > Barcelona (0.04)
- France
- Nouvelle-Aquitaine > Pyrénées-Atlantiques
- Pau (0.04)
- Hauts-de-France > Nord
- Lille (0.04)
- Nouvelle-Aquitaine > Pyrénées-Atlantiques
- Spain > Catalonia
- Asia
- Middle East > Jordan (0.04)
- Japan > Honshū
- Kansai > Kyoto Prefecture > Kyoto (0.04)
- Africa > Ethiopia
- Addis Ababa > Addis Ababa (0.04)
- Oceania > Australia
- Genre:
- Workflow (1.00)
- Industry:
- Information Technology > Security & Privacy (0.46)
- Technology: