AMLB: an AutoML Benchmark

Gijsbers, Pieter, Bueno, Marcos L. P., Coors, Stefan, LeDell, Erin, Poirier, Sébastien, Thomas, Janek, Bischl, Bernd, Vanschoren, Joaquin

Nov-16-2023–arXiv.org Machine Learning

Comparing different AutoML frameworks is notoriously challenging and often done incorrectly. We introduce an open and extensible benchmark that follows best practices and avoids common mistakes when comparing AutoML frameworks. We conduct a thorough comparison of 9 well-known AutoML frameworks across 71 classification and 33 regression tasks. The differences between the AutoML frameworks are explored with a multi-faceted analysis, evaluating model accuracy, its trade-offs with inference time, and framework failures. We also use Bradley-Terry trees to discover subsets of tasks where the relative AutoML framework rankings differ. The benchmark comes with an open-source tool that integrates with many AutoML frameworks and automates the empirical evaluation process end-to-end: from framework installation and resource allocation to in-depth evaluation. The benchmark uses public data sets, can be easily extended with other AutoML frameworks and tasks, and has a website with up-to-date results.

data mining, evolutionary algorithm, machine learning, (21 more...)

arXiv.org Machine Learning

Nov-16-2023

arXiv.org PDF

Add feedback

Country:
- North America > United States
  - Oregon > Multnomah County
    - Portland (0.04)
  - New York > New York County
    - New York City (0.04)
  - California
    - Santa Clara County > Mountain View (0.04)
    - Los Angeles County > Long Beach (0.04)
- Europe
  - Spain > Andalusia
    - Cádiz Province > Cadiz (0.04)
  - Netherlands
    - North Brabant > Eindhoven (0.04)
    - Gelderland > Nijmegen (0.04)
  - Germany > Bavaria
    - Upper Bavaria > Munich (0.04)

Genre:
- Research Report
  - New Finding (0.68)
  - Experimental Study (0.67)

Industry:
- Health & Medicine > Therapeutic Area (0.67)

Technology:
- Information Technology
  - Information Management (1.00)
  - Data Science > Data Mining (1.00)
  - Software (0.87)
  - Artificial Intelligence
    - Representation & Reasoning
      - Optimization (1.00)
      - Search (0.68)
    - Machine Learning
      - Statistical Learning (1.00)
      - Evolutionary Systems (0.92)
      - Neural Networks > Deep Learning (0.67)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found