AutoML-Med: A Framework for Automated Machine Learning in Medical Tabular Data

Francia, Riccardo, Leone, Maurizio, Leonardi, Giorgio, Montani, Stefania, Pennisi, Marzio, Striani, Manuel, D'Alfonso, Sandra

Aug-5-2025–arXiv.org Artificial Intelligence

In recent years, the advent of deep learning and, in particular, of transformer-based architectures has transformed Artificial Intelligence (AI) across many scientific domains, including computer vision, natural language processing, and sequence modeling, thanks to the increasing availability of computational power and large-scale datasets. However, classical Machine Learning (ML) methods, such as decision trees, gradient-boosted trees, Support Vector Machines (SVMs), and regression-based techniques, are still considered the state of the art for tabular data, which remain widely used in healthcare, finance, industrial monitoring, and other structured-data domains. There are several reasons for this. Notably, conventional AI models tend to perform reasonably well on datasets of limited size, whereas state-of-the-art deep learning techniques typically require substantially larger amounts of data to generalize effectively. Moreover, many classical AI methods, such as regression, Bayesian approaches, rule-based systems, and tree-based models, are inherently more interpretable, a characteristic that is particularly valuable in high-stakes domains such as healthcare. In contrast, deep learning models often behave as black boxes, which limits their explainability. As an example, Grinsztajn et al. [1] showed that tree-based ensembles such as XGBoost and Random Forests consistently outperformed a wide range of contemporary deep learning models across dozens of medium-sized tabular datasets (


Country:
- Europe
  - Belgium > Flanders
    - East Flanders > Ghent (0.04)
  - Italy (0.05)
- North America > United States
  - Colorado > Denver County > Denver (0.04)

Genre:
- Research Report > Experimental Study (0.93)

Industry:
- Health & Medicine
  - Consumer Health (0.68)
  - Therapeutic Area
    - Endocrinology > Diabetes (0.49)
    - Immunology (0.93)
    - Infections and Infectious Diseases (0.93)
    - Neurology > Multiple Sclerosis (0.48)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
