AITopics | Wever, Marcel

Wever, Marcel

HyperSHAP: Shapley Values and Interactions for Hyperparameter Importance

Wever, Marcel, Muschalik, Maximilian, Fumagalli, Fabian, Lindauer, Marius

arXiv.org Machine Learning | Feb-3-2025

Hyperparameter optimization (HPO) is a crucial step in achieving strong predictive performance. However, the impact of individual hyperparameters on model generalization is highly context-dependent, prohibiting a one-size-fits-all solution and requiring opaque automated machine learning (AutoML) systems to find optimal configurations. The black-box nature of most AutoML systems undermines user trust and discourages adoption. To address this, we propose a game-theoretic explainability framework for HPO that is based on Shapley values and interactions. Our approach provides an additive decomposition of a performance measure across hyperparameters, enabling local and global explanations of hyperparameter importance and interactions. The framework, named HyperSHAP, offers insights into ablations, the tunability of learning algorithms, and optimizer behavior across different hyperparameter spaces. We evaluate HyperSHAP on various HPO benchmarks by analyzing the interaction structure of the HPO problem. Our results show that while higher-order interactions exist, most performance improvements can be explained by focusing on lower-order representations.
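The additive decomposition mentioned in the abstract can be illustrated with exact Shapley values on a toy game. The sketch below is not the HyperSHAP implementation or its API. It assumes a hypothetical value function, `toy_performance`, that returns a made-up validation score when the hyperparameters in a coalition are tuned and all others stay at their defaults. The hyperparameter names and score increments are invented for illustration.

```python
from itertools import combinations
from math import factorial


def shapley_values(players, value):
    """Exact Shapley values for a set function value(frozenset) -> float."""
    n = len(players)
    phi = {}
    for p in players:
        others = [q for q in players if q != p]
        total = 0.0
        for k in range(n):
            # Weight of a coalition of size k that does not contain p.
            weight = factorial(k) * factorial(n - k - 1) / factorial(n)
            for subset in combinations(others, k):
                s = frozenset(subset)
                total += weight * (value(s | {p}) - value(s))
        phi[p] = total
    return phi


def toy_performance(tuned):
    """Hypothetical score when only the hyperparameters in `tuned` are tuned."""
    score = 0.70  # assumed score with every hyperparameter at its default
    if "learning_rate" in tuned:
        score += 0.10
    if "max_depth" in tuned:
        score += 0.04
    # Assumed interaction: n_estimators only helps once learning_rate is tuned.
    if {"learning_rate", "n_estimators"} <= tuned:
        score += 0.06
    return score


HYPERPARAMETERS = ["learning_rate", "max_depth", "n_estimators"]
phi = shapley_values(HYPERPARAMETERS, toy_performance)

for name, contribution in phi.items():
    print(f"{name}: {contribution:+.3f}")

# Efficiency: the contributions add up to the gain of tuning everything.
gain = toy_performance(frozenset(HYPERPARAMETERS)) - toy_performance(frozenset())
print(f"sum of contributions = {sum(phi.values()):.3f}, total gain = {gain:.3f}")
```

In this toy game the interaction term is split evenly between the two hyperparameters involved, which is the kind of effect that interaction indices are meant to expose separately. Exact enumeration visits every coalition, so it is only practical for a handful of hyperparameters.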

artificial intelligence, machine learning, natural language, (18 more...)

arXiv.org Machine Learning

2502.01276

Country:

Asia (0.67)
Europe > Germany (0.46)
North America > United States > California > San Francisco County > San Francisco (0.14)

Genre: Research Report > New Finding (1.00)

Industry: Leisure & Entertainment > Games (0.46)

Technology:

Information Technology > Game Theory (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Position: Why We Must Rethink Empirical Research in Machine Learning

Herrmann, Moritz, Lange, F. Julian D., Eggensperger, Katharina, Casalicchio, Giuseppe, Wever, Marcel, Feurer, Matthias, Rügamer, David, Hüllermeier, Eyke, Boulesteix, Anne-Laure, Bischl, Bernd

arXiv.org Machine Learning | May-25-2024

In practice, that leads to non-replicable results, makes findings unreliable, and threatens to undermine progress in the field. To overcome this alarming situation, we call for more awareness of the plurality of ways of gaining knowledge experimentally but also of some epistemic limitations.

... it may jeopardize applied empirical researchers' confidence in experimental results and discourage them from applying ML methods, even though these novel approaches might be beneficial. For example, ML is increasingly being used in the medical domain, and this is often promising in terms of ...

artificial intelligence, deep learning, machine learning, (15 more...)

arXiv.org Machine Learning

2405.022

Country:

Europe (1.00)
North America > United States > California > San Francisco County > San Francisco (0.14)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)

Industry:

Health & Medicine (1.00)
Education (0.93)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Automated Machine Learning for Multi-Label Classification

Wever, Marcel

arXiv.org Artificial Intelligence | Feb-28-2024

Automated machine learning (AutoML) aims to select and configure machine learning algorithms and combine them into machine learning pipelines tailored to a dataset at hand. For supervised learning tasks, most notably binary and multinomial classification, also known as single-label classification (SLC), such AutoML approaches have shown promising results. However, the task of multi-label classification (MLC), where data points are associated with a set of class labels instead of a single class label, has received much less attention so far. In the context of multi-label classification, the data-specific selection and configuration of multi-label classifiers are challenging even for experts in the field, as it is a high-dimensional optimization problem with multi-level hierarchical dependencies. While the space of machine learning pipelines is already huge for SLC, the MLC search space is larger than that of SLC by several orders of magnitude. In the first part of this thesis, we devise a novel AutoML approach for single-label classification tasks that optimizes pipelines of machine learning algorithms consisting of at most two algorithms. This approach is then extended, first to optimize pipelines of unlimited length and eventually to configure the complex hierarchical structures of multi-label classification methods. Furthermore, we investigate how well AutoML approaches that form the state of the art for single-label classification tasks scale with the increased problem complexity of AutoML for multi-label classification. In the second part, we explore how methods for SLC and MLC could be configured more flexibly to achieve better generalization performance and how to increase the efficiency of execution-based AutoML systems.

artificial intelligence, evolutionary algorithm, machine learning, (16 more...)

arXiv.org Artificial Intelligence

doi: 10.17619/UNIPB/1-1302

2402.18198

Country:

Asia (1.00)
Europe > Germany (0.67)
North America > Canada (0.67)
(3 more...)

Genre:

Overview (1.00)
Instructional Material > Course Syllabus & Notes (0.47)
Research Report > New Finding (0.45)

Industry:

Information Technology (0.93)
Education (0.69)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
(3 more...)

Information Leakage Detection through Approximate Bayes-optimal Prediction

Gupta, Pritha, Wever, Marcel, Hüllermeier, Eyke

arXiv.org Artificial Intelligence | Jan-25-2024

In today's data-driven world, the proliferation of publicly available information intensifies the challenge of information leakage (IL), raising security concerns. IL involves unintentionally exposing secret (sensitive) information to unauthorized parties via systems' observable information. Conventional statistical approaches, which estimate mutual information (MI) between observable and secret information for detecting IL, face challenges such as the curse of dimensionality, convergence, computational complexity, and MI misestimation. Furthermore, emerging supervised machine learning (ML) methods, though effective, are limited to binary system-sensitive information and lack a comprehensive theoretical framework. To address these limitations, we establish a theoretical framework using statistical learning theory and information theory to accurately quantify and detect IL. We demonstrate that MI can be accurately estimated by approximating the log-loss and accuracy of the Bayes predictor. As the Bayes predictor is typically unknown in practice, we propose to approximate it with the help of automated machine learning (AutoML). First, we compare our MI estimation approaches against current baselines, using synthetic data sets generated using the multivariate normal (MVN) distribution with known MI. Second, we introduce a cut-off technique using one-sided statistical tests to detect IL, employing the Holm-Bonferroni correction to increase confidence in detection decisions. Our study evaluates IL detection performance on real-world data sets, highlighting the effectiveness of the Bayes predictor's log-loss estimation, and finds our proposed method to effectively estimate MI on synthetic data sets and thus detect ILs accurately.
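The link between the Bayes predictor's log-loss and mutual information can be sketched with the identity MI(X; Y) = H(Y) - H(Y | X), where the expected log-loss of the Bayes predictor equals H(Y | X). The example below is a minimal illustration under assumptions, not the paper's estimator. The synthetic data, the 10% flip rate, and all function names are hypothetical, and the Bayes predictor is known by construction instead of being approximated with AutoML.

```python
import math


def entropy_nats(labels):
    """Empirical entropy H(Y) of the label distribution, in nats."""
    n = len(labels)
    counts = {}
    for y in labels:
        counts[y] = counts.get(y, 0) + 1
    return -sum(c / n * math.log(c / n) for c in counts.values())


def log_loss_nats(labels, probabilities):
    """Mean negative log-probability assigned to the true label, in nats."""
    total = sum(math.log(p[y]) for y, p in zip(labels, probabilities))
    return -total / len(labels)


def mi_from_log_loss(labels, probabilities):
    """Estimate MI(X; Y) as H(Y) minus the predictor's log-loss, floored at 0."""
    return max(0.0, entropy_nats(labels) - log_loss_nats(labels, probabilities))


# Assumed toy system: the secret bit y is balanced and the observable x
# equals y, except that exactly 10% of the observations are flipped.
FLIP = 0.1
labels, observations = [], []
for y in (0, 1):
    for i in range(500):
        flipped = i < 50
        labels.append(y)
        observations.append(1 - y if flipped else y)

# For this system the Bayes predictor is known: P(y = x | x) = 1 - FLIP.
probabilities = [{x: 1 - FLIP, 1 - x: FLIP} for x in observations]

mi_estimate = mi_from_log_loss(labels, probabilities)
mi_exact = math.log(2) + FLIP * math.log(FLIP) + (1 - FLIP) * math.log(1 - FLIP)
print(f"estimated MI = {mi_estimate:.4f} nats, exact MI = {mi_exact:.4f} nats")
```

A predictor that is worse than the Bayes predictor has a higher log-loss, so this estimate would then underestimate the true MI. An estimate clearly above zero indicates that the observable information reveals something about the secret.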

artificial intelligence, machine learning, survey article, (15 more...)

arXiv.org Artificial Intelligence

2401.14283

Country:

Europe (0.92)
North America > United States > New York > New York County > New York City (0.14)
North America > United States > California > San Francisco County > San Francisco (0.14)

Genre:

Research Report > New Finding (1.00)
Research Report > Promising Solution (0.67)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (1.00)

Towards Green Automated Machine Learning: Status Quo and Future Directions

Tornede, Tanja, Tornede, Alexander, Hanselle, Jonas, Wever, Marcel, Mohr, Felix, Hüllermeier, Eyke

arXiv.org Artificial Intelligence | Jun-13-2023

Automated machine learning (AutoML) strives for the automatic configuration of machine learning algorithms and their composition into an overall (software) solution - a machine learning pipeline - tailored to the learning task (dataset) at hand. Over the last decade, AutoML has developed into an independent research field with hundreds of contributions. At the same time, AutoML is being criticised for its high resource consumption as many approaches rely on the (costly) evaluation of many machine learning pipelines, as well as the expensive large-scale experiments across many datasets and approaches. In the spirit of recent work on Green AI, this paper proposes Green AutoML, a paradigm to make the whole AutoML process more environmentally friendly. Therefore, we first elaborate on how to quantify the environmental footprint of an AutoML tool. Afterward, different strategies on how to design and benchmark an AutoML tool w.r.t. its "greenness", i.e., sustainability, are summarized. Finally, we elaborate on how to be transparent about the environmental footprint and what kind of research incentives could direct the community in a more sustainable AutoML research direction. Additionally, we propose a sustainability checklist to be attached to every AutoML paper featuring all core aspects of Green AutoML.

artificial intelligence, machine learning, proceedings, (18 more...)

arXiv.org Artificial Intelligence

doi: 10.1613/jair.1.14340

2111.0585

Country: Europe > Germany (0.28)

Genre: Research Report (1.00)

Industry: Energy (0.97)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

Towards Green Automated Machine Learning: Status Quo and Future Directions

Tornede, Tanja (Paderborn University) | Tornede, Alexander | Hanselle, Jonas | Mohr, Felix | Wever, Marcel | Hüllermeier, Eyke

Journal of Artificial Intelligence Research | Jun-12-2023

Automated machine learning (AutoML) strives for the automatic configuration of machine learning algorithms and their composition into an overall (software) solution — a machine learning pipeline — tailored to the learning task (dataset) at hand. Over the last decade, AutoML has developed into an independent research field with hundreds of contributions. At the same time, AutoML is being criticized for its high resource consumption as many approaches rely on the (costly) evaluation of many machine learning pipelines, as well as the expensive large-scale experiments across many datasets and approaches. In the spirit of recent work on Green AI, this paper proposes Green AutoML, a paradigm to make the whole AutoML process more environmentally friendly. Therefore, we first elaborate on how to quantify the environmental footprint of an AutoML tool. Afterward, different strategies on how to design and benchmark an AutoML tool w.r.t. their “greenness”, i.e., sustainability, are summarized. Finally, we elaborate on how to be transparent about the environmental footprint and what kind of research incentives could direct the community in a more sustainable AutoML research direction. As part of this, we propose a sustainability checklist to be attached to every AutoML paper featuring all core aspects of Green AutoML.

artificial intelligence, machine learning, survey article, (19 more...)

Journal of Artificial Intelligence Research

doi: 10.1613/jair.1.14340

AI Access Foundation

14340

Country: Europe > Germany (0.28)

Genre:

Research Report (0.67)
Overview (0.67)

Industry: Energy (0.97)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

PyExperimenter: Easily distribute experiments and track results

Tornede, Tanja, Tornede, Alexander, Fehring, Lukas, Gehring, Lukas, Graf, Helena, Hanselle, Jonas, Mohr, Felix, Wever, Marcel

arXiv.org Artificial Intelligence | Apr-21-2023

PyExperimenter is intended to be used by researchers in the field of artificial intelligence, but is not limited to them. The empirical analysis of algorithms is often accompanied by the execution of algorithms for different inputs and variants of the algorithms, specified via parameters, and the measurement of non-functional properties. Since the individual evaluations are usually independent, the evaluation can be performed in a distributed manner on an HPC system. However, setting up, documenting, and evaluating the results of such a study is often file-based. Usually, this requires extensive manual work to create configuration files for the inputs or to read and aggregate measured results from a report file.

artificial intelligence, experiment, machine learning, (14 more...)

arXiv.org Artificial Intelligence

doi: 10.21105/joss.05149

2301.06348

Country: Europe > Germany (0.30)

Genre: Research Report (0.65)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.32)

Iterative Deepening Hyperband

Brandt, Jasmin, Wever, Marcel, Iliadis, Dimitrios, Bengs, Viktor, Hüllermeier, Eyke

arXiv.org Artificial Intelligence | Feb-6-2023

Hyperparameter optimization (HPO) is concerned with the automated search for the most appropriate hyperparameter configuration (HPC) of a parameterized machine learning algorithm. A state-of-the-art HPO method is Hyperband, which, however, has its own parameters that influence its performance. One of these parameters, the maximal budget, is especially problematic: If chosen too small, the budget needs to be increased in hindsight and, as Hyperband is not incremental by design, the entire algorithm must be re-run. This is not only costly but also comes with a loss of valuable knowledge already accumulated. In this paper, we propose incremental variants of Hyperband that eliminate these drawbacks, and show that these variants satisfy theoretical guarantees qualitatively similar to those for the original Hyperband with the "right" budget. Moreover, we demonstrate their practical utility in experiments with benchmark data sets.
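Why the maximal budget is hard to change in hindsight can be seen from the bracket schedule of the original, non-incremental Hyperband. The sketch below computes that schedule, assuming the standard formulation with maximal budget R and reduction factor eta, where each bracket runs successive halving. It does not implement the incremental variants proposed in the paper, and the function name `hyperband_brackets` is a hypothetical helper.

```python
def hyperband_brackets(max_budget, eta=3):
    """Return the successive-halving rungs of every Hyperband bracket.

    Each bracket is a list of (number of configurations, budget per
    configuration) pairs, one pair per rung.
    """
    # s_max = floor(log_eta(max_budget)), computed with integers to avoid
    # floating-point rounding in the logarithm.
    s_max = 0
    while eta ** (s_max + 1) <= max_budget:
        s_max += 1

    brackets = []
    for s in range(s_max, -1, -1):
        # Initial number of configurations: ceil((s_max + 1) / (s + 1) * eta^s).
        numerator = (s_max + 1) * eta ** s
        n = -(-numerator // (s + 1))  # ceiling division on integers
        # Initial budget per configuration: R * eta^(-s).
        r = max_budget / eta ** s
        rungs = [(n // eta ** i, r * eta ** i) for i in range(s + 1)]
        brackets.append(rungs)
    return brackets


for max_budget in (81, 243):
    print(f"maximal budget R = {max_budget}")
    for rungs in hyperband_brackets(max_budget):
        print("  " + " -> ".join(f"{n} configs @ {r:g}" for n, r in rungs))
```

Raising R from 81 to 243 adds a bracket and changes the number of configurations in the existing ones, so the earlier run cannot simply be continued. This is the restart cost that the abstract describes.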

artificial intelligence, configuration, machine learning, (15 more...)

arXiv.org Artificial Intelligence

2302.00511

Country: Europe > Germany (0.28)

Genre: Research Report > New Finding (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.93)

Meta-Learning for Automated Selection of Anomaly Detectors for Semi-Supervised Datasets

Schubert, David, Gupta, Pritha, Wever, Marcel

arXiv.org Artificial Intelligence | Nov-24-2022

In anomaly detection, a prominent task is to induce a model that identifies anomalies but is learned solely from normal data. Generally, one is interested in finding an anomaly detector that correctly identifies anomalies, i.e., data points that do not belong to the normal class, without raising too many false alarms. Which anomaly detector is best suited depends on the dataset at hand and thus needs to be tailored. The quality of an anomaly detector may be assessed via confusion-based metrics such as the Matthews correlation coefficient (MCC). However, since only normal data is available during training in a semi-supervised setting, such metrics are not accessible. To facilitate automated machine learning for anomaly detectors, we propose to employ meta-learning to predict MCC scores based on metrics that can be computed with normal data only. First promising results can be obtained considering the hypervolume and the false positive rate as meta-features.
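The reason confusion-based metrics are out of reach with normal data only can be made concrete with the MCC formula, MCC = (TP * TN - FP * FN) / sqrt((TP + FP)(TP + FN)(TN + FP)(TN + FN)). The following sketch uses made-up labels and treats anomalies as the positive class. Returning 0 when the denominator vanishes is a common convention and is assumed here. The false positive rate, by contrast, needs normal data only.

```python
import math


def confusion_counts(y_true, y_pred, positive=1):
    """Count true/false positives and negatives for a binary problem."""
    tp = fp = tn = fn = 0
    for truth, prediction in zip(y_true, y_pred):
        if prediction == positive:
            if truth == positive:
                tp += 1
            else:
                fp += 1
        else:
            if truth == positive:
                fn += 1
            else:
                tn += 1
    return tp, fp, tn, fn


def mcc(tp, fp, tn, fn):
    """Matthews correlation coefficient; 0.0 if the denominator is zero."""
    denominator = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    return 0.0 if denominator == 0 else (tp * tn - fp * fn) / denominator


def false_positive_rate(fp, tn):
    """Share of normal points flagged as anomalies."""
    return fp / (fp + tn)


# Hypothetical labelled test set: 8 anomalies (1) and 4 normal points (0).
y_true = [1] * 8 + [0] * 4
y_pred = [1] * 6 + [0] * 2 + [1] * 1 + [0] * 3
labelled_mcc = mcc(*confusion_counts(y_true, y_pred))
print(f"MCC with labelled anomalies: {labelled_mcc:.3f}")

# Hypothetical semi-supervised setting: only normal points are available.
normal_only_true = [0] * 10
normal_only_pred = [0] * 9 + [1]
tp, fp, tn, fn = confusion_counts(normal_only_true, normal_only_pred)
print(f"MCC on normal data only: {mcc(tp, fp, tn, fn):.3f}")
print(f"false positive rate on normal data only: {false_positive_rate(fp, tn):.2f}")
```

Without any anomalies the term TP + FN is zero, so the MCC carries no information about detection quality, while the false positive rate remains well defined.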

artificial intelligence, data mining, machine learning, (14 more...)

arXiv.org Artificial Intelligence

2211.13681

Country: Europe (0.28)

Genre: Research Report (0.40)

Industry: Information Technology (0.46)

Technology:

Information Technology > Data Science > Data Mining > Anomaly Detection (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)

A Survey of Methods for Automated Algorithm Configuration

Schede, Elias (Bielefeld University) | Brandt, Jasmin (Department of Computer Science, Paderborn University) | Tornede, Alexander (Department of Computer Science, Paderborn University) | Wever, Marcel (Institute of Informatics, LMU Munich) | Bengs, Viktor (Institute of Informatics, LMU Munich) | Hüllermeier, Eyke (Institute of Informatics, LMU Munich) | Tierney, Kevin (Decision and Operation Technologies Group, Bielefeld University)

Journal of Artificial Intelligence Research | Oct-10-2022

Algorithm configuration (AC) is concerned with the automated search for the most suitable parameter configuration of a parametrized algorithm. There is currently a wide variety of AC problem variants and methods proposed in the literature. Existing reviews do not take into account all derivatives of the AC problem, nor do they offer a complete classification scheme. To this end, we introduce taxonomies to describe the AC problem and features of configuration methods, respectively. We review existing AC literature through the lens of our taxonomies, outline relevant design choices of configuration approaches, contrast methods and problem variants against each other, and describe the state of AC in industry. Finally, our review provides researchers and practitioners with a look at future research directions in the field of AC.

data mining, evolutionary algorithm, machine learning, (26 more...)

Journal of Artificial Intelligence Research

doi: 10.1613/jair.1.13676

AI Access Foundation

13676

Country:

Europe > Germany (0.46)
North America > United States (0.45)

Genre:

Research Report > New Finding (1.00)
Overview (1.00)
Research Report > Experimental Study (0.67)

Industry: Education (0.92)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
(9 more...)
