AITopics | Ensemble Learning

Collaborating Authors

Ensemble Learning

Ensemble methods use multiple learning algorithms to obtain better predictive performance than could be obtained from any of the constituent learning algorithms alone. (Wikipedia)

News Overviews Instructional Materials AI-Alerts Classics

What is boosting in machine learning?

#artificialintelligenceJul-17-2022, 05:52:33 GMT

This article is part of Demystifying AI, a series of posts that (try to) disambiguate the jargon and myths surrounding AI. We train machine learning models to predict values such as the weather, stock prices, the class of an image, or the sentiment of a social media post. However, often, machine learning models fail to meet the performance levels that we expect of them. There are several solutions to improve the accuracy of machine learning models. One popular method is "boosting," an ensemble learning technique that brings together several ML models that perform poorly alone but stronger together.

artificial intelligence, learner, machine learning, (14 more...)

#artificialintelligence

Country: North America > United States > Texas (0.05)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (0.39)

Add feedback

Boosting Machine Learning Algorithms: An Overview - KDnuggets

#artificialintelligenceJul-16-2022, 17:47:28 GMT

Combing various machine learning algorithms while solving a problem usually results in better results. The individual algorithms are referred to as weak learners. A weak learner is a model that gives better results than a random prediction in a classification problem or the mean in a regression problem. The final result from these algorithms is obtained by fitting them on the training data and combining their predictions. In classification, the combination is done by voting, while in regression, it's done via averaging.

algorithm, artificial intelligence, machine learning, (15 more...)

#artificialintelligence

Technology: Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (0.60)

Add feedback

FIGS: Attaining XGBoost-level performance with the interpretability and speed of CART

AIHubJul-12-2022, 09:00:00 GMT

Recent machine-learning advances have led to increasingly complex predictive models, often at the cost of interpretability. We often need interpretability, particularly in high-stakes applications such as in clinical decision-making; interpretable models help with all kinds of things, such as identifying errors, leveraging domain knowledge, and making speedy predictions. In this blog post we'll cover FIGS, a new method for fitting an interpretable model that takes the form of a sum of trees. Real-world experiments and theoretical results show that FIGS can effectively adapt to a wide range of structure in data, achieving state-of-the-art performance in several settings, all without sacrificing interpretability. Intuitively, FIGS works by extending CART, a typical greedy algorithm for growing a decision tree, to consider growing a sum of trees simultaneously (see Fig 1).

attaining xgboost-level performance, interpretability and speed, prediction, (8 more...)

AIHub

Industry: Health & Medicine (0.98)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (0.66)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.40)

Add feedback

AGBoost: Attention-based Modification of Gradient Boosting Machine

Konstantinov, Andrei, Utkin, Lev, Kirpichenko, Stanislav

arXiv.org Machine LearningJul-12-2022

A new attention-based model for the gradient boosting machine (GBM) called AGBoost (the attention-based gradient boosting) is proposed for solving regression problems. The main idea behind the proposed AGBoost model is to assign attention weights with trainable parameters to iterations of GBM under condition that decision trees are base learners in GBM. Attention weights are determined by applying properties of decision trees and by using the Huber's contamination model which provides an interesting linear dependence between trainable parameters of the attention and the attention weights. This peculiarity allows us to train the attention weights by solving the standard quadratic optimization problem with linear constraints. The attention weights also depend on the discount factor as a tuning parameter, which determines how much the impact of the weight is decreased with the number of iterations. Numerical experiments performed for two types of base learners, original decision trees and extremely randomized trees with various regression datasets illustrate the proposed model.

artificial intelligence, attention weight, machine learning, (16 more...)

arXiv.org Machine Learning

doi: 10.23919/FRUCT54823.2022.9770928

2207.05724

Country:

Asia > Russia (0.14)
Oceania > Australia > Victoria > Melbourne (0.04)
North America > United States > New York > New York County > New York City (0.04)
Europe > Russia > Northwestern Federal District > Leningrad Oblast > Saint Petersburg (0.04)

Genre: Research Report > Experimental Study (0.70)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (1.00)

Add feedback

EmoSens: Emotion Recognition based on Sensor data analysis using LightGBM

S, Gayathri, Anand, Akshat, Vijayvargiya, Astha, M, Pushpalatha, Moorthy, Vaishnavi, Kumar, Sumit, S, Harichandana B S

arXiv.org Artificial IntelligenceJul-12-2022

Smart wearables have played an integral part in our day to day life. From recording ECG signals to analysing body fat composition, the smart wearables can do it all. The smart devices encompass various sensors which can be employed to derive meaningful information regarding the user's physical and psychological conditions. Our approach focuses on employing such sensors to identify and obtain the variations in the mood of a user at a given instance through the use of supervised machine learning techniques. The study examines the performance of various supervised learning models such as Decision Trees, Random Forests, XGBoost, LightGBM on the dataset. With our proposed model, we obtained a high recognition rate of 92.5% using XGBoost and LightGBM for 9 different emotion classes. By utilizing this, we aim to improvise and suggest methods to aid emotion recognition for better mental health analysis and mood monitoring.

accuracy, emotion, emotion recognition, (14 more...)

arXiv.org Artificial Intelligence

2207.1464

Country:

Asia > Taiwan (0.04)
North America > United States > New York (0.04)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
(4 more...)

Genre: Research Report (0.70)

Industry:

Health & Medicine > Therapeutic Area > Cardiology/Vascular Diseases (1.00)
Health & Medicine > Diagnostic Medicine (0.92)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Cognitive Science > Emotion (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (0.91)

Add feedback

Attention and Self-Attention in Random Forests

Utkin, Lev V., Konstantinov, Andrei V.

arXiv.org Machine LearningJul-9-2022

New models of random forests jointly using the attention and self-attention mechanisms are proposed for solving the regression problem. The models can be regarded as extensions of the attention-based random forest whose idea stems from applying a combination of the Nadaraya-Watson kernel regression and the Huber's contamination model to random forests. The self-attention aims to capture dependencies of the tree predictions and to remove noise or anomalous predictions in the random forest. The self-attention module is trained jointly with the attention module for computing weights. It is shown that the training process of attention weights is reduced to solving a single quadratic or linear optimization problem. Three modifications of the general approach are proposed and compared. A specific multi-head self-attention for the random forest is also considered. Heads of the self-attention are obtained by changing its tuning parameters including the kernel parameters and the contamination parameter of models. Numerical experiments with various datasets illustrate the proposed models and show that the supplement of the self-attention improves the model performance for many datasets.

artificial intelligence, machine learning, mechanism, (19 more...)

arXiv.org Machine Learning

2207.04293

Country:

Asia > Russia (0.14)
North America > United States > New York (0.04)
Europe > Russia > Northwestern Federal District > Leningrad Oblast > Saint Petersburg (0.04)

Genre:

Research Report > Experimental Study (0.93)
Research Report > New Finding (0.93)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.98)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.93)

Add feedback

ControlBurn: Nonlinear Feature Selection with Sparse Tree Ensembles

Liu, Brian, Xie, Miaolan, Yang, Haoyue, Udell, Madeleine

arXiv.org Machine LearningJul-8-2022

ControlBurn is a Python package to construct feature-sparse tree ensembles that support nonlinear feature selection and interpretable machine learning. The algorithms in this package first build large tree ensembles that prioritize basis functions with few features and then select a feature-sparse subset of these basis functions using a weighted lasso optimization criterion. The package includes visualizations to analyze the features selected by the ensemble and their impact on predictions. Hence ControlBurn offers the accuracy and flexibility of tree-ensemble models and the interpretability of sparse generalized additive models. ControlBurn is scalable and flexible: for example, it can use warm-start continuation to compute the regularization path (prediction error for any number of selected features) for a dataset with tens of thousands of samples and hundreds of features in seconds. For larger datasets, the runtime scales linearly in the number of samples and features (up to a log factor), and the package support acceleration using sketching. Moreover, the ControlBurn framework accommodates feature costs, feature groupings, and $\ell_0$-based regularizers. The package is user-friendly and open-source: its documentation and source code appear on https://pypi.org/project/ControlBurn/ and https://github.com/udellgroup/controlburn/.

artificial intelligence, controlburn, machine learning, (15 more...)

arXiv.org Machine Learning

2207.03935

Country:

Pacific Ocean > North Pacific Ocean > San Francisco Bay (0.04)
North America > United States > Wisconsin (0.04)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
(3 more...)

Genre: Research Report (0.50)

Industry: Health & Medicine > Health Care Providers & Services (0.95)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (0.47)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.46)

Add feedback

Hands-on Random Forest with Python

#artificialintelligenceJul-7-2022, 12:50:31 GMT

One model may make a wrong prediction. But if you combine the predictions of several models into one, you can make better predictions. This concept is called ensemble learning. Ensembles are methods that combine multiple models to build more powerful models. Ensemble methods have gained huge popularity during the last decade.

dataset, prediction, random forest, (15 more...)

#artificialintelligence

Country: North America > United States > Wisconsin (0.05)

Industry: Health & Medicine > Therapeutic Area (0.30)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (0.64)

Add feedback

Hands-on Random Forest with Python

#artificialintelligenceJul-6-2022, 12:18:03 GMT

Originally published on Towards AI the World's Leading AI and Technology News and Media Company. If you are building an AI-related product or service, we invite you to consider becoming an AI sponsor. At Towards AI, we help scale AI and technology startups. Let us help you unleash your technology to the masses. One model may make a wrong prediction.

dataset, prediction, random forest, (15 more...)

#artificialintelligence

Country: North America > United States > Wisconsin (0.05)

Industry: Health & Medicine > Therapeutic Area (0.30)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (0.63)

Add feedback

Random_Forest_Medium_Article

#artificialintelligenceJul-5-2022, 16:50:51 GMT

#artificialintelligence

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (0.65)
Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (0.57)

Add feedback