AITopics | Ensemble Learning

Ensemble Learning

Ensemble methods use multiple learning algorithms to obtain better predictive performance than could be obtained from any of the constituent learning algorithms alone. (Wikipedia)
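The definition above can be illustrated with a minimal majority-vote sketch. The three classifiers, the `truth` table, and all function names are hypothetical and exist only for this illustration; each toy classifier is wrong on a different input, so each is right on two of three inputs while the vote is right on all three.

```python
from collections import Counter

# Hypothetical ground truth for three inputs (illustration only).
truth = {0: "a", 1: "b", 2: "a"}

def make_classifier(wrong_on):
    """Build a toy classifier that is correct except on one chosen input."""
    def classify(x):
        label = truth[x]
        if x == wrong_on:
            return "b" if label == "a" else "a"  # flip the label
        return label
    return classify

def majority_vote(labels):
    """Return the most common label among the constituent predictions."""
    return Counter(labels).most_common(1)[0][0]

def ensemble_predict(classifiers, x):
    """Combine the constituent classifiers by simple majority vote."""
    return majority_vote([clf(x) for clf in classifiers])

def accuracy(predict_fn):
    """Fraction of inputs in `truth` that predict_fn labels correctly."""
    return sum(predict_fn(x) == y for x, y in truth.items()) / len(truth)

# Each classifier errs on a different input.
classifiers = [make_classifier(i) for i in range(3)]
individual_accuracies = [accuracy(clf) for clf in classifiers]
ensemble_accuracy = accuracy(lambda x: ensemble_predict(classifiers, x))
```

The gain here depends on the constituents making different mistakes; three classifiers that all err on the same input would gain nothing from voting.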

Introduction to AdaBoost for Absolute Beginners - Analytics Vidhya

#artificialintelligence, Jan-31-2022, 08:15:57 GMT

This article was published as a part of the Data Science Blogathon. AdaBoost stands for Adaptive Boosting. It is a statistical classification algorithm that forms a committee of weak classifiers and thereby boosts the performance of machine learning algorithms.

adaboost classifier, analytic vidhya, classifier, (6 more...)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.92)
Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (0.56)
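The "committee of weak classifiers" idea in the excerpt above can be sketched as discrete AdaBoost over one-dimensional decision stumps. This is a minimal illustration under assumptions, not the linked article's code: the function names, the toy data, and the choice of stumps as weak learners are all hypothetical.

```python
import math

def stump_predict(x, threshold, polarity):
    """Weak classifier: returns polarity if x > threshold, otherwise -polarity."""
    return polarity if x > threshold else -polarity

def fit_adaboost(xs, ys, n_rounds=3):
    """Discrete AdaBoost on a 1-D feature with labels in {-1, +1}."""
    n = len(xs)
    weights = [1.0 / n] * n  # start with uniform sample weights
    values = sorted(set(xs))
    # Candidate thresholds: one below the minimum, plus midpoints between values.
    thresholds = [values[0] - 1.0] + [(a + b) / 2 for a, b in zip(values, values[1:])]
    committee = []
    for _ in range(n_rounds):
        # Pick the stump with the lowest weighted training error.
        best = None
        for t in thresholds:
            for polarity in (1, -1):
                err = sum(w for x, y, w in zip(xs, ys, weights)
                          if stump_predict(x, t, polarity) != y)
                if best is None or err < best[0]:
                    best = (err, t, polarity)
        err, t, polarity = best
        err = min(max(err, 1e-10), 1 - 1e-10)  # keep the logarithm finite
        alpha = 0.5 * math.log((1 - err) / err)  # this stump's vote weight
        committee.append((alpha, t, polarity))
        # Increase the weight of misclassified samples, then renormalize.
        weights = [w * math.exp(-alpha * y * stump_predict(x, t, polarity))
                   for x, y, w in zip(xs, ys, weights)]
        total = sum(weights)
        weights = [w / total for w in weights]
    return committee

def predict_adaboost(committee, x):
    """Sign of the alpha-weighted vote of all stumps in the committee."""
    score = sum(alpha * stump_predict(x, t, p) for alpha, t, p in committee)
    return 1 if score >= 0 else -1

# Toy data that no single stump can classify perfectly (illustration only).
xs = list(range(10))
ys = [1, 1, 1, -1, -1, -1, 1, 1, 1, -1]
committee = fit_adaboost(xs, ys, n_rounds=3)
train_predictions = [predict_adaboost(committee, x) for x in xs]
```

The best single stump misclassifies three of the ten points on this data, while the weighted vote of three stumps fits the training set, which is the "adaptive" reweighting at work.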

Building a Random Forest Classifier to Predict Neural Spikes

#artificialintelligence, Jan-30-2022, 13:25:16 GMT

A step-by-step guide to building a Random Forest classifier in Python to predict subtypes of neural extracellular spikes using a real dataset recorded from human brain organoids. Given the heterogeneity of neurons within the human brain, classification tools are commonly used to correlate electrical activity with different cell types and/or morphologies. This is a long-standing question in neuroscience circles, and the answer can vary considerably between species, pathologies, brain regions and layers. Fortunately, as steadily increasing computational power drives improvements in machine-learning and deep-learning algorithms, neuroscientists have the tools to dig further into these important questions. However, as stated by Juavinett et al., programming skills are for the most part underrepresented in the community, and new resources to teach them are crucial to addressing the complexity of the human brain.

algorithm, predict neural spike, random forest classifier, (1 more...)

Industry: Health & Medicine > Therapeutic Area > Neurology (0.98)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (0.88)
Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (0.75)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.61)
(2 more...)

Geometry- and Accuracy-Preserving Random Forest Proximities

Rhodes, Jake S., Cutler, Adele, Moon, Kevin R.

arXiv.org Machine Learning, Jan-29-2022

Abstract: Random forests are considered one of the best out-of-the-box classification and regression algorithms due to their high level of predictive performance with relatively little tuning. Pairwise proximities can be computed from a trained random forest which measure the similarity between data points relative to the supervised task. Random forest proximities have been used in many applications including the identification of variable importance, data imputation, outlier detection, and data visualization. However, existing definitions of random forest proximities do not accurately reflect the data geometry learned by the random forest. In this paper, we introduce a novel definition of random forest proximities called Random Forest-Geometry- and Accuracy-Preserving proximities (RF-GAP). We prove that the proximity-weighted sum (regression) or majority vote (classification) using RF-GAP exactly matches the out-of-bag random forest prediction, thus capturing the data geometry learned by the random forest. We empirically show that this improved geometric representation outperforms traditional random forest proximities in tasks such as data imputation and provides outlier detection and visualization results consistent with the learned data geometry. Random forests [1] are well-known, powerful predictors comprised of an ensemble of binary recursive decision trees. Random forests are easily adapted for both classification and regression, are trivially parallelizable, can ... Random forest proximity was first defined by Leo Breiman as the proportion of trees in which the observations reside in the same terminal node [16].

proximity, random forest, random forest proximity, (15 more...)

2201.12682

Country:

Europe > Austria > Vienna (0.14)
Oceania > Australia > Australian Capital Territory > Canberra (0.04)
North America > United States > Utah (0.04)
(6 more...)

Genre: Research Report > New Finding (0.67)

Industry:

Health & Medicine > Therapeutic Area > Neurology (1.00)
Health & Medicine > Pharmaceuticals & Biotechnology (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.34)
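The traditional proximity mentioned in the abstract above (the proportion of trees in which two observations land in the same terminal node) can be sketched as follows. This shows only that original definition, not RF-GAP, whose exact formula is not given in the excerpt. The `leaf_ids` table is a hypothetical stand-in for the terminal-node assignments of a trained forest.

```python
def breiman_proximity(leaf_ids):
    """Proportion of trees in which each pair of samples shares a terminal node.

    leaf_ids[t][i] is the terminal-node id of sample i in tree t.
    Returns an n_samples x n_samples matrix as a list of lists.
    """
    n_trees = len(leaf_ids)
    n_samples = len(leaf_ids[0])
    counts = [[0] * n_samples for _ in range(n_samples)]
    for tree in leaf_ids:
        for i in range(n_samples):
            for j in range(n_samples):
                if tree[i] == tree[j]:
                    counts[i][j] += 1  # samples i and j co-occur in a leaf
    return [[c / n_trees for c in row] for row in counts]

# Hypothetical leaf assignments: 4 trees (rows) by 3 samples (columns).
leaf_ids = [
    [0, 0, 1],
    [2, 2, 3],
    [4, 4, 4],
    [5, 6, 5],
]
prox = breiman_proximity(leaf_ids)
```

With scikit-learn, comparable assignments can be obtained from a fitted forest's `apply` method, which returns one row per sample and one column per tree, so that array would need to be transposed before being passed to this function.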

H2O.ai

#artificialintelligence, Jan-25-2022, 03:15:09 GMT

H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensemble...

learning

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Support Vector Machines (0.97)

MeltpoolNet: Melt pool Characteristic Prediction in Metal Additive Manufacturing Using Machine Learning

Akbari, Parand, Ogoke, Francis, Kao, Ning-Yu, Meidani, Kazem, Yeh, Chun-Yu, Lee, William, Farimani, Amir Barati

arXiv.org Artificial Intelligence, Jan-25-2022

Characterizing meltpool shape and geometry is essential in metal Additive Manufacturing (MAM) to control the printing process and avoid defects. Predicting meltpool flaws based on process parameters and powder material is difficult due to the complex nature of the MAM process. Machine learning (ML) techniques can be useful in connecting process parameters to the type of flaws in the meltpool. In this work, we introduce a comprehensive framework for benchmarking ML for melt pool characterization. An extensive experimental dataset has been collected from more than 80 MAM articles containing MAM processing conditions, materials, meltpool dimensions, meltpool modes and flaw types. We introduce physics-aware MAM featurization, versatile ML models, and evaluation metrics to create a comprehensive learning framework for meltpool defect and geometry prediction. This benchmark can serve as a basis for melt pool control and process optimization. In addition, data-driven explicit models have been identified to estimate meltpool geometry from process parameters and material properties; these outperform Rosenthal estimation for meltpool geometry while maintaining interpretability.

additive manufacturing, dataset, featurization, (15 more...)

2201.11662

Country:

Europe > United Kingdom (0.04)
North America > United States > Texas > Travis County > Austin (0.04)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
Asia > Middle East > Republic of Türkiye > Karaman Province > Karaman (0.04)

Genre: Research Report (0.83)

Industry:

Materials > Metals & Mining (1.00)
Machinery > Industrial Machinery (0.88)
Government (0.67)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (0.71)

Model Generalization in Arrival Runway Occupancy Time Prediction by Feature Equivalences

Nguyen, An-Dan, Pham, Duc-Thinh, Lilith, Nimrod, Alam, Sameer

arXiv.org Artificial Intelligence, Jan-25-2022

General real-time runway occupancy time prediction modelling for multiple airports is a current research gap. This paper presents an attempt to generalize a real-time prediction model for Arrival Runway Occupancy Time (AROT) by substituting categorical features with their numerical equivalences. Three days of data, collected from Saab Sensis' Aerobahn system at three US airports, were used for this work. Three tree-based machine learning algorithms (Decision Tree, Random Forest and Gradient Boosting) are used to assess the generalizability of the model using numerical equivalent features. We have shown that models trained on numerical equivalent features not only perform at least on par with models trained on categorical features but can also make predictions on unseen data from other airports.

airport, numerical equivalence, prediction model, (12 more...)

2201.11654

Country:

North America > United States > Virginia > Fairfax County > McLean (0.04)
North America > United States > Massachusetts (0.04)
North America > United States > Arizona > Maricopa County > Phoenix (0.04)
(4 more...)

Genre: Research Report (1.00)

Industry:

Transportation > Infrastructure & Services > Airport (1.00)
Transportation > Air (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (0.55)
Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (0.55)

Regression Analysis for Beginners -- Part 2

#artificialintelligence, Jan-21-2022, 12:05:31 GMT

What is Extreme Gradient Boosting?

beginner, part 2, regression analysis, (8 more...)

Genre:

Research Report > New Finding (0.40)
Research Report > Experimental Study (0.40)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (0.67)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.40)

Machine Learning : Random Forest with Python from Scratch

#artificialintelligence, Jan-18-2022, 15:24:39 GMT

Are you ready to start your path to becoming a Machine Learning expert? Are you ready to train your machine like a father trains his son? "A breakthrough in Machine Learning would be worth ten Microsofts." -Bill Gates There are lots of courses and lectures out there regarding random forest. After taking this course, the curtains of machine learning, and especially random forest, will be lifted for you. You'll be learning a state-of-the-art algorithm in detail with practical implementation.

machine learning, python, random forest, (4 more...)

Country: North America > United States (0.06)

Genre: Instructional Material > Course Syllabus & Notes (1.00)

Industry: Education (0.40)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (0.96)
Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (0.96)

Fraud Detection with EvalML

#artificialintelligence, Jan-17-2022, 08:05:40 GMT

Data analytics has had a great impact on the banking and financial services industry, for example by providing insights into global financial trends and financial modelling. Fraud prevention and detection is one such application. This article applies predictive data analytics and supervised machine learning (ML) methods to card-not-present (CNP) fraud detection and demonstrates modelling using EvalML, an automated machine learning library. It also finds that both Decision Tree (DT) and XGBoost models work better than Linear models (LM), Random Forest (RF) and LightGBM models. The dataset used to demonstrate modelling is a large-scale dataset from Vesta, which is available on Kaggle.

dataset, evalml, fraud detection

Industry:

Banking & Finance > Financial Services (1.00)
Law Enforcement & Public Safety > Fraud (0.69)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (0.72)
Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (0.72)

What's so special about CatBoost?

#artificialintelligence, Jan-12-2022, 05:45:52 GMT

CatBoost is based on gradient boosting. This new machine learning technique, developed by Yandex, outperforms many existing boosting algorithms such as XGBoost and LightGBM. While deep learning algorithms require lots of data and computational power, boosting algorithms are still needed for most business problems. However, boosting algorithms like XGBoost take hours to train, and sometimes you'll get frustrated…

algorithm, catboost

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.87)
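The excerpt above says CatBoost is based on gradient boosting. The sketch below shows only the generic idea for squared error (each round fits a small regression stump to the current residuals and adds a damped copy of it to the model). It is not CatBoost's actual algorithm or API; every name, the toy data, and the hyperparameters are assumptions made for illustration.

```python
def fit_stump(xs, targets):
    """Least-squares regression stump on a 1-D feature: (threshold, left, right)."""
    values = sorted(set(xs))
    best = None
    for a, b in zip(values, values[1:]):
        t = (a + b) / 2
        left = [r for x, r in zip(xs, targets) if x <= t]
        right = [r for x, r in zip(xs, targets) if x > t]
        left_mean = sum(left) / len(left)
        right_mean = sum(right) / len(right)
        sse = (sum((r - left_mean) ** 2 for r in left)
               + sum((r - right_mean) ** 2 for r in right))
        if best is None or sse < best[0]:
            best = (sse, t, left_mean, right_mean)
    return best[1:]

def fit_gradient_boosting(xs, ys, n_rounds=20, learning_rate=0.5):
    """Gradient boosting for squared error: repeatedly fit stumps to residuals."""
    base = sum(ys) / len(ys)  # initial constant prediction
    preds = [base] * len(ys)
    stumps = []
    for _ in range(n_rounds):
        # For squared error, the negative gradient is the residual y - prediction.
        residuals = [y - p for y, p in zip(ys, preds)]
        t, left, right = fit_stump(xs, residuals)
        stumps.append((t, left, right))
        preds = [p + learning_rate * (left if x <= t else right)
                 for x, p in zip(xs, preds)]
    return base, learning_rate, stumps

def predict_boosted(model, x):
    """Base value plus the damped sum of all stump outputs."""
    base, learning_rate, stumps = model
    return base + learning_rate * sum(left if x <= t else right
                                      for t, left, right in stumps)

def mse(preds, ys):
    """Mean squared error between predictions and targets."""
    return sum((p - y) ** 2 for p, y in zip(preds, ys)) / len(ys)

# Toy regression data (illustration only).
xs = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]
ys = [1.0, 1.0, 2.0, 4.0, 5.0, 5.0]
model = fit_gradient_boosting(xs, ys)
baseline_mse = mse([sum(ys) / len(ys)] * len(ys), ys)
boosted_mse = mse([predict_boosted(model, x) for x in xs], ys)
```

Production libraries differ from this sketch in how trees are grown, how categorical features are handled, and how overfitting is controlled.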
