AITopics | Ensemble Learning

Collaborating Authors

Ensemble Learning

Ensemble methods use multiple learning algorithms to obtain better predictive performance than could be obtained from any of the constituent learning algorithms alone. (Wikipedia)

News Overviews Instructional Materials AI-Alerts Classics

Classification Using Tree Based Models

#artificialintelligenceJan-7-2017, 09:15:24 GMT

Machine Learning can sound very complicated, but anyone with a will to learn can successfully apply it, if they approach it from first principles. This course, Classification Using Tree Based Models, covers a specific class of Machine Learning problems - classification problems and how to solve these problems using Tree based models. First, you'll learn about building and visualizing decision trees as well as recognizing the serious problem of overfitting and its causes. Next, you'll learn about using ensemble learning to overcome overfitting. Finally, you'll explore 2 specific ensemble learning techniques - Random Forests and Gradient boosted trees By the end of this course, you'll be able to recognize opportunities where you can use Tree based models to solve classification problems and measure how well your solution is doing.

classification, decision tree learning, machine learning, (3 more...)

#artificialintelligence

Genre: Instructional Material > Course Syllabus & Notes (0.69)

Industry: Education > Focused Education > Special Education (0.32)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (0.69)
Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (0.69)

Add feedback

Introduction to Boosted Trees -- xgboost 0.6 documentation

#artificialintelligenceJan-1-2017, 02:15:22 GMT

Based on different understandings of \( y_i \) we can have different problems, such as regression, classification, ordering, etc. We need to find a way to find the best parameters given the training data. In order to do so, we need to define a so-called objective function, to measure the performance of the model given a certain set of parameters. A very important fact about objective functions is they must always contain two parts: training loss and regularization. The training loss measures how predictive our model is on training data.

artificial intelligence, documentation, machine learning, (8 more...)

#artificialintelligence

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.54)
Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (0.40)

Add feedback

Pruning Random Forests for Prediction on a Budget

Nan, Feng, Wang, Joseph, Saligrama, Venkatesh

Neural Information Processing SystemsDec-31-2016

We propose to prune a random forest (RF) for resource-constrained prediction. We first construct a RF and then prune it to optimize expected feature cost & accuracy. We pose pruning RFs as a novel 0-1 integer program with linear constraints that encourages feature re-use. We establish total unimodularity of the constraint set to prove that the corresponding LP relaxation solves the original integer program. We then exploit connections to combinatorial optimization and develop an efficient primal-dual algorithm, scalable to large datasets. In contrast to our bottom-up approach, which benefits from good RF initialization, conventional methods are top-down acquiring features based on their utility value and is generally intractable, requiring heuristics. Empirically, our pruning algorithm outperforms existing state-of-the-art resource-constrained algorithms.

artificial intelligence, constraint, machine learning, (19 more...)

Neural Information Processing Systems

Genre: Overview (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (0.85)

Add feedback

Quant Trading using Machine Learning - Udemy

#artificialintelligenceDec-29-2016, 23:40:19 GMT

Prerequisites: Working knowledge of Python is necessary if you want to run the source code that is provided. Basic knowledge of machine learning, especially ML classification techniques, would be helpful but it's not mandatory. Taught by a Stanford-educated, ex-Googler and an IIT, IIM - educated ex-Flipkart lead analyst. This team has decades of practical experience in quant trading, analytics and e-commerce. Completely Practical: This course has just enough theory to get you started with both Quant Trading and Machine Learning.

artificial intelligence, machine learning, quant trading, (9 more...)

#artificialintelligence

Industry:

Education > Educational Technology > Educational Software > Computer Based Training (0.40)
Education > Educational Setting > Online (0.40)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (0.41)
Information Technology > Enterprise Applications > Human Resources > Learning Management (0.40)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.35)

Add feedback

ggRandomForests: Exploring Random Forest Survival

Ehrlinger, John

arXiv.org Machine LearningDec-28-2016

Random forest (Leo Breiman 2001a) (RF) is a non-parametric statistical method requiring no distributional assumptions on covariate relation to the response. RF is a robust, nonlinear technique that optimizes predictive accuracy by fitting an ensemble of trees to stabilize model estimates. Random survival forests (RSF) (Ishwaran and Kogalur 2007; Ishwaran et al. 2008) are an extension of Breimans RF techniques allowing efficient nonparametric analysis of time to event data. The randomForestSRC package (Ishwaran and Kogalur 2014) is a unified treatment of Breimans random forest for survival, regression and classification problems. Predictive accuracy makes RF an attractive alternative to parametric models, though complexity and interpretability of the forest hinder wider application of the method. We introduce the ggRandomForests package, tools for visually understand random forest models grown in R (R Core Team 2014) with the randomForestSRC package. The ggRandomForests package is structured to extract intermediate data objects from randomForestSRC objects and generate figures using the ggplot2 (Wickham 2009) graphics package. This document is structured as a tutorial for building random forest for survival with the randomForestSRC package and using the ggRandomForests package for investigating how the forest is constructed. We analyse the Primary Biliary Cirrhosis of the liver data from a clinical trial at the Mayo Clinic (Fleming and Harrington 1991). Our aim is to demonstrate the strength of using Random Forest methods for both prediction and information retrieval, specifically in time to event data settings.

artificial intelligence, decision tree learning, machine learning, (18 more...)

arXiv.org Machine Learning

1612.08974

Country:

North America > United States > California > Alameda County > Berkeley (0.14)
Europe > Austria > Vienna (0.14)
North America > United States > New York (0.04)
(3 more...)

Genre: Research Report > Experimental Study (1.00)

Industry: Health & Medicine > Therapeutic Area > Nephrology (0.34)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.54)

Add feedback

Regression Machine Learning with Python - Udemy

#artificialintelligenceDec-25-2016, 01:45:29 GMT

It explores main concepts from basic to expert level which can help you achieve better grades, develop your academic career, apply your knowledge at work or make business forecasting related decisions. Read data files and perform regression machine learning operations by installing related packages and running code on the Python IDE. Approximate ensemble methods such as random forest regression and gradient boosting machine regression to enhance decision tree regression prediction accuracy. Read data files and perform regression machine learning operations by installing related packages and running code on the Python IDE. Approximate ensemble methods such as random forest regression and gradient boosting machine regression to enhance decision tree regression prediction accuracy.

artificial intelligence, machine learning, regression, (15 more...)

#artificialintelligence

Industry:

Education > Educational Technology > Educational Software > Computer Based Training (0.40)
Education > Educational Setting > Online (0.40)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (1.00)

Add feedback

Beginners Tutorial on XGBoost and Parameter Tuning in R

#artificialintelligenceDec-21-2016, 16:40:13 GMT

Last week, we learned about Random Forest Algorithm. Now we know it helps us reduce a model's variance by building models on resampled data and thereby increases its generalization capability.

artificial intelligence, classification, machine learning, (16 more...)

#artificialintelligence

Genre: Instructional Material > Course Syllabus & Notes (0.40)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.30)

Add feedback

Microsoft R Server 9.0 now available

#artificialintelligenceDec-21-2016, 08:20:15 GMT

Microsoft R Server 9.0, Microsoft's R distribution with added big-data, in-database, and integration capabilities, was released today and is now available for download to MSDN subscribers. This latest release is built on Microsoft R Open 3.3.2, This release includes a brand new R package for machine learning: MicrosoftML. This package provides state-of-the-art, fast and scalable machine learning algorithms for common data science tasks including featurization, classification and regression. Fast linear and logistic model functions based on the Stochastic Dual Coordinate Ascent method; Fast Forests, a random forest and quantile regression forest implementation based on FastRank, an efficient implementation of the MART gradient boosting algorithm; A neural network algorithm with support for custom, multilayer network topologies and GPU acceleration; One-class anomaly detection based on support vector machines. One-class anomaly detection based on support vector machines.

data mining, machine learning, server 9, (14 more...)

#artificialintelligence

Technology:

Information Technology > Data Science > Data Mining > Anomaly Detection (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Support Vector Machines (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (0.61)

Add feedback

Improving Predictions with Ensemble Model

@machinelearnbotDec-19-2016, 04:55:04 GMT

"Alone we can do so little and together we can do much" - a phrase from Helen Keller during 50's is a reflection of achievements and successful stories in real life scenarios from decades. Same thing applies with most of the cases from innovation with big impacts and with advanced technologies world. The machine Learning domain is also in the same race to make predictions and classification in a more accurate way using so called ensemble method and it is proved that ensemble modeling offers one of the most convincing way to build highly accurate predictive models. Ensemble methods are learning models that achieve performance by combining the opinions of multiple learners. Typically, an ensemble model is a supervised learning technique for combining multiple weak learners or models to produce a strong learner with the concept of Bagging and Boosting for data sampling.

artificial intelligence, learner, machine learning, (19 more...)

@machinelearnbot

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.56)
Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (0.52)

Add feedback

Predicting flu deaths with R

#artificialintelligenceDec-18-2016, 06:10:26 GMT

As Google learned, predicting the spread of influenza, even with mountains of data, is notoriously difficult. Nonetheless, bioinformatician and R user Shirin Glander has created a two-part tutorial about predicting flu deaths with R (part 2 here). The analysis is based on just 136 cases of influenza A H7N9 in China in 2013 (data provided in the outbreaks package) so the intent was not to create a generally predictive model, but by providing all of the R code and graphics Shirin has created a useful example of real-word predictive modeling with R. The tutorial covers loading and cleaning the data (including a nice example of using the mice package to impute missing values) and begins with some exploratory data visualizations. I was particularly impressed by the use of density charts (using the stat_density2d ggplot2 aesthetic) to highlight differences in the scatterplots of flu cases ending in death and recovery. Decision trees (implemented using rpart and visualized using fancyRpartPlot from the rattle package) Random Forests (using caret's "rf" training method) Elastic-Net Regularized Generalized Linear Models (using caret's "glmnet" training method) K-nearest neighbors clustering (using caret's "kknn" training method) Penalized Discriminant Analysis (using caret's "pda" training method) and in Part 2, Extreme gradient boosting using the xgboost package and various preprocessing techniques from the caret package Due to the limited data size, there's not too much difference between the models: in each case, 13-15 of the 23 cases were classified correctly.

artificial intelligence, machine learning, training method, (15 more...)

#artificialintelligence

Country: Asia > China (0.26)

Genre: Instructional Material (0.58)

Industry:

Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (1.00)
Health & Medicine > Therapeutic Area > Immunology (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Nearest Neighbor Methods (0.63)
Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (0.63)

Add feedback