AITopics | Ensemble Learning

Collaborating Authors

Ensemble Learning

Ensemble methods use multiple learning algorithms to obtain better predictive performance than could be obtained from any of the constituent learning algorithms alone. (Wikipedia)

News Overviews Instructional Materials AI-Alerts Classics

How to Tune the Number and Size of Decision Trees with XGBoost in Python - Machine Learning Mastery

#artificialintelligenceSep-7-2016, 02:05:38 GMT

Gradient boosting involves the creation and addition of decision trees sequentially, each attempting to correct the mistakes of the learners that came before it. This raises the question as to how many trees (weak learners or estimators) to configure in your gradient boosting model and how big each tree should be. In this post you will discover how to design a systematic experiment to select the number and size of decision trees to use on your problem. How to Tune the Number and Size of Decision Trees with XGBoost in Python Photo by USFWSmidwest, some rights reserved. XGBoost is the high performance implementation of gradient boosting that you can now access directly in Python.

artificial intelligence, decision tree learning, machine learning, (15 more...)

#artificialintelligence

Genre: Instructional Material > Course Syllabus & Notes (0.50)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Diagnosis (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (1.00)

Add feedback

How to Best Tune Multithreading Support for XGBoost in Python - Machine Learning Mastery

#artificialintelligenceSep-4-2016, 20:15:36 GMT

The XGBoost library for gradient boosting uses is designed for efficient multi-core parallel processing. This allows it to efficiently use all of the CPU cores in your system when training. In this post you will discover the parallel processing capabilities of the XGBoost in Python. How to Best Tune Multithreading Support for XGBoost in Python Photo by Nicholas A. Tonelli, some rights reserved. XGBoost is the high performance implementation of gradient boosting that you can now access directly in Python.

artificial intelligence, machine learning, xgboost, (14 more...)

#artificialintelligence

Genre: Instructional Material > Course Syllabus & Notes (0.50)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (1.00)

Add feedback

Practical XGBoost in Python

#artificialintelligenceSep-4-2016, 20:15:33 GMT

For the sake of reproducibility, I'm giving you access to personalized Docker image for provisioning the environment. You should be able to run it on your operating system. If you don't want to (or can't) you will have to install all the required libraries manually. You should also have Git installed to download necessary course materials. The course starts now and never ends!

artificial intelligence, machine learning, practical xgboost, (1 more...)

#artificialintelligence

Genre: Instructional Material (0.32)

Industry: Education > Educational Setting > Online (0.37)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.40)
Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (0.40)

Add feedback

Feature Importance and Feature Selection With XGBoost in Python - Machine Learning Mastery

#artificialintelligenceAug-30-2016, 19:41:13 GMT

A benefit of using ensembles of decision tree methods like gradient boosting is that they can automatically provide estimates of feature importance from a trained predictive model. In this post you will discover how you can estimate the importance of features for a predictive modeling problem using the XGBoost library in Python. Feature Importance and Feature Selection With XGBoost in Python Photo by Keith Roper, some rights reserved. XGBoost is the high performance implementation of gradient boosting that you can now access directly in Python. A benefit of using gradient boosting is that after the boosted trees are constructed, it is relatively straightforward to retrieve importance scores for each attribute.

artificial intelligence, feature importance, machine learning, (14 more...)

#artificialintelligence

Genre: Instructional Material > Course Syllabus & Notes (0.50)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (1.00)

Add feedback

Dataiku Offers No-Code Machine Learning -- ADTmag

#artificialintelligenceAug-29-2016, 07:11:07 GMT

We've covered a lot of low-code/no-code development tools, but they've mostly been relegated to simple app creation, not complicated, cutting-edge technology like machine learning (ML) -- until now. That's because predictive analytics specialist Dataiku has updated its Dataiku Data Science Studio (DSS) platform to Version 3.1, which it said "unleashes visual machine learning." Dataiku has introduced five new back-ends for visual ML development with no need for those hard-to-find, expensive developer types well versed in writing programming code. Much like the numerous mobile development back-ends that take care of tedious chores such as external integrations, database access and so on, those new ML backstops help create predictive ML models. "Dataiku DSS 3.1 introduces new visual machine learning engines that allow users to create incredibly powerful predictive applications within a code-free interface," the company said in a statement this week.

artificial intelligence, dataiku dss 3, dataiku offer no-code machine learning, (8 more...)

#artificialintelligence

Technology: Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (0.33)

Add feedback

Random Forest for Label Ranking

Zhou, Yangming, Qiu, Guoping

arXiv.org Machine LearningAug-27-2016

Label ranking aims to learn a mapping from instances to rankings over a finite number of predefined labels. Random forest is a powerful and one of the most successfully general-purpose machine learning algorithms of modern times. In the literature, there seems no research has yet been done in applying random forest to label ranking. In this paper, We present a powerful random forest label ranking method which uses random decision trees to retrieve nearest neighbors that are not only similar in the feature space but also in the ranking space. We have developed a novel two-step rank aggregation strategy to effectively aggregate neighboring rankings discovered by the random forest into a final predicted ranking. Compared with existing methods, the new random forest method has many advantages including its intrinsically scalable tree data structure, highly parallel-able computational architecture and much superior performances. We present extensive experimental results to demonstrate that our new method achieves the best predictive accuracy performances compared with state-of-the-art methods for datasets with complete ranking and datasets with only partial ranking information.

algorithm, artificial intelligence, machine learning, (18 more...)

arXiv.org Machine Learning

1608.0771

Country:

Europe > United Kingdom > England > Nottinghamshire > Nottingham (0.14)
North America > United States > Wisconsin (0.04)
Europe > France (0.04)
(2 more...)

Genre: Research Report > New Finding (0.68)

Industry: Education (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.48)

Add feedback

Random Forest Tutorial: Predicting Crime in San Francisco

#artificialintelligenceAug-25-2016, 22:56:39 GMT

Announcement: Layman Tutorials for Data Science site Annalyzin is now called Algobeans! We're creating a new mailing list to deliver tutorials to your inbox. If you like to be included, sign up below. If you're already subscribed, signing up to this new mailing list will remove you from the old one. Can several wrongs make a right?

artificial intelligence, decision tree, machine learning, (15 more...)

#artificialintelligence

Country: North America > United States > California > San Francisco County > San Francisco (0.52)

Industry: Law Enforcement & Public Safety > Crime Prevention & Enforcement (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (0.75)
Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (0.64)

Add feedback

A Gentle Introduction to XGBoost for Applied Machine Learning - Machine Learning Mastery

#artificialintelligenceAug-24-2016, 03:25:58 GMT

When getting started with a new tool like XGBoost, it can be helpful to review a few talks on the topic before diving into the code. Tianqi Chen, the creator of the library gave a talk to the LA Data Science group in June 2016 titled "XGBoost: A Scalable Tree Boosting System". There is more information on the DataScience LA blog. Tong He, a contributor to XGBoost for the R interface gave a talk at the NYC Data Science Academy in December 2015 titled "XGBoost: eXtreme Gradient Boosting". There is more information about this talk on the NYC Data Science Academy blog.

artificial intelligence, machine learning, xgboost, (12 more...)

#artificialintelligence

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (1.00)

Add feedback

Implementing a Weighted Majority Rule Ensemble Classifier

#artificialintelligenceAug-22-2016, 03:10:12 GMT

If you are interested in using the EnsembleClassifier, please note that it is now also available through scikit learn ( 0.17) as VotingClassifier. Here, I want to present a simple and conservative approach of implementing a weighted majority rule ensemble classifier in scikit-learn that yielded remarkably good results when I tried it in a kaggle competition. For me personally, kaggle competitions are just a nice way to try out and compare different approaches and ideas – basically an opportunity to learn in a controlled environment with nice datasets. Of course, there are other implementations of more sophisticated ensemble methods in scikit-learn, such as bagging classifiers, random forests, or the famous AdaBoost algorithm. However, as far as I am concerned, they all require the usage of a common "base classifier." In contrast, my motivation for the following approach was to combine conceptually different machine learning classifiers and use a majority vote rule.

artificial intelligence, classifier, machine learning, (14 more...)

#artificialintelligence

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (0.57)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.36)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.31)

Add feedback

Data Preparation for Gradient Boosting with XGBoost in Python - Machine Learning Mastery

#artificialintelligenceAug-21-2016, 20:10:29 GMT

XGBoost is a popular implementation of Gradient Boosting because of its speed and performance. Internally, XGBoost models represent all problems as a regression predictive modeling problem that only takes numerical values as input. If your data is in a different form, it must be prepared into the expected format. In this post you will discover how to prepare your data for using with gradient boosting with the XGBoost library in Python. Data Preparation for Gradient Boosting with XGBoost in Python Photo by Ed Dunens, some rights reserved.

artificial intelligence, machine learning, xgboost, (16 more...)

#artificialintelligence

Genre: Instructional Material > Course Syllabus & Notes (0.68)

Industry: Health & Medicine (0.32)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (1.00)

Add feedback