AITopics | Decision Tree Learning

Collaborating Authors

Decision Tree Learning

Learning to Classify with Branching Tests: "A decision tree takes as input an object or situation described by a set of properties, and outputs a yes/no decision. Decision trees therefore represent Boolean functions. Functions with a larger range of outputs can also be represented...."
– Artificial Intelligence: A Modern Approach. By Stuart Russell & Peter Norvig. 2002. Section 18.3; page 531.

News Overviews Instructional Materials AI-Alerts Classics

Best Machine Learning, Data Mining, & NLP Books for Data Scientists and Machine Learning Engineers

@machinelearnbotApr-13-2016, 15:33:10 GMT

Top Machine Learning & Data Mining Books - in this post, we have scraped various signals (e.g. We have combined all signals to compute the Quality Score for each book and publish the list of top Machine Learning and Data Mining books. The readers will love the list because it is data-driven & objective. This book is very well rated on Amazon website and is written by three professors from USC, Stanford and University of Washington. The three authors: Gareth James, Daniela Witten, & Trevor Hastie all have backgrounds in statistics.

artificial intelligence, decision tree learning, machine learning, (12 more...)

@machinelearnbot

Country:

Asia > Middle East > Israel > Jerusalem District > Jerusalem (0.06)
Oceania > New Zealand > North Island > Waikato (0.05)
North America > United States > New York (0.05)
(3 more...)

Genre: Summary/Review (0.50)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (0.30)

Add feedback

Accurate Sales Forecast for Data Analysts: Building a Random Forest model with Just SQL and Hivemall Treasure Data Blog

#artificialintelligenceApr-12-2016, 23:45:59 GMT

In this blog post, we will use Hivemall, the open source Machine Learning-on-SQL library available in the Treasure Data environment, to introduce the basics of machine learning. We will use an E-Commerce dataset from Kaggle, the data science competition platform. The first challenge is predicting the retail sales for the Rossman stores (the full details at Kaggle). We will use an ensemble learning technique known as Random Forest regression. Rossman is a pharmacy chain with over 3,000 stores in seven countries within Europe.

artificial intelligence, machine learning, training data, (13 more...)

#artificialintelligence

Country: Europe > Germany (0.05)

Industry: Marketing (0.30)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (0.75)
Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (0.65)

Add feedback

A Complete Tutorial on Tree Based Modeling from Scratch (in R & Python)

#artificialintelligenceApr-12-2016, 17:11:10 GMT

Tree based learning algorithms are considered to be one of the best and mostly used supervised learning methods. Tree based methods empower predictive models with high accuracy, stability and ease of interpretation. Unlike linear models, they map non-linear relationships quite well. They are adaptable at solving any kind of problem at hand (classification or regression). Methods like decision trees, random forest, gradient boosting are being popularly used in all kinds of data science problems. Hence, for every analyst (fresher also), it's important to learn these algorithms and use them for modeling. This tutorial is meant to help beginners learn tree based modeling from scratch. After the successful completion of this tutorial, one is expected to become proficient at using tree based algorithms and build predictive models. Note: This tutorial requires no prior knowledge of machine learning.

algorithm, artificial intelligence, machine learning, (18 more...)

#artificialintelligence

Genre: Instructional Material > Course Syllabus & Notes (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (0.92)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.48)

Add feedback

Confidence Decision Trees via Online and Active Learning for Streaming (BIG) Data

De Rosa, Rocco

arXiv.org Machine LearningApr-12-2016

Decision tree classifiers are a widely used tool in data stream mining. The use of confidence intervals to estimate the gain associated with each split leads to very effective methods, like the popular Hoeffding tree algorithm. From a statistical viewpoint, the analysis of decision tree classifiers in a streaming setting requires knowing when enough new information has been collected to justify splitting a leaf. Although some of the issues in the statistical analysis of Hoeffding trees have been already clarified, a general and rigorous study of confidence intervals for splitting criteria is missing. We fill this gap by deriving accurate confidence intervals to estimate the splitting gain in decision tree learning with respect to three criteria: entropy, Gini index, and a third index proposed by Kearns and Mansour. Our confidence intervals depend in a more detailed way on the tree parameters. We also extend our confidence analysis to a selective sampling setting, in which the decision tree learner adaptively decides which labels to query in the stream. We furnish theoretical guarantee bounding the probability that the classification is non-optimal learning the decision tree via our selective sampling strategy. Experiments on real and synthetic data in a streaming setting show that our trees are indeed more accurate than trees with the same number of leaves generated by other techniques and our active learning module permits to save labeling cost. In addition, comparing our labeling strategy with recent methods, we show that our approach is more robust and consistent respect all the other techniques applied to incremental decision trees.

artificial intelligence, decision tree learning, machine learning, (16 more...)

arXiv.org Machine Learning

1604.03278

Country:

Europe (0.28)
North America > United States (0.14)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Diagnosis (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (1.00)

Add feedback

Healthcare Data Analytics with Extreme Tree Models

#artificialintelligenceApr-11-2016, 20:25:39 GMT

Tree-based models provide robust first-cut solutions to such data. I introduce various kinds of trees and how they are different from each other. After understanding these trees, you can build better custom models of your own.

bioinformatics, data mining, machine learning, (4 more...)

#artificialintelligence

Industry:

Information Technology > Security & Privacy (0.63)
Health & Medicine > Consumer Health (0.63)

Technology:

Information Technology > Data Science > Data Mining (0.63)
Information Technology > Biomedical Informatics > Clinical Informatics (0.63)
Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (0.40)

Add feedback

Best Machine Learning, Data Mining, & NLP Books for Data Scientists and Machine Learning Engineers

@machinelearnbotApr-11-2016, 20:24:14 GMT

artificial intelligence, decision tree learning, machine learning, (12 more...)

@machinelearnbot

Country:

Asia > Middle East > Israel > Jerusalem District > Jerusalem (0.06)
Oceania > New Zealand > North Island > Waikato (0.05)
North America > United States > New York (0.05)
(3 more...)

Genre: Summary/Review (0.50)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (0.30)

Add feedback

Operational Machine Learning for Developers

#artificialintelligenceApr-11-2016, 13:45:55 GMT

Machine learning (ML) is the unsung hero that powers many applications, systems, sensors, devices, and products. Machine learning is so pervasive that we can often assume its presence in most of the applications and systems without having to specifically call it out. In simple terms, machine learning is a computer's ability to learn from data, and it is one of the most useful tools we have to develop intelligent systems and applications. Machine learning is used widely today for all kinds of tasks, from churn prediction in large companies, to web search, to medical diagnostics, to robotics. It's hard to find a field that cannot benefit from machine learning in one way or another.

artificial intelligence, machine learning, prediction, (15 more...)

#artificialintelligence

Country: Europe > Spain > Galicia > Madrid (0.05)

Genre: Instructional Material (0.48)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (0.50)

Add feedback

Top 3 Algorithms in Plain English - Dataconomy

#artificialintelligenceApr-11-2016, 06:50:30 GMT

In order to do this, C4.5 is given a set of data representing things that are already classified. A classifier is a tool in data mining that takes a bunch of data representing things we want to classify and attempts to predict which class the new data belongs to. Sure, suppose a dataset contains a bunch of patients. We know various things about each patient like age, pulse, blood pressure, VO2max, family history, etc. Now: Given these attributes, we want to predict whether the patient will get cancer. The patient can fall into 1 of 2 classes: will get cancer or won't get cancer.

artificial intelligence, decision tree learning, machine learning, (18 more...)

#artificialintelligence

Industry: Health & Medicine > Therapeutic Area (0.93)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (0.81)

Add feedback

Classification And Regression Trees for Machine Learning - Machine Learning Mastery

#artificialintelligenceApr-8-2016, 16:36:07 GMT

Decision Trees are an important type of algorithm for predictive modeling machine learning. The classical decision tree algorithms have been around for decades and modern variations like random forest are among the most powerful techniques available. In this post you will discover the humble decision tree algorithm known by it's more modern name CART which stands for Classification And Regression Trees. If you have taken an algorithms and data structures course, it might be hard to hold you back from implementing this simple and powerful algorithm. Classification And Regression Trees for Machine Learning Photo by Wonderlane, some rights reserved.

algorithm, artificial intelligence, machine learning, (16 more...)

#artificialintelligence

Technology: Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (1.00)

Add feedback

How to Bin or Convert Numerical Variables to Categorical Variables with Decision Trees

@machinelearnbotApr-7-2016, 18:10:20 GMT

This is a guest repost by Jacob Joseph from CleverTap. Why would you want to convert a numerical variable into categorical one? Depending on the situation, it can lead to a better interpretation of the numerical variable, quick segmentation or just an additional feature for building your predictive model by creating bins for the numerical variable. Binning is a popular feature engineering technique. Suppose your hypothesis is that the age of a customer is correlated with their tendency to interact with a mobile app.

artificial intelligence, decision tree learning, machine learning, (15 more...)

@machinelearnbot

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Diagnosis (0.48)
Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (0.48)

Add feedback