AITopics | Decision Tree Learning

Collaborating Authors

Decision Tree Learning

Learning to Classify with Branching Tests: "A decision tree takes as input an object or situation described by a set of properties, and outputs a yes/no decision. Decision trees therefore represent Boolean functions. Functions with a larger range of outputs can also be represented...."
– Artificial Intelligence: A Modern Approach. By Stuart Russell & Peter Norvig. 2002. Section 18.3; page 531.

News Overviews Instructional Materials AI-Alerts Classics

Playing with Continuous uncertainty in Decision Trees • /r/MachineLearning

@machinelearnbotMay-1-2016, 00:19:34 GMT

Classically, for decision trees we define a split or various "buckets" to transform continuous data into discrete data. The data I am currently processing has uncertainty associated with it (each data point comes from an aggregate set). As such, I might define a boundary- let's say N, where a data's uncertainty could place it in multiple buckets (say the parameter value N? Normally these boundaries are binary, but I was considering using the probability of these'overlapping instances' towards both buckets weighted by their respective probabilities. This doesn't seem to violate the entropy term (total probability will still sum to 1). However, I can't place half an instance within a branch- which would destroy the meaning behind the term.

continuous uncertainty, decision tree learning, machine learning, (5 more...)

@machinelearnbot

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (0.78)
Information Technology > Artificial Intelligence > Representation & Reasoning > Diagnosis (0.64)

Add feedback

Adaptive Concentration of Regression Trees, with Application to Random Forests

Wager, Stefan, Walther, Guenther

arXiv.org Machine LearningApr-30-2016

We study the convergence of the predictive surface of regression trees and forests. To support our analysis we introduce a notion of adaptive concentration for regression trees. This approach breaks tree training into a model selection phase in which we pick the tree splits, followed by a model fitting phase where we find the best regression model consistent with these splits. We then show that the fitted regression tree concentrates around the optimal predictor with the same splits: as d and n get large, the discrepancy is with high probability bounded on the order of sqrt(log(d) log(n)/k) uniformly over the whole regression surface, where d is the dimension of the feature space, n is the number of training examples, and k is the minimum leaf size for each tree. We also provide rate-matching lower bounds for this adaptive concentration statement. From a practical perspective, our result enables us to prove consistency results for adaptively grown forests in high dimensions, and to carry out valid post-selection inference in the sense of Berk et al. [2013] for subgroups defined by tree leaves.

adaptive concentration, artificial intelligence, machine learning, (15 more...)

arXiv.org Machine Learning

1503.06388

Country: North America > United States > California (0.28)

Genre: Research Report > New Finding (0.87)

Industry: Health & Medicine (0.93)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.48)

Add feedback

Dealing with Unbalanced Classes, SVMs, Random Forests, and Decision Trees in Python

#artificialintelligenceApr-29-2016, 22:23:45 GMT

So far I have talked about decision trees and ensembles. But I hope, I have made you understand the logic behind these concepts without getting too much into the mathematical details. In this post lets get into action, I will be implementing the concepts that we learned in these two blog posts. The only concept that I haven't discussed about is SVM. I suggest you to watch Professor Andrew Ng's week 7 videos on Coursera.

artificial intelligence, machine learning, random forest, (14 more...)

#artificialintelligence

Country: Europe > Portugal (0.05)

Industry:

Education (0.60)
Consumer Products & Services > Food, Beverage, Tobacco & Cannabis > Beverages (0.36)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (1.00)

Add feedback

When Does Deep Learning Work Better Than SVMs or Random Forests?

#artificialintelligenceApr-27-2016, 07:45:57 GMT

Guest blog by Sebastian Raschka, originally posted here. If we tackle a supervised learning problem, my advice is to start with the simplest hypothesis space first. I.e., try a linear model such as logistic regression. If this doesn't work "well" (i.e., it doesn't meet our expectation or performance criterion that we defined earlier), I would move on to the next experiment. I would say that random forests are probably THE "worry-free" approach - if such a thing exists in ML: There are no real hyperparameters to tune (maybe except for the number of trees; typically, the more trees we have the better).

artificial intelligence, machine learning, random forest, (15 more...)

#artificialintelligence

Country: North America > United States > Michigan (0.05)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (0.82)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.73)
Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (0.71)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.61)

Add feedback

Top Data Mining Algorithms Identified by IEEE & Related Python Resources

@machinelearnbotApr-25-2016, 03:40:17 GMT

IEEE International Conference on Data Mining identified 10 algorithms in 2006 using surveys from past winners and voting. This is a list of those algorithms a short description and related python resources. The detailed paper is given here. C4.5 is an algorithm used to generate a decision tree developed by Ross Quinlan. The decision trees generated by C4.5 can be used for classification, and for this reason, C4.5 is often referred to as a statistical classifier.

algorithm, artificial intelligence, machine learning, (10 more...)

@machinelearnbot

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.76)

Add feedback

Neural Random Forests

Biau, Gérard, Scornet, Erwan, Welbl, Johannes

arXiv.org Machine LearningApr-25-2016

Decision tree learning is a popular data-modeling technique that has been around for over fifty years in the fields of statistics, artificial intelligence, and machine learning. The approach and its innumerable variants have been 1 successfully involved in many challenges requiring classification and regression tasks, and it is no exaggeration to say that many modern predictive algorithms rely directly or indirectly on tree principles. What has greatly contributed to this success is the simplicity and transparency of trees, together with their ability to explain complex data sets. The monographs by Breiman et al. (1984), Devroye et al. (1996), Rokach and Maimon (2008), and Hastie et al. (2009) will provide the reader with introductions to the general subject area, both from a practical and theoretical perspective. The history of trees goes on today with random forests (Breiman, 2001), which are on the list of the most successful machine learning algorithms currently available to handle large-scale and high-dimensional data sets.

artificial intelligence, machine learning, neural network, (19 more...)

arXiv.org Machine Learning

1604.07143

Country:

Europe (0.46)
North America > United States (0.28)

Genre: Research Report (0.82)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (1.00)

Add feedback

Random Forest in Python

@machinelearnbotApr-24-2016, 13:35:04 GMT

Random Forest is a machine learning algorithm used for classification, regression, and feature selection. It's an ensemble technique, meaning it combines the output of one weaker technique in order to get a stronger result.

artificial intelligence, decision tree learning, random forest, (2 more...)

@machinelearnbot

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (0.88)
Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (0.88)

Add feedback

When Does Deep Learning Work Better Than SVMs or Random Forests?

#artificialintelligenceApr-22-2016, 18:35:54 GMT

If we tackle a supervised learning problem, my advice is to start with the simplest hypothesis space first. I.e., try a linear model such as logistic regression. If this doesn't work "well" (i.e., it doesn't meet our expectation or performance criterion that we defined earlier), I would move on to the next experiment. I would say that random forests are probably THE "worry-free" approach - if such a thing exists in ML: There are no real hyperparameters to tune (maybe except for the number of trees; typically, the more trees we have the better). On the contrary, there are a lot of knobs to be turned in SVMs: Choosing the "right" kernel, regularization penalties, the slack variable, ... Both random forests and SVMs are non-parametric models (i.e., the complexity grows as the number of training samples increases).

artificial intelligence, machine learning, random forest, (12 more...)

#artificialintelligence

Country: North America > United States > Michigan (0.05)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.63)

Add feedback

Bagging and Random Forest Ensemble Algorithms for Machine Studying

#artificialintelligenceApr-21-2016, 20:05:48 GMT

Random Forest is 1 of the most preferred and most highly effective equipment discovering algorithms. It is a style of ensemble equipment discovering algorithm referred to as Bootstrap Aggregation or bagging. In this publish you will explore the Bagging ensemble algorithm and the Random Forest algorithm for predictive modeling. This publish was published for developers and assumes no background in studies or mathematics. The publish focuses on how the algorithm performs and how to use it for predictive modeling complications.

algorithm, artificial intelligence, machine learning, (14 more...)

#artificialintelligence

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (1.00)

Add feedback

What Random Forests Tell Us About Democracy

#artificialintelligenceApr-21-2016, 09:50:49 GMT

A popular method for learning from large data sets is Random Forests (see my class on the topic, in Spanish). I would like to drive a paralellism between the way they work and our political decision structures and the so called Wisdom of the crowd. Random Forests are what is called an ensemble method as they perform better than individual methods by combining their results. The individual method used in Random Forests are Decision Trees, trained from a subset of all the available data (and because of this property of operating on subsets of the data, they are a good method for applying on large datasets). More interestingly, Random Forests (as discussed in the Machine Learning article by Leo Breiman in 2001), can not only train each of their trees on a subset of the data but also use a subset of the available information (features) when training each decision node in the tree.

artificial intelligence, decision tree learning, machine learning, (7 more...)

#artificialintelligence

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (1.00)

Add feedback