AITopics | Decision Tree Learning

Collaborating Authors

Decision Tree Learning

Learning to Classify with Branching Tests: "A decision tree takes as input an object or situation described by a set of properties, and outputs a yes/no decision. Decision trees therefore represent Boolean functions. Functions with a larger range of outputs can also be represented...."
– Artificial Intelligence: A Modern Approach. By Stuart Russell & Peter Norvig. 2002. Section 18.3; page 531.

News Overviews Instructional Materials AI-Alerts Classics

Predicting Loan Credit Risk using Apache Spark Machine Learning Random Forests

#artificialintelligenceJul-16-2016, 05:05:27 GMT

Let's go through an example of Credit Risk for Bank Loans: Decision trees create a model that predicts the class or label based on several input features. Decision trees work by evaluating an expression containing a feature at every node and selecting a branch to the next node based on the answer. A possible decision tree for predicting Credit Risk is shown below. The feature questions are the nodes, and the answers "yes" or "no" are the branches in the tree to the child nodes. Our data is from the German Credit Data Set which classifies people described by a set of attributes as good or bad credit risks.

artificial intelligence, credit risk, decision tree learning, (14 more...)

#artificialintelligence

Industry:

Banking & Finance > Risk Management (1.00)
Banking & Finance > Credit (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (1.00)

Add feedback

Rapid Prediction of Player Retention in Free-to-Play Mobile Games

Drachen, Anders, Lundquist, Eric Thurston, Kung, Yungjen, Rao, Pranav Simha, Klabjan, Diego, Sifa, Rafet, Runge, Julian

arXiv.org Machine LearningJul-11-2016

Predicting and improving player retention is crucial to the success of mobile Free-to-Play games. This paper explores the problem of rapid retention prediction in this context. Heuristic modeling approaches are introduced as a way of building simple rules for predicting short-term retention. Compared to common classification algorithms, our heuristic-based approach achieves reasonable and comparable performance using information from the first session, day, and week of player activity.

artificial intelligence, machine learning, prediction, (18 more...)

arXiv.org Machine Learning

1607.03202

Genre: Research Report (0.82)

Industry: Leisure & Entertainment > Games > Computer Games (0.93)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (0.95)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.69)
Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (0.50)

Add feedback

Top Machine Learning, Data Mining, & NLP Books for Data Scientists and Machine Learning Engineers

#artificialintelligenceJul-10-2016, 18:10:36 GMT

Top Machine Learning & Data Mining Books - in this post, we have scraped various signals (e.g. We have combined all signals to compute the Quality Score for each book and publish the list of top Machine Learning and Data Mining books. The readers will love the list because it is data-driven & objective. This book is very well rated on Amazon website and is written by three professors from USC, Stanford and University of Washington. The book's authors: Gareth James, Daniela Witten, Trevor Hastie, & Rob Tibshirani all have backgrounds in statistics.

artificial intelligence, book review, machine learning, (13 more...)

#artificialintelligence

Country:

Asia > Middle East > Israel > Jerusalem District > Jerusalem (0.06)
Oceania > New Zealand > North Island > Waikato (0.05)
North America > United States > New York (0.05)
(3 more...)

Genre: Summary/Review (0.50)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (0.30)

Add feedback

Random Forest From Top To Bottom

#artificialintelligenceJul-7-2016, 07:45:27 GMT

In three months (as of June 2016) the New Orleans Saints will play a football game against the Atlanta Falcons. I want to know who will win. I ask my friend and he says the Saints. Technically this is a predictive model, but it's probably not worth much. I can improve upon this model by asking other people who they think will win.

artificial intelligence, decision tree learning, machine learning, (15 more...)

#artificialintelligence

Country: North America > United States > Louisiana > Orleans Parish > New Orleans (0.25)

Industry: Leisure & Entertainment > Sports > Football (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (1.00)

Add feedback

Forest Floor Visualizations of Random Forests

Welling, Soeren H., Refsgaard, Hanne H. F., Brockhoff, Per B., Clemmensen, Line H.

arXiv.org Machine LearningJul-4-2016

We propose a novel methodology, forest floor, to visualize and interpret random forest (RF) models. RF is a popular and useful tool for non-linear multi-variate classification and regression, which yields a good trade-off between robustness (low variance) and adaptiveness (low bias). Direct interpretation of a RF model is difficult, as the explicit ensemble model of hundreds of deep trees is complex. Nonetheless, it is possible to visualize a RF model fit by its mapping from feature space to prediction space. Hereby the user is first presented with the overall geometrical shape of the model structure, and when needed one can zoom in on local details. Dimensional reduction by projection is used to visualize high dimensional shapes. The traditional method to visualize RF model structure, partial dependence plots, achieve this by averaging multiple parallel projections. We suggest to first use feature contributions, a method to decompose trees by splitting features, and then subsequently perform projections. The advantages of forest floor over partial dependence plots is that interactions are not masked by averaging. As a consequence, it is possible to locate interactions, which are not visualized in a given projection. Furthermore, we introduce: a goodness-of-visualization measure, use of colour gradients to identify interactions and an out-of-bag cross validated variant of feature contributions.

artificial intelligence, feature contribution, machine learning, (17 more...)

arXiv.org Machine Learning

1605.09196

Country:

North America > United States (0.28)
Europe > Austria (0.28)

Genre: Research Report (0.40)

Industry:

Health & Medicine (0.68)
Leisure & Entertainment (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (0.85)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.46)

Add feedback

ledell/useR-machine-learning-tutorial

#artificialintelligenceJul-1-2016, 11:11:03 GMT

Instructions for how to install the neccessary software for this tutorial is available here. Data for the tutorial can be downloaded by running ./data/get-data.sh (requires wget). Certain algorithms don't scale well when there are millions of features. For example, decision trees require computing some sort of metric (to determine the splits) on all the feature values (or some fraction of the values as in Random Forest and Stochastic GBM). Therefore, computation time is linear in the number of features. Algorithms can deal with data sparsity (where many of the feature values are zero) in different ways.

algorithm, artificial intelligence, machine learning, (9 more...)

#artificialintelligence

Genre: Instructional Material > Course Syllabus & Notes (0.37)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (0.60)
Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (0.40)

Add feedback

Deal: The Complete Machine Learning Bundle for 39.99 - 6/30/16 Androidheadlines.com

#artificialintelligenceJun-30-2016, 12:51:43 GMT

Lately in the tech world, everything has been revolving around artificial intelligence and machine learning. If you've been interested in learning more about machine learning and getting in on it as well, now you can. Thanks to this fantastic bundle that we have available on the Android Headlines Store. This bundle features 10 courses, over 400 lessons and is discounted by 94% right now. Included in the bundle is Quant Trading Using Machine Learning, Learn by Example: Statistics and Data Science in R, Learn by Example: Hadoop & MapReduce for Big Data Problems, Byte Size Chunks: Java Object-Oriented Programming & Design, An Introduction to Machine Learning & NLP in Python, Byte-Sized-Chunks: Twitter Sentiment Analysis (In Python), Byte-Sized-Chunks: Decision Trees and Random Forests, An Introduction to Deep Learning & Computer Vision, Byte-Sized-Chunks: Recommendation Systems, and From 0 to 1: Learn Python Programming – Easy as Pie.

artificial intelligence, data mining, object-oriented architecture, (7 more...)

#artificialintelligence

Technology:

Information Technology > Data Science > Data Mining > Big Data (0.62)
Information Technology > Artificial Intelligence > Representation & Reasoning > Object-Oriented Architecture (0.62)
Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (0.62)

Add feedback

Top 10 Data Mining Algorithms, Explained

#artificialintelligenceJun-25-2016, 11:00:51 GMT

Today, I'm going to explain in plain English the top 10 most influential data mining algorithms as voted on by 3 separate panels in this survey paper. Once you know what they are, how they work, what they do and where you can find them, my hope is you'll have this blog post as a springboard to learn even more about data mining. In order to do this, C4.5 is given a set of data representing things that are already classified. A classifier is a tool in data mining that takes a bunch of data representing things we want to classify and attempts to predict which class the new data belongs to. Sure, suppose a dataset contains a bunch of patients.

artificial intelligence, data mining, machine learning, (19 more...)

#artificialintelligence

Genre: Overview (0.55)

Industry: Health & Medicine > Therapeutic Area (0.34)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (0.80)

Add feedback

Regression Trees and Random forest based feature selection for malaria risk exposure prediction

Kouwayè, Bienvenue

arXiv.org Machine LearningJun-24-2016

This paper deals with prediction of anopheles number, the main vector of malaria risk, using environmental and climate variables. The variables selection is based on an automatic machine learning method using regression trees, and random forests combined with stratified two levels cross validation. The minimum threshold of variables importance is accessed using the quadratic distance of variables importance while the optimal subset of selected variables is used to perform predictions. Finally the results revealed to be qualitatively better, at the selection, the prediction , and the CPU time point of view than those obtained by GLM-Lasso method.

artificial intelligence, machine learning, prediction, (16 more...)

arXiv.org Machine Learning

1606.07578

Country: Europe > France (0.28)

Genre: Research Report (1.00)

Industry:

Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (1.00)
Health & Medicine > Therapeutic Area > Immunology (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (1.00)

Add feedback

Decision trees vs. Neural Networks

#artificialintelligenceJun-23-2016, 07:11:01 GMT

I'm implementing a machine learning structure to try and predict fraud on financial systems like banks, etc... This means that there is a lot of different data that can be used to train the model eg. I'm having trouble deciding which structure is the best for this problem. I have some experience with decision trees but currently I have started to question if a neural network would be better for this kind of problem. Also if any other method would be best please feel free to enlighten me.

decision tree learning, machine learning, neural network, (1 more...)

#artificialintelligence

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Diagnosis (0.73)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.73)
Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (0.73)

Add feedback