AITopics | Decision Tree Learning

Decision Tree Learning

Learning to Classify with Branching Tests: "A decision tree takes as input an object or situation described by a set of properties, and outputs a yes/no decision. Decision trees therefore represent Boolean functions. Functions with a larger range of outputs can also be represented...."
– Artificial Intelligence: A Modern Approach. By Stuart Russell & Peter Norvig. 2002. Section 18.3; page 531.
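
The quoted definition can be illustrated with a minimal sketch. The tree below, its attribute names (patrons, hungry) and the decide helper are hypothetical choices made for illustration; they are assumptions, not something stated in the quotation. The point is only that following branching tests from the root to a leaf computes a Boolean function of the input's properties.

```python
# Hypothetical toy tree (illustrative assumption, not from the quoted text).
# Internal node: (attribute, {attribute value: subtree}); leaf: True/False.
TREE = ("patrons", {
    "none": False,
    "some": True,
    "full": ("hungry", {True: True, False: False}),
})


def decide(tree, example):
    """Follow the branch matching the example at each test until a leaf is reached."""
    while not isinstance(tree, bool):
        attribute, branches = tree
        tree = branches[example[attribute]]
    return tree


if __name__ == "__main__":
    print(decide(TREE, {"patrons": "full", "hungry": True}))
```

Replacing the Boolean leaves with other labels or numbers would give the "larger range of outputs" the quotation mentions.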

When Does Deep Learning Work Better Than SVMs or Random Forests?

@machinelearnbot Jun-3-2017, 15:25:13 GMT

If we tackle a supervised learning problem, my advice is to start with the simplest hypothesis space first, i.e., to try a linear model such as logistic regression. If this doesn't work "well" (i.e., it doesn't meet the expectation or performance criterion that we defined earlier), I would move on to the next experiment. I would say that random forests are probably the "worry-free" approach, if such a thing exists in ML: there are no real hyperparameters to tune (except maybe the number of trees; typically, the more trees we have, the better). In contrast, there are a lot of knobs to turn in SVMs: choosing the "right" kernel, the regularization penalty, the slack variables, ... Both random forests and kernel SVMs are non-parametric models (i.e., their complexity grows as the number of training samples increases).

artificial intelligence, machine learning, svm, (13 more...)

@machinelearnbot

Country: North America > United States > Michigan (0.05)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.62)

Want to know how to choose Machine Learning algorithm?

@machinelearnbot Jun-3-2017

Machine learning, which learns from the data provided to its algorithms, is the foundation for today's insights on customers, products, costs and revenues. Some of the most common examples are Netflix's algorithms, which suggest movies based on those you have watched in the past, and Amazon's algorithms, which recommend products based on what other customers have bought before. Decision Trees: Decision tree output is very easy to understand, even for people from a non-analytical background. It does not require any statistical knowledge to read and interpret. Decision trees are among the fastest ways to identify the most significant variables and the relations between two or more variables.

algorithm, artificial intelligence, machine learning, (11 more...)

@machinelearnbot

Industry: Banking & Finance > Trading (0.32)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (0.82)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.57)

Sign Up for Data Science Central

@machinelearnbot Jun-2-2017, 08:30:18 GMT

artificial intelligence, decision tree learning, machine learning

@machinelearnbot

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (0.40)
Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (0.40)

Random Forests explained intuitively

@machinelearnbot Jun-2-2017, 03:20:15 GMT

Say you applied for the position of statistical analyst at WalmartLabs. As at most companies, there is not just one round of interviews; there are multiple rounds. Each of these interviews is chaired by an independent panel. Generally, even the questions asked differ from one interview to the next.
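
The interview analogy can be sketched in a few lines. This is a minimal illustration under stated assumptions: the three panel functions, their score fields and their cut-offs are hypothetical, and each panel stands in for one tree of a forest that looks at different aspects of the same candidate. The final decision is simply the majority of the independent votes.

```python
from collections import Counter


# Hypothetical panels (assumptions for illustration): each one asks
# different questions, much as each tree in a forest uses different splits.
def stats_panel(candidate):
    return "hire" if candidate["stats_score"] >= 7 else "reject"


def coding_panel(candidate):
    return "hire" if candidate["coding_score"] >= 6 else "reject"


def hr_panel(candidate):
    return "hire" if candidate["communication"] >= 5 else "reject"


PANELS = [stats_panel, coding_panel, hr_panel]


def forest_decision(candidate, panels=PANELS):
    """Collect one vote per panel and return the most common vote."""
    votes = [panel(candidate) for panel in panels]
    return Counter(votes).most_common(1)[0][0]


if __name__ == "__main__":
    applicant = {"stats_score": 8, "coding_score": 4, "communication": 6}
    print(forest_decision(applicant))
```

With an odd number of panels and two possible outcomes there is never a tie, which is one reason an odd panel count is a convenient choice in this sketch.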

decision tree learning, interview, machine learning, (2 more...)

@machinelearnbot

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (0.70)
Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (0.47)

Building Trust in Machine Learning Models (using LIME in Python)

#artificialintelligence Jun-2-2017, 00:15:53 GMT

The value is not in software, the value is in data, and it is really important for every single company to understand what data it has. More and more companies are now aware of the power of data. Machine learning models are increasing in popularity and are now being used to solve a wide variety of business problems using data. That said, there is often a trade-off between a model's accuracy and its interpretability. In general, if accuracy has to be improved, data scientists have to resort to complicated algorithms such as bagging, boosting and random forests, which are "black box" methods.

algorithm, artificial intelligence, machine learning, (18 more...)

#artificialintelligence

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (0.40)
Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (0.38)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.35)

How the random forest algorithm works in machine learning – 7wData

#artificialintelligence Jun-1-2017, 19:15:28 GMT

You are going to learn one of the most popular classification algorithms: the random forest algorithm. As motivation to go further, here is one of the best advantages of random forest: the same algorithm works for both classification and regression. You might be thinking I am kidding, but the truth is yes, we can use the same random forest algorithm for both classification and regression.
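
One way to see why the same algorithm can serve both tasks is that only the final aggregation step changes. The sketch below assumes the individual trees have already produced their outputs; the aggregate function and its task argument are hypothetical names chosen for illustration, not part of any particular library.

```python
from collections import Counter
from statistics import mean


def aggregate(tree_outputs, task):
    """Combine per-tree outputs: majority vote for labels, mean for numbers."""
    if task == "classification":
        return Counter(tree_outputs).most_common(1)[0][0]
    if task == "regression":
        return mean(tree_outputs)
    raise ValueError(f"unknown task: {task!r}")


if __name__ == "__main__":
    print(aggregate(["spam", "ham", "spam"], "classification"))
    print(aggregate([2.0, 3.0, 4.0], "regression"))
```

Everything before this step (sampling the training rows, growing each tree on its sample) can stay the same; the trees only differ in whether their leaves hold class labels or numeric values.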

algorithm, artificial intelligence, machine learning, (15 more...)

#artificialintelligence

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (1.00)

Optimization of Tree Ensembles

Mišić, Velibor V.

arXiv.org Machine Learning May-30-2017

Tree ensemble models such as random forests and boosted trees are among the most widely used and practically successful predictive models in applied machine learning and business analytics. Although such models have been used to make predictions based on exogenous, uncontrollable independent variables, they are increasingly being used to make predictions where the independent variables are controllable and are also decision variables. In this paper, we study the problem of tree ensemble optimization: given a tree ensemble that predicts some dependent variable using controllable independent variables, how should we set these variables so as to maximize the predicted value? We formulate the problem as a mixed-integer optimization problem. We theoretically examine the strength of our formulation, provide a hierarchy of approximate formulations with bounds on approximation quality and exploit the structure of the problem to develop two large-scale solution methods, one based on Benders decomposition and one based on iteratively generating tree split constraints. We test our methodology on real data sets, including two case studies in drug design and customized pricing, and show that our methodology can efficiently solve large-scale instances to near or full optimality, and outperforms solutions obtained by heuristic approaches. In our drug design case, we show how our approach can identify compounds that efficiently trade off predicted performance and novelty with respect to existing, known compounds. In our customized pricing case, we show how our approach can efficiently determine optimal store-level prices under a random forest model that delivers excellent predictive accuracy.
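
The problem statement in this abstract can be made concrete with a tiny sketch. Note that this is not the paper's method: the paper formulates a mixed-integer optimization problem, whereas the code below uses plain exhaustive search, which is only feasible for very small ensembles. The two trees, the feature names (price, promo) and the leaf values are hypothetical. Because a tree ensemble's prediction is piecewise constant, trying one representative value per interval between split thresholds is enough to find a maximizer.

```python
from itertools import product

# Hypothetical two-tree ensemble (illustrative assumption).
# Internal node: (feature, threshold, left, right); leaf: predicted value.
# A sample goes left when x[feature] <= threshold.
TREES = [
    ("price", 10.0, ("promo", 0.5, 3.0, 5.0), 1.0),
    ("price", 8.0, 4.0, ("promo", 0.5, 2.0, 6.0)),
]


def predict_tree(node, x):
    """Descend from the root to a leaf and return the leaf value."""
    while isinstance(node, tuple):
        feature, threshold, left, right = node
        node = left if x[feature] <= threshold else right
    return node


def predict(trees, x):
    """Ensemble prediction: the average of the individual tree predictions."""
    return sum(predict_tree(tree, x) for tree in trees) / len(trees)


def collect_thresholds(node, found):
    """Record every split threshold, grouped by feature."""
    if isinstance(node, tuple):
        feature, threshold, left, right = node
        found.setdefault(feature, set()).add(threshold)
        collect_thresholds(left, found)
        collect_thresholds(right, found)
    return found


def best_input(trees):
    """Exhaustively search one representative point per threshold interval."""
    found = {}
    for tree in trees:
        collect_thresholds(tree, found)
    features = sorted(found)
    # Each threshold represents the interval ending at it; one extra value
    # above the largest threshold represents the last, open-ended interval.
    grids = [sorted(found[f]) + [max(found[f]) + 1.0] for f in features]
    candidates = (dict(zip(features, values)) for values in product(*grids))
    return max(candidates, key=lambda x: predict(trees, x))


if __name__ == "__main__":
    best = best_input(TREES)
    print(best, predict(TREES, best))
```

The number of candidates grows as the product of the per-feature grid sizes, which is why large instances call for the kind of structured formulation the abstract describes rather than enumeration.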

constraint, data mining, machine learning, (16 more...)

arXiv.org Machine Learning

1705.10883

Country: North America > United States > California > Los Angeles County > Los Angeles (0.28)

Genre: Research Report > New Finding (0.67)

Industry:

Health & Medicine > Pharmaceuticals & Biotechnology (1.00)
Consumer Products & Services (0.92)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
(3 more...)

Simple Decision Tree Excel Add-in

@machinelearnbot May-28-2017, 20:45:07 GMT

Simple Decision Tree is an Excel Add-in created by Thomas Seyller. The Add-in is released under the terms of GPL v3 with additional permissions. Thomas created this Add-in for the Stanford Decisions and Ethics Center and open-sourced it for the Decision Professionals Network. This software has been extensively used to teach Decision Analysis at Stanford University.

artificial intelligence, decision tree learning, machine learning, (1 more...)

@machinelearnbot

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Diagnosis (0.81)
Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (0.81)

Mining of health and disease events on Twitter: validating search protocols within the setting of Indonesia

Ramadona, Aditya L.; Agusta, Rendra; Sulistyawati; Lazuardi, Lutfan; Cahyono, Anwar D.; Holmner, Åsa; Dewi, Fatwa S. T.; Kusnanto, Hari; Rocklöv, Joacim

arXiv.org Machine Learning May-28-2017

As of May 2016, 24.34 million Indonesians, or around 10% of the population, were active monthly on Twitter [1], sharing news, events, as well as their personal feelings and experiences, including health-related information. Twitter offers a potential for data mining of public information flows [2], and these massive data sources may be exploited for public health monitoring and surveillance purposes [3]. Previous studies have explored the use of Twitter, for example, to track levels of disease activity [4], to predict heart disease mortality [5], and to measure health-related quality of life [6]. However, the validity of Twitter mining protocols for correctly detecting health and disease events is one methodological challenge of this medium. This study seeks to validate a search protocol of ill health-related terms using real-time Twitter data, which can later be used to understand if, and how, Twitter can reveal information on the current health situation in Indonesia. In this validation study of mining protocols, we: 1) extracted geo-located conversations related to health and disease postings on Twitter using a set of predefined keywords, 2) assessed the prevalence, frequency and timing of such content in these conversations, and 3) validated how well this search protocol was able to detect relevant disease tweets.

artificial intelligence, machine learning, tweet, (16 more...)

arXiv.org Machine Learning

1608.0591

Country:

Asia > Indonesia > Java (0.18)
Europe > Austria > Vienna (0.14)

Genre: Research Report (0.84)

Industry:

Health & Medicine > Consumer Health (1.00)
Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (0.97)
Health & Medicine > Epidemiology (0.96)
(2 more...)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (0.32)

Implementing Decision Trees using Scikit-Learn – Prashant Gupta – Medium

#artificialintelligence May-27-2017, 03:00:10 GMT

Scikit-Learn is a popular library for machine learning in the Python programming language. If you want to test your knowledge with just a few lines of code, scikit-learn is what you need. From linear and logistic regression to SVM and KNN, you name it and scikit-learn has it. You will often need to prepare and transform your data into a form that is suitable for scikit-learn to use for training models. Pandas is an awesome Python library that can be used for this purpose.

artificial intelligence, decision tree learning, machine learning, (8 more...)

#artificialintelligence

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (0.85)
Information Technology > Artificial Intelligence > Representation & Reasoning > Diagnosis (0.57)
