AITopics | Decision Tree Learning

Collaborating Authors

Decision Tree Learning

Learning to Classify with Branching Tests: "A decision tree takes as input an object or situation described by a set of properties, and outputs a yes/no decision. Decision trees therefore represent Boolean functions. Functions with a larger range of outputs can also be represented...."
– Artificial Intelligence: A Modern Approach. By Stuart Russell & Peter Norvig. 2002. Section 18.3; page 531.

News Overviews Instructional Materials AI-Alerts Classics

Discriminatory AI explained with an example

#artificialintelligenceApr-24-2022, 23:40:14 GMT

AI is increasingly used in making decisions that impact us directly such as job applications, our credit rating, match-making on dating sites. So it is important that AI is non-discriminatory and that decisions do not favor certain races, gender, the color of skin. Discriminatory AI is a very wide subject going beyond purely technical aspects. However, to make it easily understandable, I will demonstrate how discriminatory AI looks using examples and visuals. This will give you a way to spot a discriminatory AI. Let me first establish the context of the example.

applicant, female applicant, gender, (13 more...)

#artificialintelligence

Industry: Banking & Finance (0.52)

Technology:

Information Technology > Communications > Social Media (0.39)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.39)
Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (0.34)

Add feedback

An Efficient Approach for Optimizing the Cost-effective Individualized Treatment Rule Using Conditional Random Forest

Xu, Yizhe, Greene, Tom H., Bress, Adam P., Bellows, Brandon K., Zhang, Yue, Zhang, Zugui, Kolm, Paul, Weintraub, William S., Moran, Andrew S., Shen, Jincheng

arXiv.org Machine LearningApr-22-2022

Evidence from observational studies has become increasingly important for supporting healthcare policy making via cost-effectiveness (CE) analyses. Similar as in comparative effectiveness studies, health economic evaluations that consider subject-level heterogeneity produce individualized treatment rules (ITRs) that are often more cost-effective than one-size-fits-all treatment. Thus, it is of great interest to develop statistical tools for learning such a cost-effective ITR (CE-ITR) under the causal inference framework that allows proper handling of potential confounding and can be applied to both trials and observational studies. In this paper, we use the concept of net-monetary-benefit (NMB) to assess the trade-off between health benefits and related costs. We estimate CE-ITR as a function of patients' characteristics that, when implemented, optimizes the allocation of limited healthcare resources by maximizing health gains while minimizing treatment-related costs. We employ the conditional random forest approach and identify the optimal CE-ITR using NMB-based classification algorithms, where two partitioned estimators are proposed for the subject-specific weights to effectively incorporate information from censored individuals. We conduct simulation studies to evaluate the performance of our proposals. We apply our top-performing algorithm to the NIH-funded Systolic Blood Pressure Intervention Trial (SPRINT) to illustrate the CE gains of assigning customized intensive blood pressure therapy.

artificial intelligence, machine learning, sagej, (18 more...)

arXiv.org Machine Learning

2204.10971

Country:

North America > United States > Utah (0.04)
North America > United States > New York > New York County > New York City (0.04)
North America > United States > Maryland > Montgomery County > Bethesda (0.04)
(5 more...)

Genre:

Research Report > Experimental Study (1.00)
Research Report > Strength Medium (0.68)

Industry:

Health & Medicine > Therapeutic Area > Cardiology/Vascular Diseases (1.00)
Health & Medicine > Pharmaceuticals & Biotechnology (0.93)
Health & Medicine > Public Health (0.87)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.95)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.46)

Add feedback

Understanding your Neural Network's predictions

#artificialintelligenceApr-19-2022, 09:40:15 GMT

Neural networks are extremely convenient. They are usable for both regression and classification, work on structured and unstructured data, handle temporal data very well, and can usually reach high performances if they are given a sufficient amount of data. What is gained in convenience is, however, lost in interpretability and that can be a major setback when models are presented to a non-technical audience, such as clients or stakeholders. For instance, last year, the Data Science team I am part of wanted to convince a client to go from a decision tree model to a neural network, and for good reasons: we had access to a large amount of data and most of it was temporal. The client was on board, but wanted to keep an understanding of what the model based its decisions on, which means evaluating its features' importance.

inference, prediction, student, (13 more...)

#artificialintelligence

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (0.92)

Add feedback

The Application of Machine Learning Techniques for Predicting Match Results in Team Sport: A Review

Bunker, Rory, Susnjak, Teo

Journal of Artificial Intelligence ResearchApr-14-2022

Predicting the results of matches in sport is a challenging and interesting task. In this paper, we review a selection of studies from 1996 to 2019 that used machine learning for predicting match results in team sport. Considering both invasion sports and striking/fielding sports, we discuss commonly applied machine learning algorithms, as well as common approaches related to data and evaluation. Our study considers accuracies that have been achieved across different sports, and explores whether evidence exists to support the notion that outcomes of some sports may be inherently more difficult to predict. We also uncover common themes of future research directions and propose recommendations for future researchers. Although there remains a lack of benchmark datasets (apart from in soccer), and the differences between sports, datasets and features makes between-study comparisons difficult, as we discuss, it is possible to evaluate accuracy performance in other ways. Artificial Neural Networks were commonly applied in early studies, however, our findings suggest that a range of models should instead be compared. Selecting and engineering an appropriate feature set appears to be more important than having a large number of instances. For feature selection, we see potential for greater inter-disciplinary collaboration between sport performance analysis, a sub-discipline of sport science, and machine learning.

accuracy, dataset, prediction, (13 more...)

Journal of Artificial Intelligence Research

doi: 10.1613/jair.1.13509

AI Access Foundation

13509

Journal of Artificial Intelligence Research

Country:

Asia > Singapore (0.04)
Asia > Japan (0.04)
Oceania > New Zealand > North Island > Waikato (0.04)
(8 more...)

Genre:

Research Report > New Finding (1.00)
Overview (1.00)

Industry:

Leisure & Entertainment > Sports > Soccer (1.00)
Leisure & Entertainment > Sports > Football (1.00)
Leisure & Entertainment > Sports > Basketball (1.00)
(2 more...)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Expert Systems (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
(4 more...)

Add feedback

Decision Trees vs Random Forest

#artificialintelligenceApr-11-2022, 11:00:09 GMT

Last week I published two articles about Decision Trees: one about Decision and Classification Tree (CART) and another tutorial on how to implement Random Forest classifier. These two methods may look very similar, however there are important differences that every data professional or enthusiastic should know.

decision tree vs random forest

#artificialintelligence

Technology: Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (1.00)

Add feedback

Wrote about Decision trees -- Karthikeyan A K

#artificialintelligenceApr-11-2022, 05:45:51 GMT

Once again machine learning bug bit me, and after a long delay I wrote about Decision Trees in my book Introduction To DataScience. I am looking to write about K-nearest neighbors next. Even though I have a work where the client hasn't yet given me a unnecessary trouble yet (which has not been the case for years now, and it looks like I have entered dream land or something), work is taking time, and I have to tend to what gives me bread first. But learning Data Science from scratch is my goal, and I will achieve it. I would be very happy if people can read it and mail their feedback to [email protected], no matter where in the world I am, mail works if I have the internet, and I can take corrective measures and possibly answer you.

decision tree, karthikeyan

#artificialintelligence

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Diagnosis (0.67)
Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (0.67)

Add feedback

Data Mining with Rattle

#artificialintelligenceApr-8-2022, 12:44:41 GMT

Rattle and R deliver a very sophisticated data mining environment. Data Mining with Rattle is a unique course that instructs with respect to both the concepts of data mining, as well as to the "hands-on" use of a popular, contemporary data mining software tool, "Data Miner," also known as the'Rattle' package in R software. Rattle is a popular GUI-based software tool which'fits on top of' R software. The course focuses on life-cycle issues, processes, and tasks related to supporting a'cradle-to-grave' data mining project. These include: data exploration and visualization; testing data for random variable family characteristics and distributional assumptions; transforming data by scale or by data type; performing cluster analyses; creating, analyzing and interpreting association rules; and creating and evaluating predictive models that may utilize: regression; generalized linear modeling (GLMs); decision trees; recursive partitioning; random forests; boosting; and/or support vector machine (SVM) paradigms. It is both a conceptual and a practical course as it teaches and instructs about data mining, and provides ample demonstrations of conducting data mining tasks using the Rattle R package.

data mining, rattle, software tool, (2 more...)

#artificialintelligence

Genre: Instructional Material > Course Syllabus & Notes (0.67)

Industry:

Materials > Metals & Mining (0.60)
Education (0.37)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Support Vector Machines (0.60)
Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (0.60)

Add feedback

Q-learning with online random forests

Min, Joosung, Elliott, Lloyd T.

arXiv.org Machine LearningApr-7-2022

$Q$-learning is the most fundamental model-free reinforcement learning algorithm. Deployment of $Q$-learning requires approximation of the state-action value function (also known as the $Q$-function). In this work, we provide online random forests as $Q$-function approximators and propose a novel method wherein the random forest is grown as learning proceeds (through expanding forests). We demonstrate improved performance of our methods over state-of-the-art Deep $Q$-Networks in two OpenAI gyms (`blackjack' and `inverted pendulum') but not in the `lunar lander' gym. We suspect that the resilience to overfitting enjoyed by random forests recommends our method for common tasks that do not require a strong representation of the problem domain. We show that expanding forests (in which the number of trees increases as data comes in) improve performance, suggesting that expanding forests are viable for other applications of online random forests beyond the reinforcement learning setting.

artificial intelligence, machine learning, reinforcement learning, (18 more...)

arXiv.org Machine Learning

2204.03771

Country:

North America > United States > Massachusetts (0.04)
North America > Canada > British Columbia (0.04)

Genre: Research Report > Promising Solution (0.34)

Industry: Leisure & Entertainment > Games (0.93)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.50)

Add feedback

Machine Learning-Based GPS Multipath Detection Method Using Dual Antennas

Kim, Sanghyun, Byun, Jungyun, Park, Kwansik

arXiv.org Artificial IntelligenceApr-6-2022

In urban areas, global navigation satellite system (GNSS) signals are often reflected or blocked by buildings, thus resulting in large positioning errors. In this study, we proposed a machine learning approach for global positioning system (GPS) multipath detection that uses dual antennas. A machine learning model that could classify GPS signal reception conditions was trained with several GPS measurements selected as suggested features. We applied five features for machine learning, including a feature obtained from the dual antennas, and evaluated the classification performance of the model, after applying four machine learning algorithms: gradient boosting decision tree (GBDT), random forest, decision tree, and K-nearest neighbor (KNN). It was found that a classification accuracy of 82%-96% was achieved when the test data set was collected at the same locations as those of the training data set. However, when the test data set was collected at locations different from those of the training data, a classification accuracy of 44%-77% was obtained.

accuracy, algorithm, proc, (15 more...)

arXiv.org Artificial Intelligence

doi: 10.23919/ASCC56756.2022.9828175

2204.14001

Country:

North America > United States (0.14)
Asia > South Korea > Incheon > Incheon (0.05)
Asia > South Korea > Daejeon > Daejeon (0.04)

Genre: Research Report (0.71)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Diagnosis (0.92)
Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (0.75)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Nearest Neighbor Methods (0.54)

Add feedback

How to know when AI is the right solution

#artificialintelligenceApr-5-2022, 22:25:30 GMT

AI adoption is on the rise. According to a recent McKinsey survey, 55% of companies use artificial intelligence in at least one function, and 27% attribute at least 5% of earnings before interest and taxes to AI, much of that in the form of cost savings. As AI will dramatically transform nearly every industry it touches, it's no surprise that vendors and enterprises are looking for opportunities to deploy AI everywhere they can. But not every project can benefit from AI and attempting to apply AI inappropriately can not only cost time and money but also sour employees, customers, and corporate leaders on future AI projects. The key factors for determining whether a project is suitable for AI are business value, availability of training data, and cultural readiness for change.

customer, domino, fragoso, (15 more...)

#artificialintelligence

Country: North America > United States (0.04)

Genre: Financial News (0.34)

Industry: Banking & Finance > Real Estate (0.48)

Technology:

Information Technology > Artificial Intelligence > Applied AI (1.00)
Information Technology > Artificial Intelligence > Natural Language (0.95)
Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (0.47)

Add feedback