AITopics | Decision Tree Learning

Collaborating Authors

Decision Tree Learning

Learning to Classify with Branching Tests: "A decision tree takes as input an object or situation described by a set of properties, and outputs a yes/no decision. Decision trees therefore represent Boolean functions. Functions with a larger range of outputs can also be represented...."
– Artificial Intelligence: A Modern Approach. By Stuart Russell & Peter Norvig. 2002. Section 18.3; page 531.

News Overviews Instructional Materials AI-Alerts Classics

VisRuler: Visual Analytics for Extracting Decision Rules from Bagged and Boosted Decision Trees

Chatzimparmpas, Angelos, Martins, Rafael M., Kerren, Andreas

arXiv.org Machine LearningDec-1-2021

Bagging and boosting are two popular ensemble methods in machine learning (ML) that produce many individual decision trees. Due to the inherent ensemble characteristic of these methods, they typically outperform single decision trees or other ML models in predictive performance. However, numerous decision paths are generated for each decision tree, increasing the overall complexity of the model and hindering its use in domains that require trustworthy and explainable decisions, such as finance, social care, and health care. Thus, the interpretability of bagging and boosting algorithms, such as random forests and adaptive boosting, reduces as the number of decisions rises. In this paper, we propose a visual analytics tool that aims to assist users in extracting decisions from such ML models via a thorough visual inspection workflow that includes selecting a set of robust and diverse models (originating from different ensemble learning algorithms), choosing important features according to their global contribution, and deciding which decisions are essential for global explanation (or locally, for specific cases). The outcome is a final decision based on the class agreement of several models and the explored manual decisions exported by users. Finally, we evaluate the applicability and effectiveness of VisRuler via a use case, a usage scenario, and a user study.

algorithm, decision tree, extracting decision rule, (13 more...)

arXiv.org Machine Learning

2112.00334

Country:

North America > United States > California > San Francisco County > San Francisco (0.14)
North America > United States > New York > New York County > New York City (0.04)
North America > United States > Massachusetts > Suffolk County > Boston (0.04)
(5 more...)

Genre: Research Report > New Finding (0.67)

Industry:

Banking & Finance (0.93)
Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (0.46)
Education > Educational Setting > Online (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (1.00)

Add feedback

Improve Random Forest with Linear Models

#artificialintelligenceNov-30-2021, 16:13:42 GMT

Random Forest is probably considered by most the silver bullet in supervised prediction tasks. For sure, any data scientist involved in standard machine learning applications is used to fit and benchmark a Random Forest. Random Forest is a well-known algorithm in literature and is proven to reach satisfactory results in both regression and classification contexts. It enjoys the ability to learn complex data relationships with low effort. There are a lot of open-sourced efficient implementations which are available to all of us (the one provided by scikit-learn is for sure the most famous).

implementation, linear model, random forest, (3 more...)

#artificialintelligence

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (1.00)

Add feedback

Modelling hetegeneous treatment effects by quantitle local polynomial decision tree and forest

Xinglin, Lai

arXiv.org Machine LearningNov-30-2021

For example, the economic or social effects of a new drug trial, a new policy or even the effects of a new feature in an advertisement or software are all areas of interest to researchers.

heterogeneity, treatment effect, treatment effect function, (16 more...)

arXiv.org Machine Learning

2111.1532

Country:

Oceania > Australia (0.04)
North America > United States > New York (0.04)
North America > United States > New Jersey (0.04)
(3 more...)

Genre: Research Report > Experimental Study (1.00)

Industry:

Education (0.68)
Health & Medicine > Pharmaceuticals & Biotechnology (0.66)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (1.00)

Add feedback

Top 5 techniques for Explainable AI

#artificialintelligenceNov-28-2021, 08:35:32 GMT

As you can see that all these explainable AI techniques are not "nice-to-have", but mandatory. Using these techniques will help you better communicate with the person impacted through AI decisions. In some cases, as seen in the stroke prediction example, understanding these techniques can help improve or save lives. You can experience some of the techniques in this article on my website -- https://experiencedatascience.com

glucose level, prediction, probability, (15 more...)

#artificialintelligence

Industry: Health & Medicine > Therapeutic Area > Cardiology/Vascular Diseases (0.56)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Explanation & Argumentation (0.63)
Information Technology > Artificial Intelligence > Issues > Social & Ethical Issues (0.63)
Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (0.52)

Add feedback

Random Forests Algorithm explained with a real-life example and some Python code

#artificialintelligenceNov-28-2021, 05:05:13 GMT

Random Forests is a Machine Learning algorithm that tackles one of the biggest problems with Decision Trees: variance. Even though Decision Trees is simple and flexible, it is greedy algorithm. It focuses on optimizing for the node split at hand, rather than taking into account how that split impacts the entire tree. A greedy approach makes Decision Trees run faster, but makes it prone overfitting. An overfit tree is highly optimized to predicting the values in the training dataset, resulting in a learning model with high-variance.

dataset, decision tree, random forest, (14 more...)

#artificialintelligence

Technology: Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (1.00)

Add feedback

Factor-augmented tree ensembles

Pellegrino, Filippo

arXiv.org Machine LearningNov-27-2021

This manuscript proposes to extend the information set of time-series regression trees with latent stationary factors extracted via state-space methods. First, it allows to handle predictors that exhibit measurement error, non-stationary trends, seasonality and/or irregularities such as missing observations. Second, it gives a transparent way for using domain-specific theory to inform time-series regression trees. As a byproduct, this technique sets the foundations for structuring powerful ensembles. Their real-world applicability is studied under the lenses of empirical macro-finance. Keywords: Ensemble learning, Factor models, State-space models, Time series, Unobserved components.Introduction In time series, the simplicity of regression trees (Morgan and Sonquist, 1963; Breiman et al., 1984; Quinlan, 1986) comes at a cost: irregularities, complicated periodic patterns and non-stationary trends cannot be explicitly modelled, and this is unfortunate given that many real-world examples are subject to them. Following, in spirit, Harvey et al. (1998), this paper proposes to pre-process problematic predictors using state-space representations general enough to deal with all these complexities at once. This operation can be thought as an automated feature engineering process that extracts stationary patterns hidden across multiple predictors, while handling problematic data characteristics. Besides, when the state-space representation is compatible with domain-specific theory, this becomes a transparent way for extracting signals with structural interpretation. The resulting stationary common components, referred hereinbelow as stationary dynamic factors, are then employed as regular predictors for standard time-series regression trees. This manuscript calls them factor-augmented regression trees to stress their dependence on latent components. I thank Matteo Barigozzi and Kostas Kalogeropoulos for their valuable suggestions and supervision; Serena Lariccia and Qiwei Yao for their helpful comments on a preliminary draft of this article.

artificial intelligence, decision tree learning, machine learning, (19 more...)

arXiv.org Machine Learning

2111.14

Country:

North America > United States (1.00)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Genre: Research Report (0.64)

Industry:

Banking & Finance > Economy (1.00)
Government > Regional Government > North America Government > United States Government (0.93)
Energy (0.67)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (1.00)

Add feedback

Statistical Tests for Comparing Classification Algorithms

#artificialintelligenceNov-24-2021, 21:05:01 GMT

Comparing prediction methods to define which one should be used for the task at hand is a daily activity for most data scientists. Usually, one will have a pool of classification models and will validate them using cross-validation to define which one is best. Another goal, however, is not to compare classifiers, but the learning algorithms themselves. The idea is: given this task (data), which learning algorithm (KNN, SVM, Random Forests, etc) will generate more accurate classifiers on a dataset of size D? As we will see, every method presented here has some pros and cons. However, the first intuition of using a two proportions test can lead to some really bad results.

algorithm, implementation, statistical test, (15 more...)

#artificialintelligence

Country: North America > United States > California > Orange County > Irvine (0.05)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis (0.56)
Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (0.36)

Add feedback

MURAL: An Unsupervised Random Forest-Based Embedding for Electronic Health Record Data

Gerasimiuk, Michal, Shung, Dennis, Tong, Alexander, Stanley, Adrian, Schultz, Michael, Ngu, Jeffrey, Laine, Loren, Wolf, Guy, Krishnaswamy, Smita

arXiv.org Artificial IntelligenceNov-19-2021

A major challenge in embedding or visualizing clinical patient data is the heterogeneity of variable types including continuous lab values, categorical diagnostic codes, as well as missing or incomplete data. In particular, in EHR data, some variables are {\em missing not at random (MNAR)} but deliberately not collected and thus are a source of information. For example, lab tests may be deemed necessary for some patients on the basis of suspected diagnosis, but not for others. Here we present the MURAL forest -- an unsupervised random forest for representing data with disparate variable types (e.g., categorical, continuous, MNAR). MURAL forests consist of a set of decision trees where node-splitting variables are chosen at random, such that the marginal entropy of all other variables is minimized by the split. This allows us to also split on MNAR variables and discrete variables in a way that is consistent with the continuous variables. The end goal is to learn the MURAL embedding of patients using average tree distances between those patients. These distances can be fed to nonlinear dimensionality reduction method like PHATE to derive visualizable embeddings. While such methods are ubiquitous in continuous-valued datasets (like single cell RNA-sequencing) they have not been used extensively in mixed variable data. We showcase the use of our method on one artificial and two clinical datasets. We show that using our approach, we can visualize and classify data more accurately than competing approaches. Finally, we show that MURAL can also be used to compare cohorts of patients via the recently proposed tree-sliced Wasserstein distances.

dataset, missingness, mural-embedding, (16 more...)

arXiv.org Artificial Intelligence

2111.10452

Country:

North America > United States > Connecticut > New Haven County > New Haven (0.04)
North America > Canada > Quebec > Montreal (0.04)
North America > United States > New York > New York County > New York City (0.04)
(10 more...)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)

Industry:

Health & Medicine > Health Care Technology > Medical Record (1.00)
Health & Medicine > Health Care Providers & Services (0.96)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (1.00)

Add feedback

A Hybrid Approach for an Interpretable and Explainable Intrusion Detection System

Dias, Tiago, Oliveira, Nuno, Sousa, Norberto, Praça, Isabel, Sousa, Orlando

arXiv.org Artificial IntelligenceNov-19-2021

Cybersecurity has been a concern for quite a while now. In the latest years, cyberattacks have been increasing in size and complexity, fueled by significant advances in technology. Nowadays, there is an unavoidable necessity of protecting systems and data crucial for business continuity. Hence, many intrusion detection systems have been created in an attempt to mitigate these threats and contribute to a timelier detection. This work proposes an interpretable and explainable hybrid intrusion detection system, which makes use of artificial intelligence methods to achieve better and more long-lasting security. The system combines experts' written rules and dynamic knowledge continuously generated by a decision tree algorithm as new shreds of evidence emerge from network activity.

hybrid approach, intrusion detection system, knowledge base, (12 more...)

arXiv.org Artificial Intelligence

2111.1028

Country:

North America > United States (0.15)
Europe > Sweden > Stockholm > Stockholm (0.04)
Europe > Portugal > Porto > Porto (0.04)

Genre: Research Report (0.64)

Industry:

Law Enforcement & Public Safety > Crime Prevention & Enforcement (1.00)
Information Technology > Security & Privacy (1.00)
Government > Military > Cyberwarfare (0.70)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Communications > Networks (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Expert Systems (0.71)
(4 more...)

Add feedback

A Large Scale Benchmark for Individual Treatment Effect Prediction and Uplift Modeling

Diemert, Eustache, Betlei, Artem, Renaudin, Christophe, Amini, Massih-Reza, Gregoir, Théophane, Rahier, Thibaud

arXiv.org Artificial IntelligenceNov-19-2021

Individual Treatment Effect (ITE) prediction is an important area of research in machine learning which aims at explaining and estimating the causal impact of an action at the granular level. It represents a problem of growing interest in multiple sectors of application such as healthcare, online advertising or socioeconomics. To foster research on this topic we release a publicly available collection of 13.9 million samples collected from several randomized control trials, scaling up previously available datasets by a healthy 210x factor. We provide details on the data collection and perform sanity checks to validate the use of this data for causal inference tasks. First, we formalize the task of uplift modeling (UM) that can be performed with this data, along with the relevant evaluation metrics. Then, we propose synthetic response surfaces and heterogeneous treatment assignment providing a general set-up for ITE prediction. Finally, we report experiments to validate key characteristics of the dataset leveraging its size to evaluate and compare - with high statistical significance - a selection of baseline UM and ITE prediction methods.

benchmark, dataset, experiment, (14 more...)

arXiv.org Artificial Intelligence

2111.10106

Country:

North America > United States > California > Monterey County > Monterey (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Europe > France > Auvergne-Rhône-Alpes > Isère > Grenoble (0.04)

Genre: Research Report > Experimental Study (1.00)

Industry:

Health & Medicine (1.00)
Law (0.93)
Information Technology > Services (0.88)
Marketing (0.88)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (0.46)

Add feedback