AITopics | Decision Tree Learning

Collaborating Authors

Decision Tree Learning

Learning to Classify with Branching Tests: "A decision tree takes as input an object or situation described by a set of properties, and outputs a yes/no decision. Decision trees therefore represent Boolean functions. Functions with a larger range of outputs can also be represented...."
– Artificial Intelligence: A Modern Approach. By Stuart Russell & Peter Norvig. 2002. Section 18.3; page 531.

News Overviews Instructional Materials AI-Alerts Classics

On Computing Probabilistic Explanations for Decision Trees

Arenas, Marcelo, Barceló, Pablo, Romero, Miguel, Subercaseaux, Bernardo

arXiv.org Artificial IntelligenceJun-30-2022

Formal XAI (explainable AI) is a growing area that focuses on computing explanations with mathematical guarantees for the decisions made by ML models. Inside formal XAI, one of the most studied cases is that of explaining the choices taken by decision trees, as they are traditionally deemed as one of the most interpretable classes of models. Recent work has focused on studying the computation of sufficient reasons, a kind of explanation in which given a decision tree and an instance, one explains the decision () by providing a subset of the features of such that for any other instance compatible with, it holds that () = (), intuitively meaning that the features in are already enough to fully justify the classification of by. It has been argued, however, that sufficient reasons constitute a restrictive notion of explanation. For such a reason, the community has started to study their probabilistic counterpart, in which one requires that the probability of () = () must be at least some value (0, 1], where is a random instance that is compatible with. Our paper settles the computational complexity of -sufficient-reasons over decision trees, showing that both (1) finding -sufficient-reasons that are minimal in size, and (2) finding -sufficient-reasons that are minimal inclusion-wise, do not admit polynomial-time algorithms (unless PTIME = NP). This is in stark contrast with the deterministic case (= 1) where inclusion-wise minimal sufficient-reasons are easy to compute. By doing this, we answer two open problems originally raised by Izza et al., and extend the hardness of explanations for Boolean circuits presented by Wäldchen et al. to the more restricted case of decision trees. On the positive side, we identify structural restrictions of decision trees that make the problem tractable, and show how SAT solvers might be able to tackle these problems in practical settings.

artificial intelligence, decision tree learning, machine learning, (18 more...)

arXiv.org Artificial Intelligence

2207.12213

Country:

South America > Chile (0.04)
North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
Europe > Finland > Uusimaa > Helsinki (0.04)

Genre: Research Report > New Finding (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Diagnosis (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.45)

Add feedback

Prediction of Dilatory Behavior in eLearning: A Comparison of Multiple Machine Learning Models

Imhof, Christof, Comsa, Ioan-Sorin, Hlosta, Martin, Parsaeifard, Behnam, Moser, Ivan, Bergamin, Per

arXiv.org Machine LearningJun-30-2022

Procrastination, the irrational delay of tasks, is a common occurrence in online learning. Potential negative consequences include higher risk of drop-outs, increased stress, and reduced mood. Due to the rise of learning management systems and learning analytics, indicators of such behavior can be detected, enabling predictions of future procrastination and other dilatory behavior. However, research focusing on such predictions is scarce. Moreover, studies involving different types of predictors and comparisons between the predictive performance of various methods are virtually non-existent. In this study, we aim to fill these research gaps by analyzing the performance of multiple machine learning algorithms when predicting the delayed or timely submission of online assignments in a higher education setting with two categories of predictors: subjective, questionnaire-based variables and objective, log-data based indicators extracted from a learning management system. The results show that models with objective predictors consistently outperform models with subjective predictors, and a combination of both variable types perform slightly better. For each of these three options, a different approach prevailed (Gradient Boosting Machines for the subjective, Bayesian multilevel models for the objective, and Random Forest for the combined predictors). We conclude that careful attention should be paid to the selection of predictors and algorithms before implementing such models in learning management systems.

artificial intelligence, machine learning, predictor, (18 more...)

arXiv.org Machine Learning

2206.15079

Country:

Europe > United Kingdom > England > Bedfordshire (0.04)
North America > United States > Illinois > Cook County > Chicago (0.04)
North America > Canada (0.04)
(5 more...)

Genre: Research Report > New Finding (1.00)

Industry:

Education > Educational Technology > Educational Software > Computer Based Training (1.00)
Education > Educational Setting > Online (1.00)
Education > Educational Technology > Educational Software > Learning Management System (0.64)

Technology:

Information Technology > Enterprise Applications > Human Resources > Learning Management (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.46)

Add feedback

Open Problem: Properly learning decision trees in polynomial time?

Blanc, Guy, Lange, Jane, Qiao, Mingda, Tan, Li-Yang

arXiv.org Machine LearningJun-29-2022

The authors recently gave an $n^{O(\log\log n)}$ time membership query algorithm for properly learning decision trees under the uniform distribution (Blanc et al., 2021). The previous fastest algorithm for this problem ran in $n^{O(\log n)}$ time, a consequence of Ehrenfeucht and Haussler (1989)'s classic algorithm for the distribution-free setting. In this article we highlight the natural open problem of obtaining a polynomial-time algorithm, discuss possible avenues towards obtaining it, and state intermediate milestones that we believe are of independent interest.

algorithm, artificial intelligence, machine learning, (13 more...)

arXiv.org Machine Learning

2206.14431

Country:

North America > United States > California > Santa Clara County > Palo Alto (0.05)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)

Genre: Research Report (0.40)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Diagnosis (0.78)

Add feedback

The Applied Artificial Intelligence Workshop: Start working with AI today, to build games, design decision trees, and train your own machine learning models: So, Anthony, So, William, Nagy, Zsolt: 9781800205819: Amazon.com: Books

#artificialintelligenceJun-27-2022, 19:41:36 GMT

Zsolt Nagy is a software engineer, manager, tech lead, and mentor specializing in the development of maintainable web applications with cutting edge technologies since 2010. As a software engineer, Zsolt continuously challenges himself to stick to the highest possible standards. Zsolt puts extra effort into building a T-shaped profile in leadership and software engineering. You can read more about Zsolt's specializations by visiting his blogs. His tech blog (zsoltnagy.eu) is on improving your JavaScript skills by solving tech interviewing questions and developing real world web applications that you can monetize or display in your portfolio.

applied artificial intelligence workshop, design decision tree, zsolt, (7 more...)

#artificialintelligence

Industry:

Retail > Online (0.40)
Leisure & Entertainment > Games > Computer Games (0.40)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Diagnosis (0.40)
Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (0.40)

Add feedback

Introduction to Machine Learning: Supervised Learning

#artificialintelligenceJun-25-2022, 08:26:23 GMT

In this course, you'll be learning various supervised ML algorithms and prediction tasks applied to different data. You'll learn when to use which model and why, and how to improve the model performances. We will cover models such as linear and logistic regression, KNN, Decision trees and ensembling methods such as Random Forest and Boosting, kernel methods such as SVM. Prior coding or scripting knowledge is required. We will be utilizing Python extensively throughout the course.

learning, machine learning, supervised learning, (2 more...)

#artificialintelligence

Genre: Instructional Material > Course Syllabus & Notes (0.38)

Industry:

Education > Educational Technology > Educational Software > Computer Based Training (0.47)
Education > Educational Setting > Online (0.47)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (0.61)
Information Technology > Enterprise Applications > Human Resources > Learning Management (0.47)
Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.40)

Add feedback

Using regression techniques to predict a student's grade for a course

#artificialintelligenceJun-22-2022, 09:26:52 GMT

I will be using Keras and TensorFlow to train a deep neural network to predict the grade using 2 hidden layers, mean squared error loss, and an RMSprop optimizer. Let's graph the error and the loss during training and evaluate the model We are getting a 0.69 mean absolute error with this approach. We also need to save the model to deploy it in an API. Since I am using google Colab I can easily save it to google drive. Initialize a random forest with 100 decision trees and train it on the same data.

initialize, regression technique, student

#artificialintelligence

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (0.67)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.40)

Add feedback

Regression Trees on Grassmann Manifold for Adapting Reduced-Order Models

Liu, Xiao, Liu, Xinchao

arXiv.org Machine LearningJun-22-2022

Low dimensional and computationally less expensive Reduced-Order Models (ROMs) have been widely used to capture the dominant behaviors of high-dimensional systems. A ROM can be obtained, using the well-known Proper Orthogonal Decomposition (POD), by projecting the full-order model to a subspace spanned by modal basis modes which are learned from experimental, simulated or observational data, i.e., training data. However, the optimal basis can change with the parameter settings. When a ROM, constructed using the POD basis obtained from training data, is applied to new parameter settings, the model often lacks robustness against the change of parameters in design, control, and other real-time operation problems. This paper proposes to use regression trees on Grassmann Manifold to learn the mapping between parameters and POD bases that span the low-dimensional subspaces onto which full-order models are projected. Motivated by the fact that a subspace spanned by a POD basis can be viewed as a point in the Grassmann manifold, we propose to grow a tree by repeatedly splitting the tree node to maximize the Riemannian distance between the two subspaces spanned by the predicted POD bases on the left and right daughter nodes. Five numerical examples are presented to comprehensively demonstrate the performance of the proposed method, and compare the proposed tree-based method to the existing interpolation method for POD basis and the use of global POD basis. The results show that the proposed tree-based method is capable of establishing the mapping between parameters and POD bases, and thus adapt ROMs for new parameters.

artificial intelligence, machine learning, pod basis, (18 more...)

arXiv.org Machine Learning

2206.11324

Country:

North America > United States > Arkansas > Washington County > Fayetteville (0.14)
Europe > Switzerland > Basel-City > Basel (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Research Report > New Finding (0.48)

Industry: Education (0.34)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (0.62)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.46)

Add feedback

GitHub - microsoft/hummingbird: Hummingbird compiles trained ML models into tensor computation for faster inference.

#artificialintelligenceJun-12-2022, 16:10:05 GMT

Hummingbird compiles trained ML models into tensor computation for faster inference. - GitHub - microsoft/hummingbird: Hummingbird compiles trained ML models into tensor computation for faster inference.

hummingbird, ml model, node, (12 more...)

#artificialintelligence

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (0.59)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.41)

Add feedback

Sr. AWS DevOps Developer with AI/ML experience

#artificialintelligenceJun-9-2022, 00:11:00 GMT

Will accept BS in related field with minimum of 5 years of experience. A drive to learn and master new technologies and techniques. Excellent written and verbal communication skills for coordinating across teams. Green card or US citizen required. Will accept BS in related field with minimum of 5 years of experience.

ai ml experience, aw devops developer, machine-learning and operation research, (12 more...)

#artificialintelligence

Technology: Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (0.48)

Add feedback

Your ultimate AI/ML decision tree

#artificialintelligenceJun-8-2022, 01:25:52 GMT

The services that will work best for you will depend on your specific use case and your team's level of expertise. Because it takes a lot of effort and ML expertise to build and maintain high quality ML models, a general rule of thumb is to use pretrained models or AI solutions whenever possible -- that is, when they fit your use case. If your data is structured, and it's in BigQuery, and your users are already comfortable with SQL, then choose BigQuery ML. If you realize that your use case requires writing your own model code, then use custom training options in Vertex AI. Let's look at your options in some more detail.

expertise, ultimate ai ml decision tree, unstructured data, (9 more...)

#artificialintelligence

Industry: Information Technology > Services (0.40)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Diagnosis (0.40)
Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (0.40)

Add feedback