AITopics

Genre: Instructional Material > Course Syllabus & Notes (1.00)

Industry: Education (0.63)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (1.00)

#artificialintelligenceAug-26-2022, 19:46:42 GMT

A Complete Guide To Decision Tree Software - KDnuggets

A decision tree software is a machine learning-led application that helps take the best action and organize data to form the most relevant and compatible decisions. Pictorially, a decision tree is a tree-like framework with nodes containing information. Decision trees categorize and classify relevant datasets into meaningful and easily interpretable information bases. Further, decision trees can also be trained to predict future actions based on previous data submitted to the framework. Decision tree models are used to classify information into meaningful sequential results.

decision tree, information, node, (10 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (1.00)

Laber, Eduardo, Murtinho, Lucas, Oliveira, Felipe

Shallow decision trees for explainable $k$-means clustering

arXiv.org Artificial IntelligenceAug-26-2022

A number of recent works have employed decision trees for the construction of explainable partitions that aim to minimize the $k$-means cost function. These works, however, largely ignore metrics related to the depths of the leaves in the resulting tree, which is perhaps surprising considering how the explainability of a decision tree depends on these depths. To fill this gap in the literature, we propose an efficient algorithm that takes into account these metrics. In experiments on 16 datasets, our algorithm yields better results than decision-tree clustering algorithms such as the ones presented in \cite{dasgupta2020explainable}, \cite{frost2020exkmc}, \cite{laber2021price} and \cite{DBLP:conf/icml/MakarychevS21}, typically achieving lower or equivalent costs with considerably shallower trees. We also show, through a simple adaptation of existing techniques, that the problem of building explainable partitions induced by binary trees for the $k$-means cost function does not admit an $(1+\epsilon)$-approximation in polynomial time unless $P=NP$, which justifies the quest for approximation algorithms and/or heuristics.

algorithm, exshallow, partition, (17 more...)

2112.14718

Country:

South America > Brazil > Rio de Janeiro > Rio de Janeiro (0.04)
North America > United States > New York > New York County > New York City (0.04)
North America > United States > California > Monterey County > Monterey (0.04)
(2 more...)

Genre: Research Report > Experimental Study (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Diagnosis (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (0.68)

Cho, Woon Hyung, Shin, Jiseon, Kim, Young Duck, Jung, George J.

Pixel-wise classification in graphene-detection with tree-based machine learning algorithms

arXiv.org Artificial IntelligenceAug-24-2022

Mechanical exfoliation of graphene and its identification by optical inspection is one of the milestones in condensed matter physics that sparked the field of 2D materials. Finding regions of interest from the entire sample space and identification of layer number is a routine task potentially amenable to automatization. We propose supervised pixel-wise classification methods showing a high performance even with a small number of training image datasets that require short computational time without GPU. We introduce four different tree-based machine learning algorithms -- decision tree, random forest, extreme gradient boost, and light gradient boosting machine. We train them with five optical microscopy images of graphene, and evaluate their performances with multiple metrics and indices. We also discuss combinatorial machine learning models between the three single classifiers and assess their performances in identification and reliability. The code developed in this paper is open to the public and will be released at github.com/gjung-group/Graphene_segmentation.

artificial intelligence, classifier, machine learning, (19 more...)

2209.07578

Country:

Asia > South Korea > Seoul > Seoul (0.05)
North America > United States > New York > New York County > New York City (0.04)
Europe > Slovenia > Drava > Municipality of Benedikt > Benedikt (0.04)

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (0.89)
Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (0.89)

Zhang, Guangyi, Gionis, Aristides

Regularized impurity reduction: Accurate decision trees with complexity guarantees

arXiv.org Artificial IntelligenceAug-23-2022

Decision trees are popular classification models, providing high accuracy and intuitive explanations. However, as the tree size grows the model interpretability deteriorates. Traditional tree-induction algorithms, such as C4.5 and CART, rely on impurity-reduction functions that promote the discriminative power of each split. Thus, although these traditional methods are accurate in practice, there has been no theoretical guarantee that they will produce small trees. In this paper, we justify the use of a general family of impurity functions, including the popular functions of entropy and Gini-index, in scenarios where small trees are desirable, by showing that a simple enhancement can equip them with complexity guarantees. We consider a general setting, where objects to be classified are drawn from an arbitrary probability distribution, classification can be binary or multi-class, and splitting tests are associated with non-uniform costs. As a measure of tree complexity, we adopt the expected cost to classify an object drawn from the input distribution, which, in the uniform-cost case, is the expected number of tests. We propose a tree-induction algorithm that gives a logarithmic approximation guarantee on the tree complexity. This approximation factor is tight up to a constant factor under mild assumptions. The algorithm recursively selects a test that maximizes a greedy criterion defined as a weighted sum of three components. The first two components encourage the selection of tests that improve the balance and the cost-efficiency of the tree, respectively, while the third impurity-reduction component encourages the selection of more discriminative tests. As shown in our empirical evaluation, compared to the original heuristics, the enhanced algorithms strike an excellent balance between predictive accuracy and tree complexity.

artificial intelligence, decision tree, machine learning, (17 more...)

doi: 10.1007/s10618-022-00884-7

2208.10949

Country:

Europe > Finland (0.04)
Europe > Sweden > Stockholm > Stockholm (0.04)

Genre: Research Report (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (1.00)

Belcak, Peter, Wattenhofer, Roger

Deterministic Graph-Walking Program Mining

arXiv.org Artificial IntelligenceAug-22-2022

Owing to their versatility, graph structures admit representations of intricate relationships between the separate entities comprising the data. We formalise the notion of connection between two vertex sets in terms of edge and vertex features by introducing graph-walking programs. We give two algorithms for mining of deterministic graph-walking programs that yield programs in the order of increasing length. These programs characterise linear long-distance relationships between the given two vertex sets in the context of the whole graph.

data mining, machine learning, vertex, (19 more...)

2208.1029

Country:

Europe > Switzerland > Zürich > Zürich (0.04)
North America > United States (0.04)

Genre: Research Report (0.50)

Industry: Education > Educational Setting (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (0.93)
Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.68)
Information Technology > Data Science > Data Mining (0.68)

arXiv.org Artificial IntelligenceAug-22-2022

MetaRF: Differentiable Random Forest for Reaction Yield Prediction with a Few Trails

Chen, Kexin, Chen, Guangyong, Li, Junyou, Huang, Yuansheng, Heng, Pheng-Ann

Artificial intelligence has deeply revolutionized the field of medicinal chemistry with many impressive applications, but the success of these applications requires a massive amount of training samples with high-quality annotations, which seriously limits the wide usage of data-driven methods. In this paper, we focus on the reaction yield prediction problem, which assists chemists in selecting high-yield reactions in a new chemical space only with a few experimental trials. To attack this challenge, we first put forth MetaRF, an attention-based differentiable random forest model specially designed for the few-shot yield prediction, where the attention weight of a random forest is automatically optimized by the meta-learning framework and can be quickly adapted to predict the performance of new reagents while given a few additional samples. To improve the few-shot learning performance, we further introduce a dimension-reduction based sampling method to determine valuable samples to be experimentally tested and then learned. Our methodology is evaluated on three different datasets and acquires satisfactory performance on few-shot prediction. In high-throughput experimentation (HTE) datasets, the average yield of our methodology's top 10 high-yield reactions is relatively close to the results of ideal yield selection.

dataset, prediction, reaction, (13 more...)

2208.10083

Country:

North America > United States (0.28)
Asia > China > Hong Kong (0.05)
Asia > China > Zhejiang Province > Hangzhou (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Genre:

Research Report > Experimental Study (0.48)
Research Report > New Finding (0.48)

Industry:

Health & Medicine > Pharmaceuticals & Biotechnology (0.66)
Materials > Chemicals (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (0.96)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.94)

#artificialintelligenceAug-21-2022, 02:24:28 GMT

La veille de la cybersécurité

Classification is a two-step process, learning step and prediction step, in machine learning. In the learning step, the model is developed based on given training data. In the prediction step, the model is used to predict the response for given data. Decision Tree is one of the easiest and popular classification algorithms to understand and interpret. Decision Tree algorithm belongs to the family of supervised learning algorithms.

algorithm, customer, supervised learning algorithm, (6 more...)

Industry: Banking & Finance (0.35)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Diagnosis (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (1.00)

#artificialintelligenceAug-20-2022, 16:45:12 GMT

How Companies Are Using AI to Alleviate Labor Shortages

Three of every four companies have reported talent or labor shortages and difficulty hiring–a 16-year high. Profound social, economic and demographic changes have created unmet demands for workers in industries ranging from hospitality to logistics to healthcare. Executives across sectors are struggling to attract and retain talent and it's likely that labor shortages will remain a critical issue for many organizations moving forward. However, the rapid advances in artificial intelligence (AI) have the potential to significantly disrupt labor markets. Leading organizations are using AI technologies to reduce the impact of labor shortages and improve their competitive position, while also saving on costs. Here's how they're putting AI and big data to use: Some say a non-supportive and unpleasant work environment is the reason their employees quit, creating labor shortages.

alleviate labor shortage, labor shortage, productivity gain, (8 more...)

Country: North America > United States > South Carolina > York County > Rock Hill (0.05)

Industry: Health & Medicine (0.50)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (1.00)

#artificialintelligenceAug-19-2022, 08:20:53 GMT

[100%OFF] Decision Trees, Random Forests, Bagging & XGBoost: R Studio

You're looking for a complete Decision tree course that teaches you everything you need to create a Decision tree/ Random Forest/ XGBoost model in R, right? You've found the right Decision Trees and tree based advanced techniques course! How this course will help you? A Verifiable Certificate of Completion is presented to all students who undertake this Machine learning advanced course. If you are a business manager or an executive, or a student who wants to learn and apply machine learning in Real world problems of business, this course will give you a solid base for that by teaching you some of the advanced technique of machine learning, which are Decision tree, Random Forest, Bagging, AdaBoost and XGBoost.

decision tree, learning, machine learning, (12 more...)

Genre: Instructional Material > Course Syllabus & Notes (1.00)

Industry: Education (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (1.00)