AITopics

1808.10406

Country:

South America > Brazil > São Paulo (0.04)
Europe > Netherlands > North Brabant > Eindhoven (0.04)
Europe > Portugal > Porto > Porto (0.04)
(2 more...)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.67)
Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (0.67)

#artificialintelligenceAug-29-2018, 14:06:15 GMT

How can I build AI capabilities for the data center?

Trying to manage a sprawling, complex data center via manual data input and monitoring can increase the likelihood... You forgot to provide an Email Address. This email address doesn't appear to be valid. This email address is already registered. You have exceeded the maximum character limit.

ai capability, artificial intelligence, machine learning, (8 more...)

Genre: Frequently Asked Questions (FAQ) (0.40)

Industry: Information Technology > Services (0.73)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (0.32)

Kanbar, Lara J., Onu, Charles C., Shalish, Wissam, Brown, Karen A., Sant'Anna, Guilherme M., Kearney, Robert E., Precup, Doina

Undersampling and Bagging of Decision Trees in the Analysis of Cardiorespiratory Behavior for the Prediction of Extubation Readiness in Extremely Preterm Infants

arXiv.org Machine LearningAug-23-2018

Abstract-- Extremely preterm infants often require endotracheal intubation and mechanical ventilation during the first days of life. Due to the detrimental effects of prolonged invasive mechanical ventilation (IMV), clinicians aim to extubate infants as soon as they deem them ready. Unfortunately, existing strategies for prediction of extubation readiness vary across clinicians and institutions, and lead to high reintubation rates. We present an approach using Random Forest classifiers for the analysis of cardiorespiratory variability to predict extubation readiness. We address the issue of data imbalance by employing random undersampling of examples from the majority class before training each Decision Tree in a bag. By incorporating clinical domain knowledge, we further demonstrate that our classifier could have identified 71% of infants who failed extubation, while maintaining a success detection rate of 78%.

artificial intelligence, classifier, machine learning, (13 more...)

1808.07992

Country:

North America > Canada > Quebec > Montreal (0.16)
Oceania > Australia (0.04)
North America > United States > Rhode Island > Providence County > Providence (0.04)
North America > United States > Michigan > Wayne County > Detroit (0.04)

Genre:

Research Report > Experimental Study (1.00)
Research Report > New Finding (0.68)

Industry:

Health & Medicine > Therapeutic Area > Cardiology/Vascular Diseases (0.94)
Health & Medicine > Health Care Providers & Services (0.69)
Health & Medicine > Health Care Technology (0.68)
Health & Medicine > Diagnostic Medicine (0.68)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (0.97)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.95)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.93)

Zhou, Yichen, Zhou, Zhengze, Hooker, Giles

Approximation Trees: Statistical Stability in Model Distillation

arXiv.org Machine LearningAug-22-2018

Approximation Trees: Statistical Stability in Model Distillation Yichen Zhou, Zhengze Zhou, Giles Hooker Department of Statistical Science Cornell University Ithaca, NY 14853, USA Abstract This paper examines the stability of learned explanations for black-box predictions via model distillation with decision trees. One approach to intelligibility in machine learning is to use an understandable "student" model to mimic the output of an accurate "teacher". Here, we consider the use of regression trees as a student model, in which nodes of the tree can be used as "explanations" for particular predictions, and the whole structure of the tree can be used as a global representation of the resulting function. However, individual trees are sensitive to the particular data sets used to train them, and an interpretation of a student model may be suspect if small changes in the training data have a large effect on it. In this context, access to outcomes from a teacher helps to stabilize the greedy splitting strategy by generating a much larger corpus of training examples than was originally available. We develop tests to ensure that enough examples are generated at each split so that the same splitting rule would be chosen with high probability were the tree to be retrained. Further, we develop a stopping rule to indicate how deep the tree should be built based on recent results on the variability of Random Forests when these are used as the teacher. We provide concrete examples of these procedures on the CAD-MDD and COMPAS data sets. 1 Introduction This paper examines the use of regression trees for model distillation. While Machine Learning has traditionally focused on predictive performance, there has been considerable recent interest in "X-raying the black box": finding methods to make the ways in which neural networks, Random Forests and other predictive models arrive at their predictions understandable to humans. This problem can be approached by creating summaries of these models such as variable importance scores (Breiman, 2001), partial dependence or ICE plots (Friedman, 2001; Goldstein et al., 2013), saliency maps (Simonyan et al., 2013) and other local explanations (Ribeiro et al., 2016). It can also be approached by developing intelligible "student" models which mimic the predictions of the original "teacher" black box: a strategy encompassed by the term model distillation . Within model distillation, common student models are generalized additive models (GAMS: see Lou et al. (2012); Tan et al. (2017), Hooker (2007) provides a link between these and PDPs) and decision trees Breiman et al. (1984); Quinlan (1987), which are our focus. Decision trees have an intelligible graphical representation and can automatically fit complex high-dimensional functions, both of which make them appealing as student models.

approximation tree, artificial intelligence, machine learning, (18 more...)

1808.07573

Country:

North America > United States > New York > Tompkins County > Ithaca (0.24)
North America > United States > New York > New York County > New York City (0.04)
North America > United States > Florida > Broward County (0.04)

Genre:

Research Report > New Finding (0.67)
Research Report > Experimental Study (0.47)

Industry: Health & Medicine > Therapeutic Area > Psychiatry/Psychology (0.93)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (1.00)

#artificialintelligenceAug-20-2018, 19:19:01 GMT

Random Forests · UC Business Analytics R Programming Guide

Bagging (bootstrap aggregating) regression trees is a technique that can turn a single tree model with high variance and poor predictive power into a fairly accurate prediction function. Unfortunately, bagging regression trees typically suffers from tree correlation, which reduces the overall performance of the model. Random forests are a modification of bagging that builds a large collection of de-correlated trees and have become a very popular "out-of-the-box" learning algorithm that enjoys good predictive performance. This tutorial will cover the fundamentals of random forests. This tutorial serves as an introduction to the random forests.

artificial intelligence, decision tree learning, machine learning, (17 more...)

Genre: Instructional Material > Course Syllabus & Notes (0.55)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (1.00)

arXiv.org Artificial IntelligenceAug-20-2018

Discovering Context Specific Causal Relationships

Ma, Saisai, Li, Jiuyong, Liu, Lin, Le, Thuc Duy

With the increasing need of personalised decision making, such as personalised medicine and online recommendations, a growing attention has been paid to the discovery of the context and heterogeneity of causal relationships. Most existing methods, however, assume a known cause (e.g. a new drug) and focus on identifying from data the contexts of heterogeneous effects of the cause (e.g. patient groups with different responses to the new drug). There is no approach to efficiently detecting directly from observational data context specific causal relationships, i.e. discovering the causes and their contexts simultaneously. In this paper, by taking the advantages of highly efficient decision tree induction and the well established causal inference framework, we propose the Tree based Context Causal rule discovery (TCC) method, for efficient exploration of context specific causal relationships from data. Experiments with both synthetic and real world data sets show that TCC can effectively discover context specific causal rules from the data.

artificial intelligence, causal rule, machine learning, (17 more...)

arXiv.org Artificial Intelligence

1808.06316

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Oceania > Australia > South Australia (0.04)
North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
(2 more...)

Genre:

Research Report > Experimental Study (0.93)
Research Report > New Finding (0.93)

Industry:

Health & Medicine > Therapeutic Area > Oncology (1.00)
Health & Medicine > Pharmaceuticals & Biotechnology (1.00)
Health & Medicine > Therapeutic Area > Obstetrics/Gynecology (0.69)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (0.91)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.47)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.47)

#artificialintelligenceAug-19-2018, 11:42:10 GMT

How to Visualize a Decision Tree from a Random Forest in Python using Scikit-Learn

File: This makes use of the export_graphviz function in Scikit-Learn. There are many parameters here that control the look and information displayed. Take a look at the documentation for specifics. This requires installation of graphviz which includes the dot utility. For the complete options for conversion, take a look at the documentation.

artificial intelligence, machine learning, scikit-learn, (6 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (1.00)

Oskooei, Ali, Manica, Matteo, Mathis, Roland, Martinez, Maria Rodriguez

Network-based Biased Tree Ensembles (NetBiTE) for Drug Sensitivity Prediction and Drug Sensitivity Biomarker Identification in Cancer

arXiv.org Machine LearningAug-18-2018

We present the Network-based Biased Tree Ensembles (NetBiTE) method for drug sensitivity prediction and drug sensitivity biomarker identification in cancer using a combination of prior knowledge and gene expression data. Our devised method consists of a biased tree ensemble that is built according to a probabilistic bias weight distribution. The bias weight distribution is obtained from the assignment of high weights to the drug targets and propagating the assigned weights over a protein-protein interaction network such as STRING. The propagation of weights, defines neighborhoods of influence around the drug targets and as such simulates the spread of perturbations within the cell, following drug administration. Using a synthetic dataset, we showcase how application of biased tree ensembles (BiTE) results in significant accuracy gains at a much lower computational cost compared to the unbiased random forests (RF) algorithm. We then apply NetBiTE to the Genomics of Drug Sensitivity in Cancer (GDSC) dataset and demonstrate that NetBiTE outperforms RF in predicting IC50 drug sensitivity, only for drugs that target membrane receptor pathways (MRPs): RTK, EGFR and IGFR signaling pathways. We propose based on the NetBiTE results, that for drugs that inhibit MRPs, the expression of target genes prior to drug administration is a biomarker for IC50 drug sensitivity following drug administration. We further verify and reinforce this proposition through control studies on, PI3K/MTOR signaling pathway inhibitors, a drug category that does not target MRPs, and through assignment of dummy targets to MRP inhibiting drugs and investigating the variation in NetBiTE accuracy.

artificial intelligence, machine learning, netbite, (17 more...)

1808.06603

Country:

Europe > Switzerland > Zürich > Zürich (0.14)
North America > United States > New York (0.04)
North America > United States > Massachusetts (0.04)
(2 more...)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (0.94)

Industry:

Health & Medicine > Pharmaceuticals & Biotechnology (1.00)
Health & Medicine > Therapeutic Area > Oncology > Leukemia (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (0.69)

#artificialintelligenceAug-16-2018, 21:52:32 GMT

how_decision_trees_work.html

Decision trees are one of my favorite models. They are simple, and they are powerful. In fact most high performing Kaggle entries are a combination of XGBoost, which is variant of decision tree, and some very clever feature engineering. The concept behind decision trees is refreshingly straightforward. Imagine creating a data set by recording the time you left your house, and noting whether you arrived at work on time.

artificial intelligence, decision tree learning, machine learning, (17 more...)

Country: North America (0.04)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Diagnosis (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (1.00)

arXiv.org Machine LearningAug-15-2018

Shedding Light on Black Box Machine Learning Algorithms: Development of an Axiomatic Framework to Assess the Quality of Methods that Explain Individual Predictions

Honegger, Milo

From self-driving vehicles and back-flipping robots to virtual assistants who book our next appointment at the hair salon or at that restaurant for dinner - machine learning systems are becoming increasingly ubiquitous. The main reason for this is that these methods boast remarkable predictive capabilities. However, most of these models remain black boxes, meaning that it is very challenging for humans to follow and understand their intricate inner workings. Consequently, interpretability has suffered under this ever-increasing complexity of machine learning models. Especially with regards to new regulations, such as the General Data Protection Regulation (GDPR), the necessity for plausibility and verifiability of predictions made by these black boxes is indispensable. Driven by the needs of industry and practice, the research community has recognised this interpretability problem and focussed on developing a growing number of so-called explanation methods over the past few years. These methods explain individual predictions made by black box machine learning models and help to recover some of the lost interpretability. With the proliferation of these explanation methods, it is, however, often unclear, which explanation method offers a higher explanation quality, or is generally better-suited for the situation at hand. In this thesis, we thus propose an axiomatic framework, which allows comparing the quality of different explanation methods amongst each other. Through experimental validation, we find that the developed framework is useful to assess the explanation quality of different explanation methods and reach conclusions that are consistent with independent research.

explanation, machine learning, natural language, (22 more...)

1808.05054

Country:

North America > United States > Wisconsin > Price County (0.04)
North America > United States > New York > New York County > New York City (0.04)
Europe > Germany > Baden-Württemberg > Karlsruhe Region > Karlsruhe (0.04)

Genre:

Research Report > New Finding (1.00)
Overview (0.92)

Industry:

Transportation (1.00)
Law (1.00)
Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Explanation & Argumentation (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
(6 more...)