Goto

Collaborating Authors

 Decision Tree Learning



Machine learning as a service ? Might lose sleep over this !

@machinelearnbot

This post is'not' intended to teach people how to use popular predictive modelling APIs for free. Although, to your surprise, this isn't a far fetched possibility. Trained Machine learning models are basically a function that maps feature vectors to the output variable. Upon querying with a test instance, the model predicts an outcome, assigning probability scores to all the possible classes. Google, Amazon etc provides public facing APIs to train predictive models on the subscriber's data, the model can further be used for prediction purposes .


Random Forest Missing Data Algorithms

arXiv.org Machine Learning

Random forest (RF) missing data algorithms are an attractive approach for dealing with missing data. They have the desirable properties of being able to handle mixed types of missing data, they are adaptive to interactions and nonlinearity, and they have the potential to scale to big data settings. Currently there are many different RF imputation algorithms but relatively little guidance about their efficacy, which motivated us to study their performance. Using a large, diverse collection of data sets, performance of various RF algorithms was assessed under different missing data mechanisms. Algorithms included proximity imputation, on the fly imputation, and imputation utilizing multivariate unsupervised and supervised splitting---the latter class representing a generalization of a new promising imputation algorithm called missForest. Performance of algorithms was assessed by ability to impute data accurately. Our findings reveal RF imputation to be generally robust with performance improving with increasing correlation. Performance was good under moderate to high missingness, and even (in certain cases) when data was missing not at random.


Farm Robot Learns What Weeds Look Like, Smashes Them

AITopics Original Links

Bonirob is more than 90 percent effective in destroying weeds in carrot cultivation trials. While the world's first fully-robotic farm will operate indoors, traditional outdoor farms aren't immune to the coming robotic revolution. Bonirob, developed by Bosch's Deepfield Robotics, is billed to eliminate some of the most tedious tasks in modern farming, plant breeding, and weeding. The autonomous robot is built to be a mobile plant lab, able to decide which strains of plant are most apt to survive insects and viruses and how much fertilizer they would need, and then smash any weeds with a ramming rod. Bonirob employs a type of machine learning (a stab at artificial intelligence) called decision tree learning. Researchers show Bonirob lots of pictures of healthy leaves that are tagged to be good, and pictures of weeds that are tagged to be bad, and the machine makes a series of choices based on observed in new data to judge whether a plant in the field is good or bad.


Software for Data Mining, Analytics,Data Science, and Knowledge Discovery

AITopics Original Links

Classification software: building models to separate 2 or more discrete classes using Multiple methods Decision Tree Rules Neural Bayesian SVM Genetic, Rough Sets, Fuzzy Logic and other approaches Analysis of results, ROC Social Network Analysis, Link Analysis, and Visualization software Text Analysis, Text Mining, and Information Retrieval (IR) Web Analytics and Social Media Analytics software. BI (Business Intelligence), Database and OLAP software Data Transformation, Data Cleaning, Data Cleansing Libraries, Components and Developer Kits for creating embedded data mining applications Web Content Mining, web scraping, screen scraping.


R: Decision Trees (Regression)

#artificialintelligence

Decision Trees are popular supervised machine learning algorithms. You will often find the abbreviation CART when reading up on decision trees. CART stands for Classification and Regression Trees. In this example we are going to create a Regression Tree. Meaning we are going to attempt to build a model that can predict a numeric value.


Machine Learning and the Law โ€“ Louis Dorard -- Blog

#artificialintelligence

Last week I went to the workshops at NIPS (biggest ML conference in the world) and I also attended part of the ML and the Law symposium the day before. I found out a little bit too late about the symposia but I was still able to attend two panels on which there were both lawyers and computer scientists. They were very insightful and informative -- did you know that this Spring, the European Union passed a regulation giving its citizens a "right to an explanation" for decisions made by machine-learning systems? The panel discussions were motivated by the problem of explaining ML-powered decisions which have an important impact on people's lives: We need to be able to test how systems get to their conclusions; if we can't test, we can't contest. Individuals are entitled to know which data is being processed of them, and to explanations of how predictions & decisions work, in terms they can understand.


Classification Using Tree Based Models

#artificialintelligence

Machine Learning can sound very complicated, but anyone with a will to learn can successfully apply it, if they approach it from first principles. This course, Classification Using Tree Based Models, covers a specific class of Machine Learning problems - classification problems and how to solve these problems using Tree based models. First, you'll learn about building and visualizing decision trees as well as recognizing the serious problem of overfitting and its causes. Next, you'll learn about using ensemble learning to overcome overfitting. Finally, you'll explore 2 specific ensemble learning techniques - Random Forests and Gradient boosted trees By the end of this course, you'll be able to recognize opportunities where you can use Tree based models to solve classification problems and measure how well your solution is doing.



Why do Decision Trees Work?

#artificialintelligence

In this article we will discuss the machine learning method called "decision trees", moving quickly over the usual "how decision trees work" and spending time on "why decision trees work." We will write from a computational learning theory perspective, and hope this helps make both decision trees and computational learning theory more comprehensible. The goal of this article is to set up terminology so we can state in one or two sentences why decision trees tend to work well in practice. Newcomers to data science are often disappointed to learn that the job of the data scientist isn't tweaking and inventing new machine learning algorithms. In the "big data" world supervised learning has been a solved problem since at least 1951 (see [FixHodges1951] for neighborhood density methods, see [GordonOlshen1978] for k-nearest neighbor and decision tree methods).