Performance Analysis
Dimension Reduction Using Rule Ensemble Machine Learning Methods: A Numerical Study of Three Ensemble Methods
DeMasi, Orianna, Meza, Juan, Bailey, David H.
Ensemble methods for supervised machine learning have become popular due to their ability to accurately predict class labels with groups of simple, lightweight "base learners." While ensembles offer computationally efficient models that have good predictive capability they tend to be large and offer little insight into the patterns or structure in a dataset. We consider an ensemble technique that returns a model of ranked rules. The model accurately predicts class labels and has the advantage of indicating which parameter constraints are most useful for predicting those labels. An example of the rule ensemble method successfully ranking rules and selecting attributes is given with a dataset containing images of potential supernovas where the number of necessary features is reduced from 39 to 21. We also compare the rule ensemble method on a set of multi-class problems with boosting and bagging, which are two well known ensemble techniques that use decision trees as base learners, but do not have a rule ranking scheme.
Structure Selection from Streaming Relational Data
Mihalkova, Lilyana, Moustafa, Walaa Eldin
Statistical relational learning techniques have been successfully applied in a wide range of relational domains. In most of these applications, the human designers capitalized on their background knowledge by following a trial-and-error trajectory, where relational features are manually defined by a human engineer, parameters are learned for those features on the training data, the resulting model is validated, and the cycle repeats as the engineer adjusts the set of features. This paper seeks to streamline application development in large relational domains by introducing a light-weight approach that efficiently evaluates relational features on pieces of the relational graph that are streamed to it one at a time. We evaluate our approach on two social media tasks and demonstrate that it leads to more accurate models that are learned faster.
Robust graphical modeling of gene networks using classical and alternative T-distributions
Finegold, Michael, Drton, Mathias
Graphical Gaussian models have proven to be useful tools for exploring network structures based on multivariate data. Applications to studies of gene expression have generated substantial interest in these models, and resulting recent progress includes the development of fitting methodology involving penalization of the likelihood function. In this paper we advocate the use of multivariate $t$-distributions for more robust inference of graphs. In particular, we demonstrate that penalized likelihood inference combined with an application of the EM algorithm provides a computationally efficient approach to model selection in the $t$-distribution case. We consider two versions of multivariate $t$-distributions, one of which requires the use of approximation techniques. For this distribution, we describe a Markov chain Monte Carlo EM algorithm based on a Gibbs sampler as well as a simple variational approximation that makes the resulting method feasible in large problems.
Believe Me—We Can Do This! Annotating Persuasive Acts in Blog Text
Anand, Pranav (University of California, Santa Cruz) | King, Joseph (University of California, Santa Cruz) | Boyd-Graber, Jordan (University of Maryland) | Wagner, Earl (University of Maryland) | Martell, Craig (The Naval Postgraduate School) | Oard, Doug (University of Maryland) | Resnik, Philip (University of Maryland)
This paper describes the development of a corpus of blog posts that are annotated for the presence of attempts to persuade and corresponding tactics employed in persuasive messages. We investigate the feasibility of classifying blog posts as persuasive or non-persuasive on the basis of lexical features in the text and the tactics (as provided by human annotators). Annotated tactics provide substantial assistance in classifying persuasion, particularly tactics indicating formal reasoning, deontic obligation, and discussions of possible outcomes, suggesting that learning to identify tactics may be an excellent first step to detecting attempts to persuade.
Automatic Identity Inference for Smart TVs
Saluja, Avneesh Singh (Carnegie Mellon University) | Mokaya, Frank (Carnegie Mellon University) | Phielipp, Mariano (Intel Corporation) | Kveton, Branislav (Technicolor)
In 2009, an average American spent 3 hours per day watching TV. Recent advances in TV entertainment technologies, such as on-demand content, browsing the Internet, and 3D displays, have changed the traditional role of the TV and turned it into the center of home entertainment. Most of these technologies are personal and would benefit from seamless identification of who sits in front of the TV. In this work, we propose a practical and highly accurate solution to this problem. This solution uses a camera, which is mounted on a TV, to recognize faces of people in front of the TV. To make the approach practical, we employ online learning on graphs and show that we can learn highly accurate face models in difficult circumstances from as little as one labeled example. To evaluate our solutions, we collected a 10-hour long dataset of 8 people who watch TV. Our precision and recall are in the upper nineties, and show the promise of utilizing our approach in an embedded setting.
Untangling Topic Threads in Chat-Based Communication: A Case Study
Ramachandran, Sowmya (Stottler Henke Associates Inc.) | Jensen, Randy (Stottler Henke Associates Inc.) | Bascara, Oscar (Stottler Henke Associates Inc.) | Carpenter, Tamitha (Stottler Henke Associates Inc.) | Denning, Todd (US Air Force) | Sucillon, Lt. Shaun (US Air Force Research Laboratory)
Analyzing chat traffic has important applications for both the military and the civilian world. This paper presents a case study of a real-world application of chat analysis in support of team training exercise in the military. It compares the results of an unsupervised learning approach with those of a supervised classification approach. The paper also discusses some of the specific challenges presented by this domain.
Machine Learning and Sensor Fusion for Estimating Continuous Energy Expenditure
Vyas, Nisarg (BodyMedia Inc.) | Farringdon, Jonathan (BodyMedia Inc.) | Andre, David (BodyMedia Inc.) | Stivoric, John (Ivo) (BodyMedia Inc.)
In this paper we provide insight into the BodyMedia FIT® armband system — a wearable multi-sensor technology that achieves the goals of continuous physiological monitoring (especially energy expenditure estimation) and weight management using machine learning and data modeling methods. This system has been commercially available since 2001 and more than half a million users have used the system to track their physiological parameters and to achieve their individual health goals including weight-loss. We describe several challenges that arise in applying machine learning techniques to the health care domain and present various solutions utilized in the armband system. We demonstrate how machine learning and multi-sensor data fusion techniques are critical to the system’s succ
Playing to Program: Towards an Intelligent Programming Tutor for RUR-PLE
desJardins, Marie (University of Maryland Baltimore County) | Ciavolino, Amy (University of Maryland Baltimore County) | Deloatch, Robert (University of Maryland Baltimore County) | Feasley, Eliana (University of Maryland Baltimore County)
Intelligent tutoring systems (ITSs) provide students with a one-on-one tutor, allowing them to work at their own pace, and helping them to focus on their weaker areas. The RUR1–Python Learning Environment (RUR-PLE), a game-like virtual environment to help students learn to program, provides an interface for students to write their own Python code and visualize the code execution (Roberge 2005). RUR-PLE provides a fixed sequence of learning lessons for students to explore. We are extending RUR-PLE to develop the Playing to Program (PtP) ITS, which consists of three components: (1) a Bayesian student model that tracks student competence, (2) a diagnosis module that provides tailored feedback to students, and (3) a problem selection module that guides the student’s learning process. In this paper, we summarize RUR-PLE and the PtP design, and describe an ongoing user study to evaluate the predictive accuracy of our student modeling approach.
Sparse Matrix-Variate t Process Blockmodels
Xu, Zenglin (Purdue University) | Yan, Feng (Purdue University) | Qi, Yuan (Purdue University)
We consider the problem of modeling network interactions and identifying latent groups of network nodes. This problem is challenging due to the facts i) that the network nodes are interdependent instead of independent, ii) that the network data are very noisy (e.g., missing edges), and iii) that the network interactions are often sparse. To address these challenges, we propose a Sparse Matrix-variate t process Blockmodel (SMTB). In particular, we generalize a matrix-variate t distribution to a t process on matrices with nonlinear covariance functions. Due to this generalization, our model can estimate latent memberships for individual network nodes. This separates our model from previous t distribution based relational models. Also, we introduce sparse prior distributions on the latent membership parameters to select group assignments for individual nodes. To learn the model efficiently from data, we develop a variational method. When compared with several state-of-the-art models, including the predictive matrix-variate t models and mixed membership stochastic blockmodels, our model achieved improved prediction accuracy on real world network datasets.
Trajectory Regression on Road Networks
Ide, Tsuyoshi (IBM Research - Tokyo) | Sugiyama, Masashi (Tokyo Institute of Technology)
This paper addresses the task of trajectory cost prediction, a new learning task for trajectories. The goal of this task is to predict the cost for an arbitrary (possibly unknown) trajectory, based on a set of previous trajectory-cost pairs. A typical example of this task is travel-time prediction on road networks. The main technical challenge here is to infer the costs of trajectories including links with no or little passage history. To tackle this, we introduce a weight propagation mechanism over the links, and show that the problem can be reduced to a simple form of kernel ridge regression. We also show that this new formulation leads us to a unifying view, where a natural choice of the kernel is suggested to an existing kernel-based alternative.