Industry
Machine Learning and Sensor Fusion for Estimating Continuous Energy Expenditure
Vyas, Nisarg (BodyMedia, Inc.) | Farringdon, Jonathan (BodyMedia Inc.) | Andre, David (Cerebellum Capital, Inc.) | Stivoric, John Ivo (BodyMedia)
In this article we provide insight into the BodyMedia FIT armband system โ a wearable multi-sensor technology that continuously monitors physiological events related to energy expenditure for weight management using machine learning and data modeling methods. Since becoming commercially available in 2001, more than half a million users have used the system to track their physiological parameters and to achieve their individual health goals including weight-loss. We describe several challenges that arise in applying machine learning techniques to the health care domain and present various solutions utilized in the armband system. We demonstrate how machine learning and multi-sensor data fusion techniques are critical to the systemโs success.
NewsFinder: Automating an AI News Service
Eckroth, Joshua (The Ohio State University) | Dong, Liang (Clemson University) | Smith, Reid G. (Marathon Oil Corporation) | Buchanan, Bruce G. (University of Pittsburgh)
NewsFinder automates the steps involved in finding, selecting, categorizing, and publishing news stories that meet relevance criteria for the Artificial Intelligence community. The software combines a broad search of online news sources with topic-specific trained models and heuristics. Since August 2010, the program has been used to operate the AI in the News service that is part of the AAAI AITopics website.
Design and Deployment of a Personalized News Service
Stefik, Mark (PARC) | Good, Lange (Google, Inc.)
From 2008-2010 we built an experimental personalized news system where readers subscribe to organized channels of topical information that are curated by experts. AI technology was employed to efficiently present the right information to each reader and to radically reduce the workload of curators. The system went through three implementation cycles and processed over 20 million news stories from about 12,000 RSS feeds on over 8000 topics organized by 160 curators for over 600 registered readers. This paper describes the approach, engineering and AI technology of the system.
Innovative Applications of Artificial Intelligence 2011: Introduction to the Special Issue
Shapiro, Daniel G. (Institute for the Study of Learning and Expertise) | Fromherz, Markus (Xerox)
As a result, it is good to read these articles from a practical perspective. Papers that document deployed systems clarify the motivating application constraints, the match (and mismatch) between problems and technology, the innovations required to surmount barriers to deployment, and the impact of technology on application through practical measures of cost and benefit. Other articles describe applications that are almost feasible, drawn from papers in the IAAI emergent applications track. These papers provide a window into the search for viable applications at an earlier stage in the process of mating task with technology. All of the articles supply insight into the core question of what is feasible and why, which is a useful lens for us, as readers, to employ in viewing our own work. This special issue of AI Magazine contains expanded versions of five papers that describe deployed applications and two papers that discuss emergent applications from IAAI-11 (the article by Warrick and colleagues is from IAAI-10).
Learning by Demonstration for a Collaborative Planning Environment
Myers, Karen (SRI International) | Kolojejchic, Jake (General Dynamics C4 Systems | Viz) | Angiolillo, Carl (General Dynamics C4 Systems | Viz) | Cummings, Tim (General Dynamics C4 Systems | Viz) | Garvey, Tom (SRI International) | Gaston, Matt (Carnegie Mellon University) | Gervasio, Melinda (SRI International) | Haines, Will (SRI International) | Jones, Chris (SRI International) | Keifer, Kellie (SRI International) | Knittel, Janette (General Dynamics C4 Systems | Viz) | Morley, David (SRI International) | Ommert, William (General Dynamics C4 Systems | Viz) | Potter, Scott (General Dynamics C4 Systems | Viz)
Learning by demonstration technology has long held the promise to empower non-programmers to customize and extend software. We describe the deployment of a learning by demonstration capability to support user creation of automated procedures in a collaborative planning environment that is used widely by the U.S. Army. This technology, which has been in operational use since the summer of 2010, has helped to reduce user workloads by automating repetitive and time-consuming tasks. The technology has also provided the unexpected benefit of enabling standardization of products and processes.
Sequential Design for Computer Experiments with a Flexible Bayesian Additive Model
Chipman, Hugh, Ranjan, Pritam, Wang, Weiwei
In computer experiments, a mathematical model implemented on a computer is used to represent complex physical phenomena. These models, known as computer simulators, enable experimental study of a virtual representation of the complex phenomena. Simulators can be thought of as complex functions that take many inputs and provide an output. Often these simulators are themselves expensive to compute, and may be approximated by "surrogate models" such as statistical regression models. In this paper we consider a new kind of surrogate model, a Bayesian ensemble of trees (Chipman et al. 2010), with the specific goal of learning enough about the simulator that a particular feature of the simulator can be estimated. We focus on identifying the simulator's global minimum. Utilizing the Bayesian version of the Expected Improvement criterion (Jones et al. 1998), we show that this ensemble is particularly effective when the simulator is ill-behaved, exhibiting nonstationarity or abrupt changes in the response. A number of illustrations of the approach are given, including a tidal power application.
Distributed Robust Power System State Estimation
Kekatos, Vassilis, Giannakis, Georgios B.
Deregulation of energy markets, penetration of renewables, advanced metering capabilities, and the urge for situational awareness, all call for system-wide power system state estimation (PSSE). Implementing a centralized estimator though is practically infeasible due to the complexity scale of an interconnection, the communication bottleneck in real-time monitoring, regional disclosure policies, and reliability issues. In this context, distributed PSSE methods are treated here under a unified and systematic framework. A novel algorithm is developed based on the alternating direction method of multipliers. It leverages existing PSSE solvers, respects privacy policies, exhibits low communication load, and its convergence to the centralized estimates is guaranteed even in the absence of local observability. Beyond the conventional least-squares based PSSE, the decentralized framework accommodates a robust state estimator. By exploiting interesting links to the compressive sampling advances, the latter jointly estimates the state and identifies corrupted measurements. The novel algorithms are numerically evaluated using the IEEE 14-, 118-bus, and a 4,200-bus benchmarks. Simulations demonstrate that the attainable accuracy can be reached within a few inter-area exchanges, while largest residual tests are outperformed.
Single parameter galaxy classification: The Principal Curve through the multi-dimensional space of galaxy properties
Taghizadeh-Popp, M., Heinis, S., Szalay, A. S.
We propose to describe the variety of galaxies from SDSS by using only one affine parameter. To this aim, we build the Principal Curve (P-curve) passing through the spine of the data point cloud, considering the eigenspace derived from Principal Component Analysis of morphological, physical and photometric galaxy properties. Thus, galaxies can be labeled, ranked and classified by a single arc length value of the curve, measured at the unique closest projection of the data points on the P-curve. We find that the P-curve has a "W" letter shape with 3 turning points, defining 4 branches that represent distinct galaxy populations. This behavior is controlled mainly by 2 properties, namely u-r and SFR. We further present the variations of several galaxy properties as a function of arc length. Luminosity functions variate from steep Schechter fits at low arc length, to double power law and ending in Log-normal fits at high arc length. Galaxy clustering shows increasing autocorrelation power at large scales as arc length increases. PCA analysis allowed to find peculiar galaxy populations located apart from the main cloud of data points, such as small red galaxies dominated by a disk, of relatively high stellar mass-to-light ratio and surface mass density. The P-curve allows not only dimensionality reduction, but also provides supporting evidence for relevant physical models and scenarios in extragalactic astronomy: 1) Evidence for the hierarchical merging scenario in the formation of a selected group of red massive galaxies. These galaxies present a log-normal r-band luminosity function, which might arise from multiplicative processes involved in this scenario. 2) Connection between the onset of AGN activity and star formation quenching, which appears in green galaxies when transitioning from blue to red populations. (Full abstract in downloadable version)
A Hybrid Method for Distance Metric Learning
Kao, Yi-Hao, Van Roy, Benjamin, Rubin, Daniel, Xu, Jiajing, Faruque, Jessica, Napel, Sandy
We consider the problem of learning a measure of distance among vectors in a feature space and propose a hybrid method that simultaneously learns from similarity ratings assigned to pairs of vectors and class labels assigned to individual vectors. Our method is based on a generative model in which class labels can provide information that is not encoded in feature vectors but yet relates to perceived similarity between objects. Experiments with synthetic data as well as a real medical image retrieval problem demonstrate that leveraging class labels through use of our method improves retrieval performance significantly.
Software Verification and Graph Similarity for Automated Evaluation of Students' Assignments
Vujosevic-Janicic, Milena, Nikolic, Mladen, Tosic, Dusan, Kuncak, Viktor
In this paper we promote introducing software verification and control flow graph similarity measurement in automated evaluation of students' programs. We present a new grading framework that merges results obtained by combination of these two approaches with results obtained by automated testing, leading to improved quality and precision of automated grading. These two approaches are also useful in providing a comprehensible feedback that can help students to improve the quality of their programs We also present our corresponding tools that are publicly available and open source. The tools are based on LLVM low-level intermediate code representation, so they could be applied to a number of programming languages. Experimental evaluation of the proposed grading framework is performed on a corpus of university students' programs written in programming language C. Results of the experiments show that automatically generated grades are highly correlated with manually determined grades suggesting that the presented tools can find real-world applications in studying and grading.