Overview
Topic Correlation Analysis for Cross-Domain Text Classification
Li, Lianghao (Tsinghua University) | Jin, Xiaoming (Tsinghua University) | Long, Mingsheng (Tsinghua University)
Cross-domain text classification aims to automatically train a precise text classifier for a target domain by using labeled text data from a related source domain. To this end, the distribution gap between different domains has to be reduced. In previous works, a certain number of shared latent features (e.g., latent topics, principal components, etc.) are extracted to represent documents from different domains, and thus reduce the distribution gap. However, only relying the shared latent features as the domain bridge may limit the amount of knowledge transferred. This limitation is more serious when the distribution gap is so large that only a small number of latent features can be shared between domains. In this paper, we propose a novel approach named Topic Correlation Analysis (TCA), which extracts both the shared and the domain-specific latent features to facilitate effective knowledge transfer. In TCA, all word features are first grouped into the shared and the domain-specific topics using a joint mixture model. Then the correlations between the two kinds of topics are inferred and used to induce a mapping between the domain-specific topics from different domains. Finally, both the shared and the mapped domain-specific topics are utilized to span a new shared feature space where the supervised knowledge can be effectively transferred. The experimental results on two real-world data sets justify the superiority of the proposed method over the stat-of-the-art baselines.
Kernel-Based Reinforcement Learning on Representative States
Kveton, Branislav (Technicolor Labs) | Theocharous, Georgios (Yahoo Labs)
Markov decision processes (MDPs) are an established framework for solving sequential decision-making problems under uncertainty. In this work, we propose a new method for batch-mode reinforcement learning (RL) with continuous state variables. The method is an approximation to kernel-based RL on a set of k representative states. Similarly to kernel-based RL, our solution is a fixed point of a kernelized Bellman operator and can approximate the optimal solution to an arbitrary level of granularity. Unlike kernel-based RL, our method is fast. In particular, our policies can be computed in O ( n ) time, where n is the number of training examples. The time complexity of kernel-based RL is ฮฉ( n 2 ). We introduce our method, analyze its convergence, and compare it to existing work. The method is evaluated on two existing control problems with 2 to 4 continuous variables and a new problem with 64 variables. In all cases, we outperform state-of-the-art results and offer simpler solutions.
A Bayesian Approach to the Data Description Problem
Ghasemi, Alireza (Ecole Polytechnique Federale de Lausanne (EPFL)) | Rabiee, Hamid R. (Sharif University of Technology) | Manzuri, Mohammad Taghi (Sharif University of Technology) | Rohban, Mohammad Hossein (Sharif University of Technology)
In this paper, we address the problem of data description using a Bayesian framework. The goal of data description is to draw a boundary around objects of a certain class of interest to discriminate that class from the rest of the feature space. Data description is also known as one-class learning and has a wide range of applications. The proposed approach uses a Bayesian framework to precisely compute the class boundary and therefore can utilize domain information in form of prior knowledge in the framework. It can also operate in the kernel space and therefore recognize arbitrary boundary shapes. Moreover, the proposed method can utilize unlabeled data in order to improve accuracy of discrimination. We evaluate our method using various real-world datasets and compare it with other state of the art approaches of data description. Experiments show promising results and improved performance over other data description and one-class learning algorithms.
Automated Inference System for End-To-End Diagnosis of Network Performance Issues in Client-Terminal Devices
Widanapathirana, Chathuranga, ลekercioวงlu, Y. Ahmet, Ivanovich, Milosh V., Fitzpatrick, Paul G., Li, Jonathan C.
Traditional network diagnosis methods of Client-Terminal Device (CTD) problems tend to be laborintensive, time consuming, and contribute to increased customer dissatisfaction. In this paper, we propose an automated solution for rapidly diagnose the root causes of network performance issues in CTD. Based on a new intelligent inference technique, we create the Intelligent Automated Client Diagnostic (IACD) system, which only relies on collection of Transmission Control Protocol (TCP) packet traces. Using soft-margin Support Vector Machine (SVM) classifiers, the system (i) distinguishes link problems from client problems and (ii) identifies characteristics unique to the specific fault to report the root cause. The modular design of the system enables support for new access link and fault types. Experimental evaluation demonstrated the capability of the IACD system to distinguish between faulty and healthy links and to diagnose the client faults with 98% accuracy. The system can perform fault diagnosis independent of the user's specific TCP implementation, enabling diagnosis of diverse range of client devices.
Innovative Applications of Artificial Intelligence 2011: Introduction to the Special Issue
Shapiro, Daniel G. (Institute for the Study of Learning and Expertise) | Fromherz, Markus (Xerox)
Every year, AI Magazine devotes one fourth of its annual production to a special issue based on the Innovative Applications of Artificial Intelligence conference. Because IAAI is the premier venue for documenting the transition of AI technology into application, these special issues provide a snapshot of the state of the art in AI with the practical syllogism in mind; they present work that has value because it delivers value in use.
Innovative Applications of Artificial Intelligence 2011: Introduction to the Special Issue
Shapiro, Daniel G. (Institute for the Study of Learning and Expertise) | Fromherz, Markus (Xerox)
As a result, it is good to read these articles from a practical perspective. Papers that document deployed systems clarify the motivating application constraints, the match (and mismatch) between problems and technology, the innovations required to surmount barriers to deployment, and the impact of technology on application through practical measures of cost and benefit. Other articles describe applications that are almost feasible, drawn from papers in the IAAI emergent applications track. These papers provide a window into the search for viable applications at an earlier stage in the process of mating task with technology. All of the articles supply insight into the core question of what is feasible and why, which is a useful lens for us, as readers, to employ in viewing our own work. This special issue of AI Magazine contains expanded versions of five papers that describe deployed applications and two papers that discuss emergent applications from IAAI-11 (the article by Warrick and colleagues is from IAAI-10).
Learning by Demonstration for a Collaborative Planning Environment
Myers, Karen (SRI International) | Kolojejchic, Jake (General Dynamics C4 Systems | Viz) | Angiolillo, Carl (General Dynamics C4 Systems | Viz) | Cummings, Tim (General Dynamics C4 Systems | Viz) | Garvey, Tom (SRI International) | Gaston, Matt (Carnegie Mellon University) | Gervasio, Melinda (SRI International) | Haines, Will (SRI International) | Jones, Chris (SRI International) | Keifer, Kellie (SRI International) | Knittel, Janette (General Dynamics C4 Systems | Viz) | Morley, David (SRI International) | Ommert, William (General Dynamics C4 Systems | Viz) | Potter, Scott (General Dynamics C4 Systems | Viz)
Learning by demonstration technology has long held the promise to empower non-programmers to customize and extend software. We describe the deployment of a learning by demonstration capability to support user creation of automated procedures in a collaborative planning environment that is used widely by the U.S. Army. This technology, which has been in operational use since the summer of 2010, has helped to reduce user workloads by automating repetitive and time-consuming tasks. The technology has also provided the unexpected benefit of enabling standardization of products and processes.
Regularizers versus Losses for Nonlinear Dimensionality Reduction: A Factored View with New Convex Relaxations
Yu, Yaoliang, Neufeld, James, Kiros, Ryan, Zhang, Xinhua, Schuurmans, Dale
We demonstrate that almost all non-parametric dimensionality reduction methods can be expressed by a simple procedure: regularized loss minimization plus singular value truncation. By distinguishing the role of the loss and regularizer in such a process, we recover a factored perspective that reveals some gaps in the current literature. Beyond identifying a useful new loss for manifold unfolding, a key contribution is to derive new convex regularizers that combine distance maximization with rank reduction. These regularizers can be applied to any loss.
Optimizing Memory-Bounded Controllers for Decentralized POMDPs
Amato, Christopher, Bernstein, Daniel S, Zilberstein, Shlomo
We present a memory-bounded optimization approach for solving infinite-horizon decentralized POMDPs. Policies for each agent are represented by stochastic finite state controllers. We formulate the problem of optimizing these policies as a nonlinear program, leveraging powerful existing nonlinear optimization techniques for solving the problem. While existing solvers only guarantee locally optimal solutions, we show that our formulation produces higher quality controllers than the state-of-the-art approach. We also incorporate a shared source of randomness in the form of a correlation device to further increase solution quality with only a limited increase in space and time. Our experimental results show that nonlinear optimization can be used to provide high quality, concise solutions to decentralized decision problems under uncertainty.
On Discarding, Caching, and Recalling Samples in Active Learning
Kapoor, Ashish, Horvitz, Eric J.
We address challenges of active learning under scarce informational resources in non-stationary environments. In real-world settings, data labeled and integrated into a predictive model may become invalid over time. However, the data can become informative again with switches in context and such changes may indicate unmodeled cyclic or other temporal dynamics. We explore principles for discarding, caching, and recalling labeled data points in active learning based on computations of value of information. We review key concepts and study the value of the methods via investigations of predictive performance and costs of acquiring data for simulated and real-world data sets.