Education
Learning Multiple Tasks using Manifold Regularization
Agarwal, Arvind, Gerber, Samuel, Daume, Hal
We present a novel method for multitask learning (MTL) based on {\it manifold regularization}: assume that all task parameters lie on a manifold. This is the generalization of a common assumption made in the existing literature: task parameters share a common {\it linear} subspace. One proposed method uses the projection distance from the manifold to regularize the task parameters. The manifold structure and the task parameters are learned using an alternating optimization framework. When the manifold structure is fixed, our method decomposes across tasks which can be learnt independently. An approximation of the manifold regularization scheme is presented that preserves the convexity of the single task learning problem, and makes the proposed MTL framework efficient and easy to implement. We show the efficacy of our method on several datasets.
A Learning Algorithm based on High School Teaching Wisdom
A learning algorithm based on primary school teaching and learning is presented. The methodology is to continuously evaluate a student and to give them training on the examples for which they repeatedly fail, until, they can correctly answer all types of questions. This incremental learning procedure produces better learning curves by demanding the student to optimally dedicate their learning time on the failed examples. When used in machine learning, the algorithm is found to train a machine on a data with maximum variance in the feature space so that the generalization ability of the network improves. The algorithm has interesting applications in data mining, model evaluations and rare objects discovery.
Reinforcement Learning in Partially Observable Markov Decision Processes using Hybrid Probabilistic Logic Programs
We present a probabilistic logic programming framework to reinforcement learning, by integrating reinforce-ment learning, in POMDP environments, with normal hybrid probabilistic logic programs with probabilistic answer set seman-tics, that is capable of representing domain-specific knowledge. We formally prove the correctness of our approach. We show that the complexity of finding a policy for a reinforcement learning problem in our approach is NP-complete. In addition, we show that any reinforcement learning problem can be encoded as a classical logic program with answer set semantics. We also show that a reinforcement learning problem can be encoded as a SAT problem. We present a new high level action description language that allows the factored representation of POMDP. Moreover, we modify the original model of POMDP so that it be able to distinguish between knowledge producing actions and actions that change the environment.
A Large-Deviation Analysis of the Maximum-Likelihood Learning of Markov Tree Structures
Tan, Vincent Y. F., Anandkumar, Animashree, Tong, Lang, Willsky, Alan S.
The problem of maximum-likelihood (ML) estimation of discrete tree-structured distributions is considered. Chow and Liu established that ML-estimation reduces to the construction of a maximum-weight spanning tree using the empirical mutual information quantities as the edge weights. Using the theory of large-deviations, we analyze the exponent associated with the error probability of the event that the ML-estimate of the Markov tree structure differs from the true tree structure, given a set of independently drawn samples. By exploiting the fact that the output of ML-estimation is a tree, we establish that the error exponent is equal to the exponential rate of decay of a single dominant crossover event. We prove that in this dominant crossover event, a non-neighbor node pair replaces a true edge of the distribution that is along the path of edges in the true tree graph connecting the nodes in the non-neighbor pair. Using ideas from Euclidean information theory, we then analyze the scenario of ML-estimation in the very noisy learning regime and show that the error exponent can be approximated as a ratio, which is interpreted as the signal-to-noise ratio (SNR) for learning tree distributions. We show via numerical experiments that in this regime, our SNR approximation is accurate.
Assisting Scientists with Complex Data Analysis Tasks through Semantic Workflows
Gil, Yolanda (Information Sciences Institute, University of Southern California) | Ratnakar, Varun (Information Sciences Institute, University of Southern California) | Fritz, Christian (Information Sciences Institute, University of Southern California)
To assist scientists in data analysis tasks, we have developed semantic workflow representations that support automatic constraint propagation and reasoning algorithms to manage constraints among the individual workflow steps. Semantic constraints can be used to represent requirements of input datasets as well as best practices for the method represented in a workflow. We demonstrate how the Wings workflow system uses semantic workflows to assist users in creating workflows while validating that the workflows comply with the requirements of the software components and datasets. Wings reasons over semantic workflow representations that consist of both a traditional dataflow graph as well as a network of constraints on the data and components of the workflow.
High Dimensional Data Fusion via Joint Manifold Learning
Davenport, Mark A. (Stanford University) | Hegde, Chinmay (Rice University) | Duarte, Marco F. (Princeton University) | Baraniuk, Richard G. (Rice University)
The emergence of low-cost sensing architectures for diverse modalities has made it possible to deploy sensor networks that acquire large amounts of very high-dimensional data. To cope with such a data deluge, manifold models are often developed that provide a powerful theoretical and algorithmic framework for capturing the intrinsic structure of data governed by a low-dimensional set of parameters. However, these models do not typically take into account dependencies among multiple sensors. We thus propose a new joint manifold framework for data ensembles that exploits such dependencies. We show that joint manifold structure can lead to improved performance for manifold learning. Additionally, we leverage recent results concerning random projections of manifolds to formulate a universal, network-scalable dimensionality reduction scheme that efficiently fuses the data from all sensors.
Isometric Correction for Manifold Learning
Behmardi, Behrouz (Oregon State University) | Raich, Raviv (Oregon State University)
In this paper, we present a method for isometric correction of manifold learning techniques. We first present an isometric nonlinear dimension reduction method. Our proposed method overcomes the issues associated with well-known isometric embedding techniques such as ISOMAP and maximum variance unfolding (MVU), i.e., computational complexity and the geodesic convexity requirement. Based on the proposed algorithm, we derive our isometric correction method. Our approach follows an isometric solution to the problem of local tangent space alignment. We provide a derivation of a fast iterative solution. The performance of our algorithm is illustrated on both synthetic and real datasets compared to other methods.
Preface: Manifold Learning and Its Applications
Koyejo, Oluwasanmi (University of Texas at Austin) | Souvenir, Richard (University of North Carolina at Charlotte)
Researchers in many fields such as machine learning, computer vision, bioinformatics and robotics often observe that high dimensional data samples have low degrees of freedom in local neighborhoods, but a more complicated global structure. In many cases, there is enough structure in the data so the degrees of freedom can be described by a lower dimensional object such as a manifold. The goal of manifold learning research is to discover techniques that exploit local structure in data to learn better models, learn better input-output relationships and reduce the computational complexity of learning. The field of manifold learning is truly cross-disciplinary, involving researchers from such varied fields as topology, geometry, machine learning, statistics, computer vision, robotics and many others. This has led to an accelerating pace of research and applications in recent years.
Inconsistency in Behaviors of Virtual Agents and Robots: Case Studies on its Influences into Dialogues with Humans
Nomura, Tatsuya (Ryukoku University)
Inconsistency in behaviors of virtual agents and robots, like that between utterance contents, utterance forms, and postures, has a possibility of influences into human impression, cognition, and memory, and as a result, may lead to inhibition of dialogues between humans and these artifacts. In order to discuss about this possibility and its implications on dialogue design, this paper introduces some case studies using simple animated characters and a small-sized humanoid robot in Japan.
Toward Fast Mapping for Robot Adjective Learning
Petrosino, Allison (Wellesley College) | Gold, Kevin (Rochester Institute of Technology)
Fast mapping is a phenomenon by which children learn the meanings of novel adjectives after a very small number of exposures when the new word is contrasted with a known word. The present study was a preliminary test of whether machine learners could use such contrasts in unconstrained speech to learn adjective meanings and categories. Six decision tree-based learning methods were evaluated that use contrasting examples in order to work toward an adjective fast-mapping system for machine learners. Subjects tended to compare objects using adjectives of the same category, implying that such contrasts may be a useful source of data about adjective meaning, though none of the learning algorithms showed strong advantages over any other.