Goto

Collaborating Authors

 Education


Differentially Private Online Learning

arXiv.org Machine Learning

In this paper, we consider the problem of preserving privacy in the online learning setting. We study the problem in the online convex programming (OCP) framework---a popular online learning setting with several interesting theoretical and practical implications---while using differential privacy as the formal privacy measure. For this problem, we distill two critical attributes that a private OCP algorithm should have in order to provide reasonable privacy as well as utility guarantees: 1) linearly decreasing sensitivity, i.e., as new data points arrive their effect on the learning model decreases, 2) sub-linear regret bound---regret bound is a popular goodness/utility measure of an online learning algorithm. Given an OCP algorithm that satisfies these two conditions, we provide a general framework to convert the given algorithm into a privacy preserving OCP algorithm with good (sub-linear) regret. We then illustrate our approach by converting two popular online learning algorithms into their differentially private variants while guaranteeing sub-linear regret ($O(\sqrt{T})$). Next, we consider the special case of online linear regression problems, a practically important class of online learning problems, for which we generalize an approach by Dwork et al. to provide a differentially private algorithm with just $O(\log^{1.5} T)$ regret. Finally, we show that our online learning framework can be used to provide differentially private algorithms for offline learning as well. For the offline learning problem, our approach obtains better error bounds as well as can handle larger class of problems than the existing state-of-the-art methods Chaudhuri et al.


Convex and Network Flow Optimization for Structured Sparsity

arXiv.org Machine Learning

We consider a class of learning problems regularized by a structured sparsity-inducing norm defined as the sum of l_2- or l_infinity-norms over groups of variables. Whereas much effort has been put in developing fast optimization techniques when the groups are disjoint or embedded in a hierarchy, we address here the case of general overlapping groups. To this end, we present two different strategies: On the one hand, we show that the proximal operator associated with a sum of l_infinity-norms can be computed exactly in polynomial time by solving a quadratic min-cost flow problem, allowing the use of accelerated proximal gradient methods. On the other hand, we use proximal splitting techniques, and address an equivalent formulation with non-overlapping groups, but in higher dimension and with additional constraints. We propose efficient and scalable algorithms exploiting these two strategies, which are significantly faster than alternative approaches. We illustrate these methods with several problems such as CUR matrix factorization, multi-task learning of tree-structured dictionaries, background subtraction in video sequences, image denoising with wavelets, and topographic dictionary learning of natural image patches.


Measuring Intelligence through Games

arXiv.org Artificial Intelligence

Artificial general intelligence (AGI) refers to research aimed at tackling the full problem of artificial intelligence, that is, create truly intelligent agents. This sets it apart from most AI research which aims at solving relatively narrow domains, such as character recognition, motion planning, or increasing player satisfaction in games. But how do we know when an agent is truly intelligent? A common point of reference in the AGI community is Legg and Hutter's formal definition of universal intelligence, which has the appeal of simplicity and generality but is unfortunately incomputable. Games of various kinds are commonly used as benchmarks for "narrow" AI research, as they are considered to have many important properties. We argue that many of these properties carry over to the testing of general intelligence as well. We then sketch how such testing could practically be carried out. The central part of this sketch is an extension of universal intelligence to deal with finite time, and the use of sampling of the space of games expressed in a suitably biased game description language.


Tech Report A Variational HEM Algorithm for Clustering Hidden Markov Models

arXiv.org Artificial Intelligence

The hidden Markov model (HMM) is a generative model that treats sequential data under the assumption that each observation is conditioned on the state of a discrete hidden variable that evolves in time as a Markov chain. In this paper, we derive a novel algorithm to cluster HMMs through their probability distributions. We propose a hierarchical EM algorithm that i) clusters a given collection of HMMs into groups of HMMs that are similar, in terms of the distributions they represent, and ii) characterizes each group by a "cluster center", i.e., a novel HMM that is representative for the group. We present several empirical studies that illustrate the benefits of the proposed algorithm.


ShareBoost: Efficient Multiclass Learning with Feature Sharing

arXiv.org Artificial Intelligence

Multiclass prediction is the problem of classifying an object into a relevant target class. We consider the problem of learning a multiclass predictor that uses only few features, and in particular, the number of used features should increase sub-linearly with the number of possible classes. This implies that features should be shared by several classes. We describe and analyze the ShareBoost algorithm for learning a multiclass predictor that uses few shared features. We prove that ShareBoost efficiently finds a predictor that uses few shared features (if such a predictor exists) and that it has a small generalization error. We also describe how to use ShareBoost for learning a non-linear predictor that has a fast evaluation time. In a series of experiments with natural data sets we demonstrate the benefits of ShareBoost and evaluate its success relatively to other state-of-the-art approaches.


Promoting scientific thinking with robots

arXiv.org Artificial Intelligence

This article describes an exemplary robot exercise which was conducted in a class for mechatronics students. The goal of this exercise was to engage students in scientific thinking and reasoning, activities which do not always play an important role in their curriculum. The robotic platform presented here is simple in its construction and is customizable to the needs of the teacher. Therefore, it can be used for exercises in many different fields of science, not necessarily related to robotics. Here we present a situation where the robot is used like an alien creature from which we want to understand its behavior, resembling an ethological research activity. This robot exercise is suited for a wide range of courses, from general introduction to science, to hardware oriented lectures.


Generalised elastic nets

arXiv.org Machine Learning

The elastic net was introduced as a heuristic algorithm for combinatorial optimisation and has been applied, among other problems, to biological modelling. It has an energy function which trades off a fitness term against a tension term. In the original formulation of the algorithm the tension term was implicitly based on a first-order derivative. In this paper we generalise the elastic net model to an arbitrary quadratic tension term, e.g. derived from a discretised differential operator, and give an efficient learning algorithm. We refer to these as generalised elastic nets (GENs). We give a theoretical analysis of the tension term for 1D nets with periodic boundary conditions, and show that the model is sensitive to the choice of finite difference scheme that represents the discretised derivative. We illustrate some of these issues in the context of cortical map models, by relating the choice of tension term to a cortical interaction function. In particular, we prove that this interaction takes the form of a Mexican hat for the original elastic net, and of progressively more oscillatory Mexican hats for higher-order derivatives. The results apply not only to generalised elastic nets but also to other methods using discrete differential penalties, and are expected to be useful in other areas, such as data analysis, computer graphics and optimisation problems.


Between Frustration and Elation: Sense of Control Regulates the lntrinsic Motivation for Motor Learning

AAAI Conferences

Frustration has been generally viewed in a negative light and its potential role in learning neglected. We propose a new approach to intrinsically motivated learning where frustration is a key factor that allows to dynamically balance exploration and exploitation. Moreover, based on the result obtained from our experiment with older infants, we propose that a temporary decrease in learning from negative feedback can also be beneficial in fine-tuning a newly learned behavior. We suggest that this temporal indifference to the outcome of an action may be related to the sense of control, and results from the state of elation, that is the experience of overcoming a very difficult task after prolonged frustration. Our preliminary simulation results serve as a proof-of-concept for our approach.


Adding Affective Argumentation to the GenIE Assistant

AAAI Conferences

The strategies seem designed to mitigate guilt over the parents' role in their The GenIE Assistant is an implemented proof-of-concept child's inheritance of a genetic condition. The names used computational model of normative biomedical argument to refer to the strategies in this paper and examples of generation informed by study of a corpus of letters each are listed below. All four apply to cases of written by genetic counselors to their clients (Green et al. autosomal recessive inheritance, while only the first two 2011). The goal of the model is to generate transparent apply to cases of autosomal dominant inheritance.


Lifelong Forgetting: A Critical Ingredient of Lifelong Learning, and Its Implementation in the OpenCog Integrative AI Framework

AAAI Conferences

Conceptually founded on the "patternist" systems theory of intelligence outlined in (Goertzel 2006), OCP combines Defining Forgetting In ordinary human discourse, the multiple AI paradigms such as uncertain logic, computational word "forget" has multiple shades of meaning. It can refer linguistics, evolutionary program learning and connectionist to the irreversible elimination of a certain knowledge item attention allocation in a unified architecture. Cognitive from memory; or it can mean something milder, as in cases processes embodying these different paradigms interoperate where someone "forgets" something, but then remembers it together on a common neural-symbolic knowledge shortly after. In the latter case, "forgetting" means that the store called the Atomspace. The interaction of these processes knowledge item has been stored in some portion of memory is designed to encourage the self-organizing emergence from which access is slow and uncertain.