Goto

Collaborating Authors

 Education


Margin-Based Algorithms for Information Filtering

Neural Information Processing Systems

In this work, we study an information filtering model where the relevance labels associated to a sequence of feature vectors are realizations of an unknown probabilistic linear function. Building on the analysis of a restricted version of our model, we derive a general filtering rule based on the margin of a ridge regression estimator. While our rule may observe the label of a vector only by classfying the vector as relevant, experiments on a real-world document filtering problem show that the performance of our rule is close to that of the online classifier which is allowed to observe all labels. These empirical results are complemented by a theoretical analysis where we consider a randomized variant of our rule and prove that its expected number of mistakes is never much larger than that of the optimal filtering rule which knows the hidden linear model.


Learning with Multiple Labels

Neural Information Processing Systems

In this paper, we study a special kind of learning problem in which each training instance is given a set of (or distribution over) candidate class labels and only one of the candidate labels is the correct one. Such a problem can occur, e.g., in an information retrieval setting where a set of words is associated with an image, or if classes labels are organized hierarchically. We propose a novel discriminative approach for handling the ambiguity of class labels in the training examples. The experiments with the proposed approach over five different UCI datasets show that our approach is able to find the correct label among the set of candidate labels and actually achieve performance close to the case when each training instance is given a single correct label. In contrast, naIve methods degrade rapidly as more ambiguity is introduced into the labels. 1 Introduction Supervised and unsupervised learning problems have been extensively studied in the machine learning literature. In supervised classification each training instance is associated with a single class label, while in unsupervised classification (i.e.


Adaptive Caching by Refetching

Neural Information Processing Systems

We are constructing caching policies that have 13-20% lower miss rates than the best of twelve baseline policies over a large variety of request streams. This represents an improvement of 49-63% over Least Recently Used, the most commonly implemented policy. We achieve this not by designing a specific new policy but by using online Machine Learning algorithms to dynamically shift between the standard policies based on their observed miss rates. A thorough experimental evaluation of our techniques is given, as well as a discussion of what makes caching an interesting online learning problem.


Robust Novelty Detection with Single-Class MPM

Neural Information Processing Systems

This algorithm-the "single-class minimax probability machine(MPM)"- is built on a distribution-free methodology that minimizes the worst-case probability of a data point falling outside of a convex set, given only the mean and covariance matrix of the distribution and making no further distributional assumptions. Wepresent a robust approach to estimating the mean and covariance matrix within the general two-class MPM setting, and show how this approach specializes to the single-class problem. We provide empirical results comparing the single-class MPM to the single-class SVM and a two-class SVM method. 1 Introduction Novelty detection is an important unsupervised learning problem in which test data are to be judged as having been generated from the same or a different process as that which generated the training data.


Boosting Density Estimation

Neural Information Processing Systems

Several authors have suggested viewing boosting as a gradient descent search for a good fit in function space. We apply gradient-based boosting methodology to the unsupervised learning problem of density estimation. We show convergence properties of the algorithm and prove that a strength of weak learnability property appliesto this problem as well. We illustrate the potential of this approach through experiments with boosting Bayesian networks to learn density models.


Learning in Spiking Neural Assemblies

Neural Information Processing Systems

We consider a statistical framework for learning in a class of networks ofspiking neurons. Our aim is to show how optimal local learning rules can be readily derived once the neural dynamics and desired functionality of the neural assembly have been specified, in contrast to other models which assume (sub-optimal) learning rules. Within this framework we derive local rules for learning temporal sequencesin a model of spiking neurons and demonstrate its superior performance to correlation (Hebbian) based approaches. We further show how to include mechanisms such as synaptic depression andoutline how the framework is readily extensible to learning in networks of highly complex spiking neurons. A stochastic quantalvesicle release mechanism is considered and implications on the complexity of learning discussed.


How the Poverty of the Stimulus Solves the Poverty of the Stimulus

Neural Information Processing Systems

Language acquisition is a special kind of learning problem because the outcome of learning of one generation is the input for the next. That makes it possible for languages to adapt to the particularities of the learner. In this paper, I show that this type of language change has important consequences for models of the evolution and acquisition of syntax. 1 The Language Acquisition Problem For both artificial systems and nonhuman animals, learning the syntax of natural languages is a notoriously hard problem. All healthy human infants, in contrast, learn any of the approximately 6000 human languages rapidly, accurately and spontaneously. Anyexplanation of how they accomplish this difficult task must specify the (innate) inductive bias that human infants bring to bear, and the input data that is available to them. Traditionally, the inductive bias is termed - somewhat unfortunately -"Universal Grammar", and the input data "primary linguistic data". Over the last 30 years or so, a view on the acquisition of the syntax of natural language has become popular that has put much emphasis on the innate machinery. In this view, that one can call the "Principles and Parameters" model, the Universal Grammar specifies most aspects of syntax in great detail [e.g.


AAAI News

AI Magazine

Program (July 28) is currently accepting nominations accessible to the general public or Tenth AAAI/SIGART Doctoral Consortium for AAAI Fellow. The AAAI Fellows to a broad AI audience (not just a subarea), (July 25-26) program is designed to written within the last two AAAI Intelligent Systems Demonstrations recognize people who have made significant, years.


Qualitative Reasoning about Population and Community Ecology

AI Magazine

Traditional approaches to ecological modeling, based on mathematical equations, are hampered by the qualitative nature of ecological knowledge. In this article, we demonstrate that qualitative reasoning provides alternative and productive ways for ecologists to develop, organize, and implement models. We present a qualitative theory of population dynamics and use this theory to capture and simulate commonsense theories about population and community ecology. Advantages of this approach include the possibility of deriving relevant conclusions about ecological systems without numeric data; a compositional approach that enables the reusability of models representing partial behavior; the use of a rich vocabulary describing objects, situations, relations, and mechanisms of change; and the capability to provide causal interpretations of system behavior.


Qualitative Modeling in Education

AI Magazine

We argue that qualitative modeling provides a valuable way for students to learn. Two modelbuilding environments, VMODEL and HOMER/- VISIGARP, are presented that support learners by constructing conceptual models of systems and their behavior using qualitative formalisms. Both environments use diagrammatic representations to facilitate knowledge articulation. Preliminary evaluations in educational settings provide support for the hypothesis that qualitative modeling tools can be valuable aids for learning.