Country
Automatic State Abstraction from Demonstration
Cobo, Luis Carlos (Georgia Institute of Technology) | Zang, Peng (Georgia Institute of Technology) | Jr., Charles Lee Isbell (Georgia Institute of Technology) | Thomaz, Andrea Lockerd (Georgia Institute of Technology)
Learning from Demonstration (LfD) is a popular technique for building decision-making agents from human help. Traditional LfD methods use demonstrations as training examples for supervised learning, but complex tasks can require more examples than is practical to obtain. We present Abstraction from Demonstration (AfD), a novel form of LfD that uses demonstrations to infer state abstractions and reinforcement learning (RL) methods in those abstract state spaces to build a policy. Empirical results show that AfD is greater than an order of magnitude more sample efficient than jus tusing demonstrations as training examples, and exponentially faster than RL alone.
Generative Structure Learning for Markov Logic Networks Based on Graph of Predicates
Dinh, Quang-Thang (Universite d'Orleans) | Exbrayat, Matthieu (Universite d'Orleans) | Vrain, Christel (Universite d'Orleans)
In this paper we present a new algorithm for generatively learning the structure of Markov Logic Networks. This algorithm relies on a graph of predicates, which summarizes the links existing between predicates and on relational information between ground atoms in the training database. Candidate clauses are produced by means of a heuristical variabilization technique. According to our first experiments, this approach appears to be promising.
Flexible, High Performance Convolutional Neural Networks for Image Classification
Ciresan, Dan Claudiu (IDSIA, USI, SUPSI) | Meier, Ueli (IDSIA, USI, SUPSI) | Masci, Jonathan (IDSIA, USI, SUPSI) | Gambardella, Luca Maria (IDSIA, USI, SUPSI) | Schmidhuber, Jürgen (IDSIA, USI, SUPSI)
We present a fast, fully parameterizable GPU implementation of Convolutional Neural Network variants. Our feature extractors are neither carefully designed nor pre-wired, but rather learned in a supervised way. Our deep hierarchical architectures achieve the best published results on benchmarks for object classification (NORB, CIFAR10) and handwritten digit recognition (MNIST), with error rates of 2.53%, 19.51%, 0.35%, respectively. Deep nets trained by simple back-propagation perform better than more shallow ones. Learning is surprisingly rapid. NORB is completely trained within five epochs. Test error rates on MNIST drop to 2.42%, 0.97% and 0.48% after 1, 3 and 17 epochs, respectively.
Unsupervised Learning of Patterns in Data Streams Using Compression and Edit Distance
Chua, Sook-Ling (Massey University) | Marsland, Stephen (Massey University) | Guesgen, Hans W. (Massey University)
Many unsupervised learning methods for recognising patterns in data streams are based on fixed length data sequences, which makes them unsuitable for applications where the data sequences are of variable length such as in speech recognition, behaviour recognition and text classification. In order to use these methods on variable length data sequences, a pre-processing step is required to manually segment the data and select the appropriate features, which is often not practical in real-world applications. In this paper we suggest an unsupervised learning method that handles variable length data sequences by identifying structure in the data stream using text compression and the edit distance between ‘words’. We demonstrate that using this method we can automatically cluster unlabelled data in a data stream and perform segmentation. We evaluate the effectiveness of our proposed method using both fixed length and variable length benchmark datasets, comparing it to the Self-Organising Map in the first case. The results show a promising improvement over baseline recognition systems.
Concept Labeling: Building Text Classifiers with Minimal Supervision
Chenthamarakshan, Vijil (IBM T J Watson Research Center Yorktown Heights) | Melville, Prem (IBM T J Watson Research Center Yorktown Heights) | Sindhwani, Vikas (IBM T J Watson Research Center Yorktown Heights) | Lawrence, Richard D (IBM T J Watson Research Center Yorktown Heights)
The rapid construction of supervised text classification models is becoming a pervasive need across many modern applications. To reduce human-labeling bottlenecks, many new statistical paradigms (e.g., active, semi-supervised, transfer and multi-task learning) have been vigorously pursued in recent literature with varying degrees of empirical success. Concurrently, the emergence of Web 2.0 platforms in the last decade has enabled a world-wide, collaborative human effort to construct a massive ontology of concepts with very rich, detailed and accurate descriptions. In this paper we propose a new framework to extract supervisory information from such ontologies and complement it with a shift in human effort from direct labeling of examples in the domain of interest to the much more efficient identification of concept-class associations. Through empirical studies on text categorization problems using the Wikipedia ontology, we show that this shift allows very high-quality models to be immediately induced at virtually no cost.
Increasing the Scalability of the Fitting of Generalised Block Models for Social Networks
Chan, Jeffrey (National University Ireland, Galway) | Lam, Samantha (National University Ireland, Galway) | Hayes, Conor (National University Ireland, Galway)
In recent years, the summarisation and decomposition of social networks has become increasingly popular, from community finding to role equivalence. However, these approaches concentrate on one type of model only. Generalised block modelling decomposes a network into independent, interpretable, labeled blocks, where the block labels summarise the relationship between two sets of users. Existing algorithms for fitting generalised block models do not scale beyond networks of 100 vertices. In this paper, we introduce two new algorithms, one based on genetic algorithms and the other on simulated annealing, that is at least two orders of magnitude faster than existing algorithms and obtaining similar accuracy. Using synthetic and real datasets, we demonstrate their efficiency and accuracy and show how generalised block modelling and our new approaches enable tractable network summarisation and modelling of medium sized networks.
Using Cases as Heuristics in Reinforcement Learning: A Transfer Learning Application
Jr., Luiz A. Celiberto (Technological Institute of Aeronautics) | Matsuura, Jackson P. (Technological Institute of Aeronautics) | Mantaras, Ramon Lopez de (Artificial Intelligence Research Institute (IIIA-CSIC)) | Bianchi, Reinaldo A. C. (Centro Universitario da FEI)
Another way to speed up a RL algorithm is by using Transfer Learning, a paradigm of machine learning that In this paper we propose to combine three AI techniques reuses knowledge accumulated in a previous task to speed up to speed up a Reinforcement Learning algorithm the learning of a novel, but related, target task [Taylor and in a Transfer Learning problem: Casebased Stone, 2009]. Reasoning, Heuristically Accelerated Reinforcement This paper investigates the use of the Case-Based Heuristically Learning and Neural Networks. To do Accelerated Reinforcement Learning (CB-HARL) algorithm so, we propose a new algorithm, called L3, which [Bianchi et al., 2009] as a means to transfer learning works in 3 stages: in the first stage, it uses Reinforcement acquired by one agent during its training in one problem to Learning to learn how to perform one another agent that has to learn how to solve a similar, but task, and stores the optimal policy for this problem more complex, problem. To do so, we propose a new algorithm, as a case-base; in the second stage, it uses a Neural called L3, which works in 3 stages: in the first stage, Network to map actions from one domain to actions it uses the Q-learning algorithm [Watkins, 1989] to learn how in the other domain and; in the third stage, it uses to perform one task, and stores the optimal policy for this the case-base learned in the first stage as heuristics problem as a case-base; in the second stage, it uses a Neural to speed up the learning performance in a related, Network to map actions from one domain to actions in but different, task. The RL algorithm used the other domain and; in the third stage, it uses the case-base in the first phase is the Q-learning and in the third learned in the first stage as heuristics in the CB-HARL algorithm, phase is the recently proposed Case-based Heuristically speeding up the learning process.
Distance Metric Learning under Covariate Shift
Cao, Bin (Hong Kong University of Science and Technology) | Ni, Xiaochuan (Microsoft Research Asia) | Sun, Jian-Tao (Microsoft Research Asia) | Wang, Gang (Microsoft) | Yang, Qiang (Hong Kong University of Science and Technology)
Learning distance metrics is a fundamental problem in machine learning. Previous distance-metric learning research assumes that the training and test data are drawn from the same distribution, which may be violated in practical applications. When the distributions differ, a situation referred to as covariate shift, the metric learned from training data may not work well on the test data. In this case the metric is said to be inconsistent. In this paper, we address this problem by proposing a novel metric learning framework known as consistent distance metric learning (CDML), which solves the problem under covariate shift situations. We theoretically analyze the conditions when the metrics learned under covariate shift are consistent. Based on the analysis, a convex optimization problem is proposed to deal with the CDML problem. An importance sampling method is proposed for metric learning and two importance weighting strategies are proposed and compared in this work. Experiments are carried out on synthetic and real world datasets to show the effectiveness of the proposed method.
A Hidden Markov Model Variant for Sequence Classification
Blasiak, Sam (George Mason University) | Rangwala, Huzefa (George Mason University)
Sequence classification is central to many practical problems within machine learning. Distances metrics between arbitrary pairs of sequences can be hard to define because sequences can vary in length and the information contained in the order of sequence elements is lost when standard metrics such as Euclidean distance are applied. We present a scheme that employs a Hidden Markov Model variant to produce a set of fixed-length description vectors from a set of sequences. We then define three inference algorithms, a Baum-Welch variant, a Gibbs Sampling algorithm, and a variational algorithm, to infer model parameters. Finally, we show experimentally that the fixed length representation produced by these inference methods is useful for classifying sequences of amino acids into structural classes
Learning a Distance Metric by Empirical Loss Minimization
Bian, Wei (University of Technology) | Tao, Dacheng (University of Technology)
In this paper, we study the problem of learning ametric and propose a loss function based metriclearning framework, in which the metric is estimatedby minimizing an empirical risk over a trainingset. With mild conditions on the instance distributionand the used loss function, we prove that theempirical risk converges to its expected counterpartat rate O(1/\sqrt{n}), wherein n is the cardinality of the training set. In addition, with the assumption thatthe best metric that minimizes the expected risk isbounded, we prove that the learned metric is consistent. Two example algorithms are presented by usingthe proposed loss function based metric learningframework, each of which uses a log loss functionand a smoothed hinge loss function, respectively. Experimental results on data sets from the UCI machine learning repository suggest the effectivenessof the proposed algorithms.