Goto

Collaborating Authors

 Unsupervised or Indirectly Supervised Learning


Conditional Generative-Adversarial Network for inverse Holography

#artificialintelligence

Output of a conditional convolutional GAN trained with spectral normalization on light-field/holography pairs. The generator learns the conditional distribution of holograms (grey, left panels) given a certain light field (red colours on the right).


How Alexa Learns

#artificialintelligence

Over the past 10 years, commercial AI has enjoyed what we at Amazon call the flywheel effect: customer interactions with AI systems generate data; with more data, machine learning algorithms perform better, which leads to better customer experiences; better customer experiences drive more usage and engagement, which in turn generate more data. Those data are used to train machine learning systems in three chief ways. The first is supervised learning, in which the training data are hand-labeled (with, say, words' parts of speech or the names of objects in an image) and the system learns to apply labels to unlabeled data. A variation of this is weakly supervised learning, which uses easily acquired but imprecise labels to enable machine learning at scale. If a website visitor performs a search, for instance, the links she clicks indicate which search results should have been at the top of the list; that kind of implicit information can be used to automatically label data. Training with entirely unlabeled data is called unsupervised learning.


Virtual Adversarial Training on Graph Convolutional Networks in Node Classification

arXiv.org Machine Learning

The effectiveness of Graph Convolutional Networks (GCNs) has been demonstrated in a wide range of graph-based machine learning tasks. However, the update of parameters in GCNs is only from labeled nodes, lacking the utilization of unlabeled data. In this paper, we apply Virtual Adversarial Training (VAT), an adversarial regularization method based on both labeled and unlabeled data, on the supervised loss of GCN to enhance its generalization performance. By imposing virtually adversarial smoothness on the posterior distribution in semi-supervised learning, VAT yields improvement on the Symmetrical Laplacian Smoothness of GCNs. In addition, due to the difference of property in features, we perturb virtual adversarial perturbations on sparse and dense features, resulting in GCN Sparse VAT (GCNSVAT) and GCN Dense VAT (GCNDVAT) algorithms, respectively. Extensive experiments verify the effectiveness of our two methods across different training sizes. Our work paves the way towards better understanding the direction of improvement on GCNs in the future.


Semi-Generative Modelling: Covariate-Shift Adaptation with Cause and Effect Features

arXiv.org Machine Learning

Current methods for covariate-shift adaptation use unlabelled data to compute importance weights or domain-invariant features, while the final model is trained on labelled data only. Here, we consider a particular case of covariate shift which allows us also to learn from unlabelled data, that is, combining adaptation with semi-supervised learning. Using ideas from causality, we argue that this requires learning with both causes, $X_C$, and effects, $X_E$, of a target variable, $Y$, and show how this setting leads to what we call a semi-generative model, $P(Y,X_E|X_C,\theta)$. Our approach is robust to domain shifts in the distribution of causal features and leverages unlabelled data by learning a direct map from causes to effects. Experiments on synthetic data demonstrate significant improvements in classification over purely-supervised and importance-weighting baselines.


Unsupervised Visual Domain Adaptation: A Deep Max-Margin Gaussian Process Approach

arXiv.org Machine Learning

In unsupervised domain adaptation, it is widely known that the target domain error can be provably reduced by having a shared input representation that makes the source and target domains indistinguishable from each other. Very recently it has been studied that not just matching the marginal input distributions, but the alignment of output (class) distributions is also critical. The latter can be achieved by minimizing the maximum discrepancy of predictors (classifiers). In this paper, we adopt this principle, but propose a more systematic and effective way to achieve hypothesis consistency via Gaussian processes (GP). The GP allows us to define/induce a hypothesis space of the classifiers from the posterior distribution of the latent random functions, turning the learning into a simple large-margin posterior separation problem, far easier to solve than previous approaches based on adversarial minimax optimization. We formulate a learning objective that effectively pushes the posterior to minimize the maximum discrepancy. This is further shown to be equivalent to maximizing margins and minimizing uncertainty of the class predictions in the target domain, a well-established principle in classical (semi-)supervised learning. Empirical results demonstrate that our approach is comparable or superior to the existing methods on several benchmark domain adaptation datasets.


Machine Learning vs. Artificial Intelligence: Definitions and Importance

#artificialintelligence

Machine learning, sometimes called computational intelligence, has broken down barriers in recent years and has made significant advances in a number of areas, such as robotics, machine translation, social networking, e-commerce, and even in areas such as medicine and healthcare. Machine Learning is an area of AI with a goal to develop computational techniques on learning as well as the construction of systems capable of acquiring knowledge automatically. A learning system is a computer program that makes decisions based on accumulated experiences through the successful solution of past problems. Despite the short definition, there are numerous different learning algorithms and the area is one of the hottest in the field of computing, with several new techniques and algorithms being published regularly. Many people think machine learning and artificial intelligence mean the same thing, but that's not quite accurate.


Confidence-based Graph Convolutional Networks for Semi-Supervised Learning

arXiv.org Machine Learning

Predicting properties of nodes in a graph is an important problem with applications in a variety of domains. Graph-based Semi-Supervised Learning (SSL) methods aim to address this problem by labeling a small subset of the nodes as seeds and then utilizing the graph structure to predict label scores for the rest of the nodes in the graph. Recently, Graph Convolutional Networks (GCNs) have achieved impressive performance on the graph-based SSL task. In addition to label scores, it is also desirable to have confidence scores associated with them. Unfortunately, confidence estimation in the context of GCN has not been previously explored. We fill this important gap in this paper and propose ConfGCN, which estimates labels scores along with their confidences jointly in GCN-based setting. ConfGCN uses these estimated confidences to determine the influence of one node on another during neighborhood aggregation, thereby acquiring anisotropic capabilities. Through extensive analysis and experiments on standard benchmarks, we find that ConfGCN is able to outperform state-of-the-art baselines. We have made ConfGCN's source code available to encourage reproducible research.


Yes, we GAN: Applying Adversarial Techniques for Autonomous Driving

arXiv.org Artificial Intelligence

Generative Adversarial Networks (GAN) have gained a lot of popularity from their introduction in 2014 till present. Research on GAN is rapidly growing and there are many variants of the original GAN focusing on various aspects of deep learning. GAN are perceived as the most impactful direction of machine learning in the last decade. This paper focuses on the application of GAN in autonomous driving including topics such as advanced data augmentation, loss function learning, semi-supervised learning, etc. We formalize and review key applications of adversarial techniques and discuss challenges and open problems to be addressed.


Semi-Supervised Learning by Label Gradient Alignment

arXiv.org Machine Learning

We present label gradient alignment, a novel algorithm for semi-supervised learning which imputes labels for the unlabeled data and trains on the imputed labels. We define a semantically meaningful distance metric on the input space by mapping a point (x, y) to the gradient of the model at (x, y). We then formulate an optimization problem whose objective is to minimize the distance between the labeled and the unlabeled data in this space, and we solve it by gradient descent on the imputed labels. We evaluate label gradient alignment using the standardized architecture introduced by Oliver et al. (2018) and demonstrate state-of-the-art accuracy in semi-supervised CIFAR-10 classification.


The Wilderness Area Data Set: Adapting the Covertype data set for unsupervised learning

arXiv.org Machine Learning

Benchmark data sets are of vital importance in machine learning research, as indicated by the number of repositories that exist to make them publicly available. Although many of these are usable in the stream mining context as well, it is less obvious which data sets can be used to evaluate data stream clustering algorithms. We note that the classic Covertype data set's size makes it attractive for use in stream mining but unfortunately it is specifically designed for classification. Here we detail the process of transforming the Covertype data set into one amenable for unsupervised learning, which we call the Wilderness Area data set. Our quantitative analysis allows us to conclude that the Wilderness Area data set is more appropriate for unsupervised learning than the original Covertype data set.