Inductive Learning
Generalized Label Propagation Methods for Semi-Supervised Learning
Li, Qimai, Wu, Xiao-Ming, Guan, Zhichao
The key challenge in semi-supervised learning is how to effectively leverage unlabeled data to improve learning performance. The classical label propagation method, despite its popularity, has limited modeling capability in that it only exploits graph information for making predictions. In this paper, we consider label propagation from a graph signal processing perspective and decompose it into three components: signal, filter, and classifier. By extending the three components, we propose a simple generalized label propagation (GLP) framework for semi-supervised learning. GLP naturally integrates graph and data feature information, and offers the flexibility of selecting appropriate filters and domain-specific classifiers for different applications. Interestingly, GLP also provides new insight into the popular graph convolutional network and elucidates its working mechanisms. Extensive experiments on three citation networks, one knowledge graph, and one image dataset demonstrate the efficiency and effectiveness of GLP.
Semi-supervised Learning in Network-Structured Data via Total Variation Minimization
Jung, Alexander, Hero, Alfred O. III, Mara, Alexandru, Jahromi, Saeed, Heimowitz, Ayelet, Eldar, Yonina C.
We propose and analyze a method for semi-supervised learning from partially-labeled network-structured data. Our approach is based on a graph signal recovery interpretation under a clustering hypothesis that labels of data points belonging to the same well-connected subset (cluster) are similar valued. This lends naturally to learning the labels by total variation (TV) minimization, which we solve by applying a recently proposed primal-dual method for non-smooth convex optimization. The resulting algorithm allows for a highly scalable implementation using message passing over the underlying empirical graph, which renders the algorithm suitable for big data applications. By applying tools of compressed sensing, we derive a sufficient condition on the underlying network structure such that TV minimization recovers clusters in the empirical graph of the data. In particular, we show that the proposed primal-dual method amounts to maximizing network flows over the empirical graph of the dataset. Moreover, the learning accuracy of the proposed algorithm is linked to the set of network flows between data points having known labels. The effectiveness and scalability of our approach is verified by numerical experiments.
Few-shot Learning with Meta Metric Learners
Cheng, Yu, Yu, Mo, Guo, Xiaoxiao, Zhou, Bowen
Few-shot Learning aims to learn classifiers for new classes with only a few training examples per class. Existing meta-learning or metric-learning based few-shot learning approaches are limited in handling diverse domains with various number of labels. The meta-learning approaches train a meta learner to predict weights of homogeneous-structured task-specific networks, requiring a uniform number of classes across tasks. The metric-learning approaches learn one task-invariant metric for all the tasks, and they fail if the tasks diverge. We propose to deal with these limitations with meta metric learning. Our meta metric learning approach consists of task-specific learners, that exploit metric learning to handle flexible labels, and a meta learner, that discovers good parameters and gradient decent to specify the metrics in task-specific learners. Thus the proposed model is able to handle unbalanced classes as well as to generate task-specific metrics. We test our approach in the `$k$-shot $N$-way' few-shot learning setting used in previous work and new realistic few-shot setting with diverse multi-domain tasks and flexible label numbers. Experiments show that our approach attains superior performances in both settings.
The Use of Unlabeled Data versus Labeled Data for Stopping Active Learning for Text Classification
Beatty, Garrett, Kochis, Ethan, Bloodgood, Michael
Abstract-- Annotation of training data is the major bottleneck in the creation of text classification systems. Active learning is a commonly used technique to reduce the amount of training data one needs to label. A crucial aspect of active learning is determining when to stop labeling data. Three potential sources for informing when to stop active learning are an additional labeled set of data, an unlabeled set of data, and the training data that is labeled during the process of active learning. To date, no one has compared and contrasted the advantages and disadvantages of stopping methods based on these three information sources. We find that stopping methods that use unlabeled data are more effective than methods that use labeled data. I. INTRODUCTION The use of active learning to train machine learning models has been used as a way to reduce annotation costs for text and speech processing applications [1], [2], [3], [4], [5]. Active learning has been shown to have a particularly large potential for reducing annotation cost for text classification [6], [7]. Text classification is one of the most important fields in semantic computing and it has been used in many applications [8], [9], [10], [11], [12].
The implicit fairness criterion of unconstrained learning
Liu, Lydia T., Simchowitz, Max, Hardt, Moritz
We clarify what fairness guarantees we can and cannot expect to follow from unconstrained machine learning. Specifically, we characterize when unconstrained learning on its own implies group calibration, that is, the outcome variable is conditionally independent of group membership given the score. We show that under reasonable conditions, the deviation from satisfying group calibration is upper bounded by the excess risk of the learned score relative to the Bayes optimal score function. A lower bound confirms the optimality of our upper bound. Moreover, we prove that as the excess risk of the learned score decreases, it strongly violates separation and independence, two other standard fairness criteria. Our results show that group calibration is the fairness criterion that unconstrained learning implicitly favors. On the one hand, this means that calibration is often satisfied on its own without the need for active intervention, albeit at the cost of violating other criteria that are at odds with calibration. On the other hand, it suggests that we should be satisfied with calibration as a fairness criterion only if we are at ease with the use of unconstrained machine learning in a given application.
Positive and Unlabeled Learning through Negative Selection and Imbalance-aware Classification
Frasca, Marco, Cesa-Bianchi, Nicolรฒ
Motivated by applications in protein function prediction, we consider a challenging supervised classification setting in which positive labels are scarce and there are no explicit negative labels. The learning algorithm must thus select which unlabeled examples to use as negative training points, possibly ending up with an unbalanced learning problem. We address these issues by proposing an algorithm that combines active learning (for selecting negative examples) with imbalance-aware learning (for mitigating the label imbalance). In our experiments we observe that these two techniques operate synergistically, outperforming state-of-the-art methods on standard protein function prediction benchmarks.
Machine Learning Consultant
We are looking for outstanding Data Engineers to join our team. This is a great opportunity for a Data Science Consultant to join a consulting firm that offers a variety of projects and a structured learning and development path. You will work alongside a talented team of consultants who all share your passion in building great solutions and learning new skills. Skills in Data Engineering and Machine Learning: as a data science consultant you will have proficiency in one or more of Python, R, Scala, Matlab/Octave, Java, C/C, Go, Javascript, Clojure etc. You will also have either hands on experience or good knowledge of the one of the following concepts such as supervised learning, unsupervised learning, reinforcement learning, deep learning, feature engineering, natural language processing, computer vision, signal processing etc.
An overview of proxy-label approaches for semi-supervised learning
This post discusses semi-supervised learning algorithms that learn from proxy labels assigned to unlabelled data. Note: Parts of this post are based on my ACL 2018 paper Strong Baselines for Neural Semi-supervised Learning under Domain Shift with Barbara Plank. Unsupervised learning constitutes one of the main challenges for current machine learning models and one of the key elements that is missing for general artificial intelligence. While unsupervised learning on its own is still elusive, researchers have a made a lot of progress in combining unsupervised learning with supervised learning. This branch of machine learning research is called semi-supervised learning. Semi-supervised learning has a long history. For a (slightly outdated) overview, refer to Zhu (2005) [1] and Chapelle et al. (2006) [2].
Introduction to machine learning with Weka - Target Veb
In this tutorial a small introduction of machine learning focused on development will be done with one of the most used Java libraries for this purpose, Weka. The machine learning is a subfield of data science . If data science covers the entire process of obtaining knowledge, cleaning, analysis, visualization and data deployment, machine learning are the algorithms and techniques used in the analysis and modeling phase of this process. Within these, we will focus on supervised learning, which is often used for classification and regression problems. The classification can be applied when dealing with a discrete class, where the objective is to predict one of the mutually exclusive values in the target variable.
10 Major Machine Learning Algorithms And Their Application
Algorithms are the smart and powerful soldier of a complex machine learning model. In other words, machine learning algorithms are the core foundation when we play with data or when it's come to training the model. In this article, you and I are going on a tour called "7 major machine learning algorithms and their application " The purpose of this tour is to either brush up the mind or to gain an essential understanding of machine learning algorithm. We will find the major answer in this tour like for what purpose machine learning algorithms works, where to use them, when to use them and how to use them. Before getting deeper let's have a brief introduction. Machine learning algorithms are mainly classified into 3 broad categories i.e supervised learning, unsupervised learning, and reinforcement learning. In supervised learning machine learning algorithms, the machine is taught by example. Here the operator provides the machine learning algorithm with the dataset. This dataset includes desired inputs and outputs variables. By the use of these set of variables, we generate a function that map inputs to desired outputs.