AITopics

1901.09993

Country:

Asia > China > Hong Kong (0.04)
North America > United States > Wisconsin > Dane County > Madison (0.04)

Genre: Research Report (0.82)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Unsupervised or Indirectly Supervised Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.92)

Jung, Alexander, Hero, Alfred O. III, Mara, Alexandru, Jahromi, Saeed, Heimowitz, Ayelet, Eldar, Yonina C.

Semi-supervised Learning in Network-Structured Data via Total Variation Minimization

arXiv.org Machine LearningJan-28-2019

We propose and analyze a method for semi-supervised learning from partially-labeled network-structured data. Our approach is based on a graph signal recovery interpretation under a clustering hypothesis that labels of data points belonging to the same well-connected subset (cluster) are similar valued. This lends naturally to learning the labels by total variation (TV) minimization, which we solve by applying a recently proposed primal-dual method for non-smooth convex optimization. The resulting algorithm allows for a highly scalable implementation using message passing over the underlying empirical graph, which renders the algorithm suitable for big data applications. By applying tools of compressed sensing, we derive a sufficient condition on the underlying network structure such that TV minimization recovers clusters in the empirical graph of the data. In particular, we show that the proposed primal-dual method amounts to maximizing network flows over the empirical graph of the dataset. Moreover, the learning accuracy of the proposed algorithm is linked to the set of network flows between data points having known labels. The effectiveness and scalability of our approach is verified by numerical experiments.

graph, minimization, tv minimization, (16 more...)

1901.09838

Country:

North America > United States > New York > New York County > New York City (0.14)
North America > United States > Michigan > Washtenaw County > Ann Arbor (0.14)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
(17 more...)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Unsupervised or Indirectly Supervised Learning (0.70)
Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.61)

Few-shot Learning with Meta Metric Learners

Cheng, Yu, Yu, Mo, Guo, Xiaoxiao, Zhou, Bowen

Few-shot Learning aims to learn classifiers for new classes with only a few training examples per class. Existing meta-learning or metric-learning based few-shot learning approaches are limited in handling diverse domains with various number of labels. The meta-learning approaches train a meta learner to predict weights of homogeneous-structured task-specific networks, requiring a uniform number of classes across tasks. The metric-learning approaches learn one task-invariant metric for all the tasks, and they fail if the tasks diverge. We propose to deal with these limitations with meta metric learning. Our meta metric learning approach consists of task-specific learners, that exploit metric learning to handle flexible labels, and a meta learner, that discovers good parameters and gradient decent to specify the metrics in task-specific learners. Thus the proposed model is able to handle unbalanced classes as well as to generate task-specific metrics. We test our approach in the `$k$-shot $N$-way' few-shot learning setting used in previous work and new realistic few-shot setting with diverse multi-domain tasks and flexible label numbers. Experiments show that our approach attains superior performances in both settings.

few-shot learning task, learner, learning, (11 more...)

1901.0989

Country:

North America > United States > Hawaii > Honolulu County > Honolulu (0.04)
North America > United States > California > Los Angeles County > Long Beach (0.04)

Genre:

Research Report (0.50)
Instructional Material (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.69)

Beatty, Garrett, Kochis, Ethan, Bloodgood, Michael

The Use of Unlabeled Data versus Labeled Data for Stopping Active Learning for Text Classification

Abstract-- Annotation of training data is the major bottleneck in the creation of text classification systems. Active learning is a commonly used technique to reduce the amount of training data one needs to label. A crucial aspect of active learning is determining when to stop labeling data. Three potential sources for informing when to stop active learning are an additional labeled set of data, an unlabeled set of data, and the training data that is labeled during the process of active learning. To date, no one has compared and contrasted the advantages and disadvantages of stopping methods based on these three information sources. We find that stopping methods that use unlabeled data are more effective than methods that use labeled data. I. INTRODUCTION The use of active learning to train machine learning models has been used as a way to reduce annotation costs for text and speech processing applications [1], [2], [3], [4], [5]. Active learning has been shown to have a particularly large potential for reducing annotation cost for text classification [6], [7]. Text classification is one of the most important fields in semantic computing and it has been used in many applications [8], [9], [10], [11], [12].

active learning, learning, validation, (14 more...)

1901.09126

Country:

North America > United States > New Jersey > Mercer County > Ewing (0.14)
North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
North America > United States > New York > New York County > New York City (0.04)
(10 more...)

Genre: Research Report (0.82)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Classification (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.68)

Liu, Lydia T., Simchowitz, Max, Hardt, Moritz

The implicit fairness criterion of unconstrained learning

We clarify what fairness guarantees we can and cannot expect to follow from unconstrained machine learning. Specifically, we characterize when unconstrained learning on its own implies group calibration, that is, the outcome variable is conditionally independent of group membership given the score. We show that under reasonable conditions, the deviation from satisfying group calibration is upper bounded by the excess risk of the learned score relative to the Bayes optimal score function. A lower bound confirms the optimality of our upper bound. Moreover, we prove that as the excess risk of the learned score decreases, it strongly violates separation and independence, two other standard fairness criteria. Our results show that group calibration is the fairness criterion that unconstrained learning implicitly favors. On the one hand, this means that calibration is often satisfied on its own without the need for active intervention, albeit at the cost of violating other criteria that are at odds with calibration. On the other hand, it suggests that we should be satisfied with calibration as a fairness criterion only if we are at ease with the use of unconstrained machine learning in a given application.

calibration, excess risk, sufficiency gap, (13 more...)

1808.10013

Country:

North America > United States > Florida > Broward County (0.04)
Europe > Sweden > Stockholm > Stockholm (0.04)
North America > United States > New York > New York County > New York City (0.04)
(4 more...)

Genre: Research Report > New Finding (1.00)

Industry:

Law (0.46)
Education (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.47)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.46)

Frasca, Marco, Cesa-Bianchi, Nicolò

Positive and Unlabeled Learning through Negative Selection and Imbalance-aware Classification

Motivated by applications in protein function prediction, we consider a challenging supervised classification setting in which positive labels are scarce and there are no explicit negative labels. The learning algorithm must thus select which unlabeled examples to use as negative training points, possibly ending up with an unbalanced learning problem. We address these issues by proposing an algorithm that combines active learning (for selecting negative examples) with imbalance-aware learning (for mitigating the label imbalance). In our experiments we observe that these two techniques operate synergistically, outperforming state-of-the-art methods on standard protein function prediction benchmarks.

algorithm, prediction, protein, (15 more...)

1805.07331

Country:

North America > United States > New York > New York County > New York City (0.04)
Europe > Italy (0.04)
Asia > Middle East > Jordan (0.04)
(3 more...)

Genre: Research Report > New Finding (1.00)

Industry: Health & Medicine > Pharmaceuticals & Biotechnology (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (1.00)

#artificialintelligenceJan-24-2019, 23:28:31 GMT

Machine Learning Consultant

We are looking for outstanding Data Engineers to join our team. This is a great opportunity for a Data Science Consultant to join a consulting firm that offers a variety of projects and a structured learning and development path. You will work alongside a talented team of consultants who all share your passion in building great solutions and learning new skills. Skills in Data Engineering and Machine Learning: as a data science consultant you will have proficiency in one or more of Python, R, Scala, Matlab/Octave, Java, C/C, Go, Javascript, Clojure etc. You will also have either hands on experience or good knowledge of the one of the following concepts such as supervised learning, unsupervised learning, reinforcement learning, deep learning, feature engineering, natural language processing, computer vision, signal processing etc.

artificial intelligence, consultant, machine learning, (12 more...)

#artificialintelligence

Industry:

Education (0.65)
Professional Services (0.48)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.61)
Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.59)

#artificialintelligenceJan-24-2019, 14:21:08 GMT

An overview of proxy-label approaches for semi-supervised learning

This post discusses semi-supervised learning algorithms that learn from proxy labels assigned to unlabelled data. Note: Parts of this post are based on my ACL 2018 paper Strong Baselines for Neural Semi-supervised Learning under Domain Shift with Barbara Plank. Unsupervised learning constitutes one of the main challenges for current machine learning models and one of the key elements that is missing for general artificial intelligence. While unsupervised learning on its own is still elusive, researchers have a made a lot of progress in combining unsupervised learning with supervised learning. This branch of machine learning research is called semi-supervised learning. Semi-supervised learning has a long history. For a (slightly outdated) overview, refer to Zhu (2005) [1] and Chapelle et al. (2006) [2].

artificial intelligence, machine learning, prediction, (16 more...)

#artificialintelligence

Industry: Education (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Unsupervised or Indirectly Supervised Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)

#artificialintelligenceJan-21-2019, 16:27:19 GMT

Introduction to machine learning with Weka - Target Veb

In this tutorial a small introduction of machine learning focused on development will be done with one of the most used Java libraries for this purpose, Weka. The machine learning is a subfield of data science . If data science covers the entire process of obtaining knowledge, cleaning, analysis, visualization and data deployment, machine learning are the algorithms and techniques used in the analysis and modeling phase of this process. Within these, we will focus on supervised learning, which is often used for classification and regression problems. The classification can be applied when dealing with a discrete class, where the objective is to predict one of the mutually exclusive values in the target variable.

classification, regression, weka, (16 more...)

#artificialintelligence

Country: Oceania > New Zealand > North Island > Waikato (0.05)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.35)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis (0.32)
Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (0.32)

#artificialintelligenceJan-21-2019, 16:26:22 GMT

10 Major Machine Learning Algorithms And Their Application

Algorithms are the smart and powerful soldier of a complex machine learning model. In other words, machine learning algorithms are the core foundation when we play with data or when it's come to training the model. In this article, you and I are going on a tour called "7 major machine learning algorithms and their application " The purpose of this tour is to either brush up the mind or to gain an essential understanding of machine learning algorithm. We will find the major answer in this tour like for what purpose machine learning algorithms works, where to use them, when to use them and how to use them. Before getting deeper let's have a brief introduction. Machine learning algorithms are mainly classified into 3 broad categories i.e supervised learning, unsupervised learning, and reinforcement learning. In supervised learning machine learning algorithms, the machine is taught by example. Here the operator provides the machine learning algorithm with the dataset. This dataset includes desired inputs and outputs variables. By the use of these set of variables, we generate a function that map inputs to desired outputs.