Goto

Collaborating Authors

 Unsupervised or Indirectly Supervised Learning


Machine Learning for Humans, Part 3: Unsupervised Learning

@machinelearnbot

How do you find the underlying structure of a dataset? How do you summarize it and group it most usefully? How do you effectively represent data in a compressed format? These are the goals of unsupervised learning, which is called "unsupervised" because you start with unlabeled data (there's no Y). The two unsupervised learning tasks we will explore are clustering the data into groups by similarity and reducing dimensionality to compress the data while maintaining its structure and usefulness.


Discriminative Similarity for Clustering and Semi-Supervised Learning

arXiv.org Machine Learning

Similarity-based clustering and semi-supervised learning methods separate the data into clusters or classes according to the pairwise similarity between the data, and the pairwise similarity is crucial for their performance. In this paper, we propose a novel discriminative similarity learning framework which learns discriminative similarity for either data clustering or semi-supervised learning. The proposed framework learns classifier from each hypothetical labeling, and searches for the optimal labeling by minimizing the generalization error of the learned classifiers associated with the hypothetical labeling. Kernel classifier is employed in our framework. By generalization analysis via Rademacher complexity, the generalization error bound for the kernel classifier learned from hypothetical labeling is expressed as the sum of pairwise similarity between the data from different classes, parameterized by the weights of the kernel classifier. Such pairwise similarity serves as the discriminative similarity for the purpose of clustering and semi-supervised learning, and discriminative similarity with similar form can also be induced by the integrated squared error bound for kernel density classification. Based on the discriminative similarity induced by the kernel classifier, we propose new clustering and semi-supervised learning methods.



Which machine learning algorithm should I use? DataScience.US

#artificialintelligence

A typical question asked by a beginner, when facing a wide variety of machine learning algorithms, is "which algorithm should I use?" Even an experienced data scientist cannot tell which algorithm will perform the best before trying different algorithms. We are not advocating a one and done approach, but we do hope to provide some guidance on which algorithms to try first depending on some clear factors. The machine learning algorithm cheat sheet helps you to choose from a variety of machine learning algorithms to find the appropriate algorithm for your specific problems. This article walks you through the process of how to use the sheet.


Apple wins 'Best Paper Award' at prestigious machine learning conference

#artificialintelligence

With recent progress in graphics, it has become more tractable to train models on synthetic images, poten- tially avoiding the need for expensive annotations. How- ever, learning from synthetic images may not achieve the desired performance due to a gap between synthetic and real image distributions. To reduce this gap, we pro- pose Simulated Unsupervised (S U) learning, where the task is to learn a model to improve the realism of a simulator's output using unlabeled real data, while preserving the annotation information from the simula- tor. We develop a method for S U learning that uses an adversarial network similar to Generative Adversarial Networks (GANs), but with synthetic images as inputs instead of random vectors. We make several key modifi- cations to the standard GAN algorithm to preserve an- notations, avoid artifacts, and stabilize training: (i) a'self-regularization' term, (ii) a local adversarial loss, and (iii) updating the discriminator using a history of refined images.


Introduction to Clustering and Unsupervised Learning PACKT Books

#artificialintelligence

The act of clustering, or spotting patterns in data, is not much different from spotting patterns in groups of people. Before jumping into action, we'll begin by taking an in-depth look at exactly what clustering entails. Clustering is an unsupervised machine learning task that automatically divides the data into clusters, or groups of similar items. It does this without having been told how the groups should look ahead of time. As we may not even know what we're looking for, clustering is used for knowledge discovery rather than prediction. It provides an insight into the natural groupings found within data.


Generative Adversarial Networks (GANs): Engine and Applications

@machinelearnbot

The latent layer consists of 5 neurons, one of which is responsible for GI (efficiency against cancer cells) and the four others are discriminated with normal distribution. So, a regression term was added to the Encoder cost function. Furthermore, the Encoder was restricted to map the same fingerprint to the same latent vector, independently from input concentration by additional manifold cost. After training, it is possible to generate molecules from a desired distribution and use a GI-neuron as a tuner of output compounds. Results of this work are the following: the trained AAE model predicted compounds that are already proven to be anticancer agents and new untested compounds that should be validated with experiments on anticancer activity.


Is unsupervised learning the key to artificial intelligence?

@machinelearnbot

Please tell us who we're sharing this with and we'll get you on your way to the report. We will not share your personal info.


Machine Learning: An In-Depth Guide – Unsupervised Learning, Related Fields, and Machine Learning in Practice

#artificialintelligence

Welcome to the fifth and final article in a five-part series about machine learning. In this final article, we will revisit unsupervised learning in greater depth, briefly discuss other fields related to machine learning, and finish the series with some examples of real-world machine learning applications. Recall that unsupervised learning involves learning from data, but without the goal of prediction. This is because the data is either not given with a target response variable (label), or one chooses not to designate a response. It can also be used as a pre-processing step for supervised learning.


Supervised and Unsupervised Machine Learning Algorithms - Machine Learning Mastery

#artificialintelligence

What is supervised machine learning and how does it relate to unsupervised machine learning? In this post you will discover supervised learning, unsupervised learning and semis-supervised learning. Supervised and Unsupervised Machine Learning Algorithms Photo by US Department of Education, some rights reserved. The majority of practical machine learning uses supervised learning. Supervised learning is where you have input variables (x) and an output variable (Y) and you use an algorithm to learn the mapping function from the input to the output.