AITopics | Unsupervised or Indirectly Supervised Learning

Collaborating Authors

Unsupervised or Indirectly Supervised Learning

Unsupervised learning is a branch of machine learning that learns from test data that has not been labeled, classified or categorized. Instead of responding to feedback, unsupervised learning identifies commonalities in the data and reacts based on the presence or absence of such commonalities in each new piece of data. (Wikipedia)

News Overviews Instructional Materials AI-Alerts Classics

CS 229 - Unsupervised Learning Cheatsheet

#artificialintelligenceMar-15-2021, 06:25:10 GMT

Motivation The goal of unsupervised learning is to find hidden patterns in unlabeled data $\{x {(1)},...,x {(m)}\}$. Jensen's inequality Let $f$ be a convex function and $X$ a random variable. Latent variables Latent variables are hidden/unobserved variables that make estimation problems difficult, and are often denoted $z$. We note $c {(i)}$ the cluster of data point $i$ and $\mu_j$ the center of cluster $j$. Algorithm After randomly initializing the cluster centroids $\mu_1,\mu_2,...,\mu_k\in\mathbb{R} n$, the $k$-means algorithm repeats the following step until convergence: Algorithm It is a clustering algorithm with an agglomerative hierarchical approach that build nested clusters in a successive manner. In an unsupervised learning setting, it is often hard to assess the performance of a model since we don't have the ground truth labels as was the case in the supervised learning setting.

algorithm, matrix, unsupervised learning cheatsheet, (8 more...)

#artificialintelligence

Country: North America > United States > California > Santa Clara County > Palo Alto (0.40)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Unsupervised or Indirectly Supervised Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (0.54)

Add feedback

Machine Learning in a Day

#artificialintelligenceMar-11-2021, 05:50:09 GMT

Learn "Machine Learning" in a Day.An eBook specially designed for novicesFundamentals of Machine Learning with real-life examples and exercises. Specially for professionals such as Doctors, Lawyers, Business Professionals, Artists and Content Creators.Chapter 1:Introduction:What is Artificial Intelligence (AI) ?What is Machine Learning (ML) ?Different types of Machine LearningChapter 2:Supervised Machine LearningReal-life examples of Supervised Machine LearningApplications of Supervised learning in medicine, law, finance and artChapter 3:Semi-Supervised Machine LearningHow to use Semi-Supervised Machine Learning?Applications of Semi-Supervised learning in medicine, law, finance and artChapter 4:Weakly-Supervised Machine LearningExamples of Weakly-Supervised Machine LearningApplications of Weakly-Supervised learning in medicine, law, finance and artChapter 5:Unsupervised Machine LearningDescription about Unsupervised Machine LearningPromising applications of Unsupervised learning in medicine, law, finance and artChapter 6:Self-supervised Machine LearningFuture of Self-supervised Machine LearningHow self-supervised learning can be used in law, finance, medicine and artChapter 7:Deep LearningHow Deep learning is changing the scope of Artificial IntelligenceApplications of Self-supervised learning in medicine, law, finance and artChapter 7:Future Directions10 futuristic applications of Machine LearningIf you are not satisfied, email us to get your money back. 100% Moneyback Guarantee!

application, learning, machine learning, (6 more...)

#artificialintelligence

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Unsupervised or Indirectly Supervised Learning (0.60)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.30)

Add feedback

Unsupervised machine learning: Dealing with unknown data

#artificialintelligenceMar-7-2021, 15:35:43 GMT

The unsupervised learning model of machine learning uses specific algorithms to deal with unclassified and unlabeled data. This Arcitura Education article explains the model, dimension reduction algorithms, concept of reinforcement learning and more.

algorithm, toy company, unsupervised learning, (16 more...)

#artificialintelligence

Genre: Questionnaire & Opinion Survey (0.30)

Industry: Health & Medicine (0.30)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Unsupervised or Indirectly Supervised Learning (0.76)

Add feedback

Essential Skills You Need For Doing Machine Learning

#artificialintelligenceMar-3-2021, 14:30:37 GMT

Tagged by many as the technology with the highest demand in the modern era, Machine Learning (ML) is a field of study within the Artificial Intelligence (AI) domain that allows computers to learn from experience and improve on its own when exposed to new data, independent of human intervention or explicit programming. It uses an algorithm method to extract patterns out of raw data. In Machine learning, a computer is made to perform a task without explicitly programming it. Basically, there are two kinds of machine learning tasks. They are: Supervised Learning and Unsupervised Learning. In supervised learning, the system is presented with some example inputs, based on which the desired outputs are to be formed.

machine learning, programming language, python, (13 more...)

#artificialintelligence

Technology: Information Technology > Artificial Intelligence > Machine Learning > Unsupervised or Indirectly Supervised Learning (0.36)

Add feedback

Comparing the Value of Labeled and Unlabeled Data in Method-of-Moments Latent Variable Estimation

Chen, Mayee F., Cohen-Wang, Benjamin, Mussmann, Stephen, Sala, Frederic, Ré, Christopher

arXiv.org Machine LearningMar-3-2021

Labeling data for modern machine learning is expensive and time-consuming. Latent variable models can be used to infer labels from weaker, easier-to-acquire sources operating on unlabeled data. Such models can also be trained using labeled data, presenting a key question: should a user invest in few labeled or many unlabeled points? We answer this via a framework centered on model misspecification in method-of-moments latent variable estimation. Our core result is a bias-variance decomposition of the generalization error, which shows that the unlabeled-only approach incurs additional bias under misspecification. We then introduce a correction that provably removes this bias in certain cases. We apply our decomposition framework to three scenarios -- well-specified, misspecified, and corrected models -- to 1) choose between labeled and unlabeled data and 2) learn from their combination. We observe theoretically and with synthetic experiments that for well-specified models, labeled points are worth a constant factor more than unlabeled points. With misspecification, however, their relative value is higher due to the additional bias but can be reduced with correction. We also apply our approach to study real-world weak supervision techniques for dataset construction.

estimator, generalization error, misspecification, (15 more...)

arXiv.org Machine Learning

2103.02761

Country:

Asia > Japan > Shikoku > Kagawa Prefecture > Takamatsu (0.04)
North America > United States > California > Santa Clara County > Palo Alto (0.04)
North America > United States > California > San Diego County > San Diego (0.04)
(2 more...)

Genre: Research Report (0.82)

Industry: Semiconductors & Electronics (0.45)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Unsupervised or Indirectly Supervised Learning (0.93)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.46)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.46)

Add feedback

Explaining Adversarial Vulnerability with a Data Sparsity Hypothesis

Paknezhad, Mahsa, Ngo, Cuong Phuc, Winarto, Amadeus Aristo, Cheong, Alistair, Yang, Beh Chuen, Jiayang, Wu, Kuan, Lee Hwee

arXiv.org Artificial IntelligenceMar-1-2021

Despite many proposed algorithms to provide robustness to deep learning (DL) models, DL models remain susceptible to adversarial attacks. We hypothesize that the adversarial vulnerability of DL models stems from two factors. The first factor is data sparsity which is that in the high dimensional data space, there are large regions outside the support of the data distribution. The second factor is the existence of many redundant parameters in the DL models. Owing to these factors, different models are able to come up with different decision boundaries with comparably high prediction accuracy. The appearance of the decision boundaries in the space outside the support of the data distribution does not affect the prediction accuracy of the model. However, they make an important difference in the adversarial robustness of the model. We propose that the ideal decision boundary should be as far as possible from the support of the data distribution.\par In this paper, we develop a training framework for DL models to learn such decision boundaries spanning the space around the class distributions further from the data points themselves. Semi-supervised learning was deployed to achieve this objective by leveraging unlabeled data generated in the space outside the support of the data distribution. We measure adversarial robustness of the models trained using this training framework against well-known adversarial attacks We found that our results, other regularization methods and adversarial training also support our hypothesis of data sparcity. We show that the unlabeled data generated by noise using our framework is almost as effective as unlabeled data, sourced from existing data sets or generated by synthesis algorithms, on adversarial robustness. Our code is available at https://github.com/MahsaPaknezhad/AdversariallyRobustTraining.

accuracy, training framework, unlabeled data, (15 more...)

arXiv.org Artificial Intelligence

2103.00778

Country:

Asia > Singapore (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Research Report > New Finding (0.66)

Industry: Information Technology > Security & Privacy (0.54)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Unsupervised or Indirectly Supervised Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.88)

Add feedback

Data Science Techniques & Applications - Workflow

#artificialintelligenceFeb-25-2021, 22:00:31 GMT

Editor's note: In their book, "Data Science: Concepts and Practice," authors Vijay Kotu and Bala Deshpande explain the core principles and applications of modern data science. Kotu is vice president of analytics at ServiceNow; Deshpande is a data scientist and consultant. This article, which focuses on primary applications of data science, is adapted with permission. Data science problems can be broadly categorized into supervised or unsupervised learning models. Supervised or directed data science tries to infer a function or relationship based on known (labeled) training data and uses this function to map new unknown (unlabeled) data--for example, predicting if a current customer will not return based on the behaviors of all customers who have left before.

application, data science, output variable, (8 more...)

#artificialintelligence

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Unsupervised or Indirectly Supervised Learning (0.61)

Add feedback

MaskCycleGAN-VC: Learning Non-parallel Voice Conversion with Filling in Frames

Kaneko, Takuhiro, Kameoka, Hirokazu, Tanaka, Kou, Hojo, Nobukatsu

arXiv.org Machine LearningFeb-25-2021

Non-parallel voice conversion (VC) is a technique for training voice converters without a parallel corpus. Cycle-consistent adversarial network-based VCs (CycleGAN-VC and CycleGAN-VC2) are widely accepted as benchmark methods. However, owing to their insufficient ability to grasp time-frequency structures, their application is limited to mel-cepstrum conversion and not mel-spectrogram conversion despite recent advances in mel-spectrogram vocoders. To overcome this, CycleGAN-VC3, an improved variant of CycleGAN-VC2 that incorporates an additional module called time-frequency adaptive normalization (TFAN), has been proposed. However, an increase in the number of learned parameters is imposed. As an alternative, we propose MaskCycleGAN-VC, which is another extension of CycleGAN-VC2 and is trained using a novel auxiliary task called filling in frames (FIF). With FIF, we apply a temporal mask to the input mel-spectrogram and encourage the converter to fill in missing frames based on surrounding frames. This task allows the converter to learn time-frequency structures in a self-supervised manner and eliminates the need for an additional module such as TFAN. A subjective evaluation of the naturalness and speaker similarity showed that MaskCycleGAN-VC outperformed both CycleGAN-VC2 and CycleGAN-VC3 with a model size similar to that of CycleGAN-VC2. Audio samples are available at http://www.kecl.ntt.co.jp/people/kaneko.takuhiro/projects/maskcyclegan-vc/index.html.

conversion, proc, voice conversion, (12 more...)

arXiv.org Machine Learning

2102.12841

Country:

Europe > Italy > Calabria > Catanzaro Province > Catanzaro (0.04)
Asia > Japan (0.04)
South America > Colombia > Meta Department > Villavicencio (0.04)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Unsupervised or Indirectly Supervised Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

Representational aspects of depth and conditioning in normalizing flows

AIHubFeb-15-2021, 10:50:41 GMT

The promise of unsupervised learning lies in its potential to take advantage of cheap and plentiful unlabeled data to learn useful representations or generate high-quality samples. For the latter task, neural network-based generative models have recently enjoyed a lot of success in producing realistic images and text. Two major paradigms in deep generative modeling are generative adversarial networks (GANs) and normalizing flows. When successfully scaled up and trained, both can generate high-quality and diverse samples from high-dimensional distributions. The training procedure for GANs involves min-max (saddle-point) optimization, which is considerably more difficult than standard loss minimization, leading to problems like mode dropping.

affine coupling, conditioning, transformation, (16 more...)

AIHub

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Unsupervised or Indirectly Supervised Learning (0.89)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.69)

Add feedback

Multi-class Generative Adversarial Nets for Semi-supervised Image Classification

Motamed, Saman, Khalvati, Farzad

arXiv.org Artificial IntelligenceFeb-13-2021

From generating never-before-seen images to domain adaptation, applications of Generative Adversarial Networks (GANs) spread wide in the domain of vision and graphics problems. With the remarkable ability of GANs in learning the distribution and generating images of a particular class, they can be used for semi-supervised classification tasks. However, the problem is that if two classes of images share similar characteristics, the GAN might learn to generalize and hinder the classification of the two classes. In this paper, we use various images from MNIST and Fashion-MNIST datasets to illustrate how similar images cause the GAN to generalize, leading to the poor classification of images. We propose a modification to the traditional training of GANs that allows for improved multi-class classification in similar classes of images in a semi-supervised learning framework.

classification, gan, generate image, (16 more...)

arXiv.org Artificial Intelligence

2102.06944

Country: North America > Canada > Ontario > Toronto (0.15)

Genre: Research Report (0.64)

Industry: Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (0.33)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Unsupervised or Indirectly Supervised Learning (0.56)
Information Technology > Artificial Intelligence > Vision > Image Understanding (0.52)

Add feedback