AITopics | Unsupervised or Indirectly Supervised Learning

Unsupervised or Indirectly Supervised Learning

Unsupervised learning is a branch of machine learning that learns from test data that has not been labeled, classified or categorized. Instead of responding to feedback, unsupervised learning identifies commonalities in the data and reacts based on the presence or absence of such commonalities in each new piece of data. (Wikipedia)

Contrastive Regularization for Semi-Supervised Learning

Lee, Doyup, Kim, Sungwoong, Kim, Ildoo, Cheon, Yeongjae, Cho, Minsu, Han, Wook-Shin

arXiv.org Machine Learning | Jan-17-2022

Consistency regularization on label predictions has become a fundamental technique in semi-supervised learning, but it still requires a large number of training iterations to reach high performance. In this study, we show that consistency regularization restricts the propagation of labeling information because samples with unconfident pseudo-labels are excluded from the model updates. We then propose contrastive regularization, which improves both the efficiency and the accuracy of consistency regularization through well-clustered features of unlabeled data. Specifically, after strongly augmented samples are assigned to clusters by their pseudo-labels, our contrastive regularization updates the model so that the features with confident pseudo-labels aggregate the features in the same cluster, while pushing away features in different clusters. As a result, the information of confident pseudo-labels can be effectively propagated into more unlabeled samples during training by the well-clustered features. On benchmarks of semi-supervised learning tasks, our contrastive regularization improves on previous consistency-based methods and achieves state-of-the-art results, especially with fewer training iterations. Our method also shows robust performance on open-set semi-supervised learning, where the unlabeled data includes out-of-distribution samples.
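
The clustering behaviour described in the abstract can be illustrated with a small contrastive loss over pseudo-labeled features. This is a minimal sketch and not the authors' implementation: it assumes a supervised-contrastive-style loss on L2-normalized features, and the function name `contrastive_regularization`, the confidence threshold of 0.95, and the temperature of 0.1 are hypothetical choices made for illustration.

```python
import math

def normalize(v):
    """Scale a vector to unit L2 norm."""
    n = math.sqrt(sum(x * x for x in v))
    return [x / n for x in v]

def contrastive_regularization(features, pseudo_labels, confidences,
                               threshold=0.95, temperature=0.1):
    """Average contrastive loss over anchors with confident pseudo-labels.

    Assumed form (not taken from the paper): each confident anchor is pulled
    toward every other sample sharing its pseudo-label and pushed away from
    the remaining samples via a softmax over scaled cosine similarities.
    """
    feats = [normalize(f) for f in features]
    n = len(feats)
    total, num_anchors = 0.0, 0
    for i in range(n):
        if confidences[i] < threshold:
            continue  # unconfident samples never act as anchors
        positives = [j for j in range(n)
                     if j != i and pseudo_labels[j] == pseudo_labels[i]]
        if not positives:
            continue
        # Scaled cosine similarity between the anchor and every other sample.
        logits = {j: sum(a * b for a, b in zip(feats[i], feats[j])) / temperature
                  for j in range(n) if j != i}
        # Numerically stable log-sum-exp for the softmax denominator.
        max_logit = max(logits.values())
        log_denom = max_logit + math.log(
            sum(math.exp(v - max_logit) for v in logits.values()))
        total += -sum(logits[j] - log_denom for j in positives) / len(positives)
        num_anchors += 1
    return total / num_anchors if num_anchors else 0.0

labels = [0, 0, 1, 1]
confidences = [0.99, 0.60, 0.98, 0.70]
clustered = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0], [0.1, 0.9]]
mixed = [[1.0, 0.0], [0.0, 1.0], [0.9, 0.1], [0.1, 0.9]]

loss_clustered = contrastive_regularization(clustered, labels, confidences)
loss_mixed = contrastive_regularization(mixed, labels, confidences)
```

In this toy setup the loss is lower when features that share a pseudo-label already sit close together (`clustered`) than when they do not (`mixed`). Note that unconfident samples still take part as positives and negatives, which is one way to read the abstract's point about propagating confident labels to more unlabeled samples.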

contrastive regularization, regularization, unlabeled sample, (13 more...)

arXiv.org Machine Learning

2201.06247

Country: Asia > South Korea (0.04)

Genre: Research Report (0.84)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Unsupervised or Indirectly Supervised Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.94)

Graph Machine Learning with Python Part 3: Unsupervised Learning

#artificialintelligence | Jan-14-2022, 18:28:26 GMT

In part 1, I introduced how we can reason from graphs, why they're so useful, metrics for analyzing and condensing large information, and more. In part 2, I took a look at the CryptoPunks trading network to introduce a higher level of reasoning of graphs -- random worlds and diffusion models. I then took a little bit of a tangent to discuss how we can use Network and Graph Analysis to look at NBA games. This part will use concepts introduced in that story to further my analysis of Graph Machine Learning. I highly recommend reviewing these previous parts before diving into this one as they set this one up well, and many concepts I won't dive into here are already discussed and shown in each of those.

dataset, graph machine learning, learning, (7 more...)

#artificialintelligence

Country: North America > United States > New York > New York County > New York City (0.11)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Unsupervised or Indirectly Supervised Learning (0.40)

S$^2$FPR: Crowd Counting via Self-Supervised Coarse to Fine Feature Pyramid Ranking

Gao, Jiaqi, Huang, Zhizhong, Lei, Yiming, Wang, James Z., Wang, Fei-Yue, Zhang, Junping

arXiv.org Artificial Intelligence | Jan-13-2022

Most conventional crowd counting methods utilize a fully-supervised learning framework to learn a mapping between scene images and crowd density maps. Under the circumstances of such fully-supervised training settings, a large quantity of expensive and time-consuming pixel-level annotations are required to generate density maps as the supervision. One way to reduce costly labeling is to exploit self-structural information and inner-relations among unlabeled images. Unlike the previous methods utilizing these relations and structural information from the original image level, we explore such self-relations from the latent feature spaces because it can extract more abundant relations and structural information. Specifically, we propose S$^2$FPR which can extract structural information and learn partial orders of coarse-to-fine pyramid features in the latent space for better crowd counting with massive unlabeled images. In addition, we collect a new unlabeled crowd counting dataset (FUDAN-UCC) with 4,000 images in total for training. One by-product is that our proposed S$^2$FPR method can leverage numerous partial orders in the latent space among unlabeled images to strengthen the model representation capability and reduce the estimation errors for the crowd counting task. Extensive experiments on four benchmark datasets, i.e. the UCF-QNRF, the ShanghaiTech PartA and PartB, and the UCF-CC-50, show the effectiveness of our method compared with previous semi-supervised methods. The source code and dataset are available at https://github.com/bridgeqiqi/S2FPR.

computer vision, dataset, proceedings, (13 more...)

arXiv.org Artificial Intelligence

2201.04819

Country:

Asia > China > Shanghai > Shanghai (0.04)
Asia > Macao (0.04)
Asia > China > Shandong Province > Qingdao (0.04)
(2 more...)

Genre: Research Report (0.50)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Unsupervised or Indirectly Supervised Learning (0.68)

High Resolution Generative Adversarial Networks

#artificialintelligence | Jan-11-2022, 09:44:24 GMT

Create a GAN capable of generating high resolution images using TensorFlow 2.0 · Distribute training on a TPU or multiple GPUs · Implement the R2

This course covers the fundamentals necessary for a state-of-the-art GAN. Anyone who has experimented with GANs on their own knows that it's easy to throw together a GAN that spits out MNIST digits, but it's another level of difficulty entirely to produce photorealistic images at a resolution higher than a thumbnail. You'll create and train a GAN that can be used in real-world applications. And because training high-resolution networks of any kind is computationally expensive, you'll also learn how to distribute your training across multiple GPUs or TPUs. This allows students to train generators up to 512x512 resolution with no hardware costs at all.

gan, high resolution generative adversarial network, mnist digit, (1 more...)

#artificialintelligence

Genre: Instructional Material > Course Syllabus & Notes (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.53)
Information Technology > Artificial Intelligence > Machine Learning > Unsupervised or Indirectly Supervised Learning (0.40)

Review -- Unsupervised Learning of Visual Representations using Videos

#artificialintelligence | Jan-4-2022, 14:29:47 GMT

Thus the final output of each network is a 1024-dimensional feature space f(). The distance between the query image patch and the tracked patch is encouraged to be small, while the distance between the query patch and other random patches is encouraged to be larger.
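
The ranking objective described above can be sketched as a hinge loss over patch triplets. This is a minimal sketch, not the reviewed paper's code: the use of cosine distance, the margin value of 0.5, and the names `cosine_distance` and `triplet_ranking_loss` are assumptions chosen for illustration, and 3-dimensional vectors stand in for the 1024-dimensional features.

```python
import math

def cosine_distance(a, b):
    """Return 1 minus the cosine similarity of two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return 1.0 - dot / (norm_a * norm_b)

def triplet_ranking_loss(query, tracked, random_patch, margin=0.5):
    """Hinge loss that is zero once the tracked patch is closer to the
    query than the random patch by at least `margin` (assumed value)."""
    d_pos = cosine_distance(query, tracked)
    d_neg = cosine_distance(query, random_patch)
    return max(0.0, d_pos - d_neg + margin)

# Toy features: the tracked patch nearly matches the query, the random one does not.
query = [1.0, 0.0, 0.0]
tracked = [0.9, 0.1, 0.0]
random_patch = [0.0, 1.0, 0.0]

loss_good = triplet_ranking_loss(query, tracked, random_patch)
loss_bad = triplet_ranking_loss(query, random_patch, tracked)  # roles swapped
```

With these toy vectors the correctly ordered triplet incurs no loss, while swapping the tracked and random patches produces a positive loss that a training step would try to reduce.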

unsupervised learning, video, visual representation

#artificialintelligence

Technology: Information Technology > Artificial Intelligence > Machine Learning > Unsupervised or Indirectly Supervised Learning (0.40)

A General Framework for Treatment Effect Estimation in Semi-Supervised and High Dimensional Settings

Chakrabortty, Abhishek, Dai, Guorong, Tchetgen, Eric Tchetgen

arXiv.org Machine Learning | Jan-2-2022

In this article, we aim to provide a general and complete understanding of semi-supervised (SS) causal inference for treatment effects. Specifically, we consider two such estimands: (a) the average treatment effect and (b) the quantile treatment effect, as prototype cases, in an SS setting, characterized by two available data sets: (i) a labeled data set of size $n$, providing observations for a response and a set of high dimensional covariates, as well as a binary treatment indicator; and (ii) an unlabeled data set of size $N$, much larger than $n$, but without the response observed. Using these two data sets, we develop a family of SS estimators which are ensured to be: (1) more robust and (2) more efficient than their supervised counterparts based on the labeled data set only. Beyond the 'standard' double robustness results (in terms of consistency) that can be achieved by supervised methods as well, we further establish root-n consistency and asymptotic normality of our SS estimators whenever the propensity score in the model is correctly specified, without requiring specific forms of the nuisance functions involved. Such an improvement of robustness arises from the use of the massive unlabeled data, so it is generally not attainable in a purely supervised setting. In addition, our estimators are shown to be semi-parametrically efficient as long as all the nuisance functions are correctly specified. Moreover, as an illustration of the nuisance estimators, we consider inverse-probability-weighting type kernel smoothing estimators involving unknown covariate transformation mechanisms, and establish in high dimensional scenarios novel results on their uniform convergence rates, which should be of independent interest. Numerical results on both simulated and real data validate the advantage of our methods over their supervised counterparts with respect to both robustness and efficiency.
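
For reference, the two estimands named in the abstract are commonly written in potential-outcome notation as shown here. The symbols $Y(1)$ and $Y(0)$ (potential responses under treatment and control), $\mu_0$, $\theta_0(\tau)$ and the quantile operator $Q_\tau$ are assumptions introduced for illustration and may differ from the paper's own notation.

```latex
% (a) average treatment effect
\mu_0 = \mathbb{E}[Y(1)] - \mathbb{E}[Y(0)]

% (b) quantile treatment effect at level \tau \in (0, 1)
\theta_0(\tau) = Q_\tau\{Y(1)\} - Q_\tau\{Y(0)\}
```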

assumption 4, estimator, quad 0, (16 more...)

arXiv.org Machine Learning

2201.00468

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.14)
North America > United States > California > Santa Clara County > Palo Alto (0.14)
North America > United States > Wisconsin > Dane County > Madison (0.04)
(2 more...)

Genre:

Research Report > New Finding (0.67)
Research Report > Experimental Study (0.46)

Industry:

Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (1.00)
Health & Medicine > Therapeutic Area > Immunology (1.00)
Health & Medicine > Pharmaceuticals & Biotechnology (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Unsupervised or Indirectly Supervised Learning (0.68)

DCGANs: Key Takeaways

#artificialintelligence | Dec-26-2021, 19:35:16 GMT

When we use labeled data to train a machine-learning algorithm (e.g. ...).
When we use unlabeled data to train a machine-learning algorithm and allow it to find patterns in the data (e.g. ...).
Using dimensionality reduction to transform raw data into numerical features that can be processed with machine learning while retaining information about the original data (e.g. ...).
An unsupervised learning task where the algorithm learns patterns in input data to generate new examples (fake data) that would appear to have been drawn from the original dataset.
The part of the GAN that generates fake data.

convolution, dcgan, key takeaway, (1 more...)

#artificialintelligence

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Unsupervised or Indirectly Supervised Learning (0.63)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.43)

Inductive Semi-supervised Learning Through Optimal Transport

Hamri, Mourad El, Bennani, Younès, Falih, Issam

arXiv.org Machine Learning | Dec-14-2021

In this paper, we tackle the inductive semi-supervised learning problem, which aims to obtain label predictions for out-of-sample data. The proposed approach, called Optimal Transport Induction (OTI), efficiently extends an optimal-transport-based transductive algorithm (OTP) to inductive tasks for both binary and multi-class settings. A series of experiments is conducted on several datasets to compare the proposed approach with state-of-the-art methods. Experiments demonstrate the effectiveness of our approach. We make our code publicly available.
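
To give a feel for how optimal transport can carry labels from labeled to unlabeled points, here is a small illustration. It is not the OTP or OTI algorithm from the paper: it assumes entropy-regularized transport solved with Sinkhorn iterations, uniform marginals, a squared-distance cost on 1-D points, and a simple "largest transported mass wins" labeling rule. The names `sinkhorn_plan`, `reg` and `n_iters` are hypothetical.

```python
import math

def sinkhorn_plan(cost, reg=0.1, n_iters=200):
    """Entropy-regularized transport plan between two uniform distributions.

    cost[i][j] is the cost of moving mass from source point i to target j.
    """
    n, m = len(cost), len(cost[0])
    a = [1.0 / n] * n  # uniform mass on source (unlabeled) points
    b = [1.0 / m] * m  # uniform mass on target (labeled) points
    K = [[math.exp(-c / reg) for c in row] for row in cost]
    u = [1.0] * n
    v = [1.0] * m
    for _ in range(n_iters):
        # Alternately rescale rows and columns to match the marginals.
        u = [a[i] / sum(K[i][j] * v[j] for j in range(m)) for i in range(n)]
        v = [b[j] / sum(K[i][j] * u[i] for i in range(n)) for j in range(m)]
    return [[u[i] * K[i][j] * v[j] for j in range(m)] for i in range(n)]

# Toy data: two labeled points (one per class) and two unlabeled points.
labeled_x = [0.0, 1.0]
labeled_y = [0, 1]
unlabeled_x = [0.1, 0.9]

cost = [[(x - z) ** 2 for z in labeled_x] for x in unlabeled_x]
plan = sinkhorn_plan(cost)

# Each unlabeled point takes the class that receives most of its mass.
num_classes = 2
predictions = []
for row in plan:
    class_mass = [0.0] * num_classes
    for j, mass in enumerate(row):
        class_mass[labeled_y[j]] += mass
    predictions.append(class_mass.index(max(class_mass)))
```

This toy version is transductive: it labels only the unlabeled points present when the plan is computed. Handling new out-of-sample points without recomputing everything is the inductive problem the abstract addresses.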

inductive semi-supervised learning, objective function, optimal transport, (11 more...)

arXiv.org Machine Learning

doi: 10.1007/978-3-030-92307-5_78

2112.07262

Country:

Europe > France (0.05)
Europe > Russia (0.04)
Asia > Russia (0.04)

Genre: Research Report (0.85)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.90)
Information Technology > Artificial Intelligence > Machine Learning > Unsupervised or Indirectly Supervised Learning (0.76)

Sequence-level self-learning with multiple hypotheses

Kumatani, Kenichi, Dimitriadis, Dimitrios, Gaur, Yashesh, Gmyr, Robert, Eskimez, Sefik Emre, Li, Jinyu, Zeng, Michael

arXiv.org Artificial Intelligence | Dec-10-2021

In this work, we develop new self-learning techniques with an attention-based sequence-to-sequence (seq2seq) model for automatic speech recognition (ASR). For untranscribed speech data, the hypothesis from an ASR system must be used as a label. However, the imperfect ASR result makes it difficult for unsupervised learning to consistently improve recognition performance, especially when multiple powerful teacher models are unavailable. In contrast to conventional unsupervised learning approaches, we adopt the multi-task learning (MTL) framework, where the $n$-th best ASR hypothesis is used as the label of each task. The seq2seq network is updated through the MTL framework so as to find the common representation that can cover multiple hypotheses. By doing so, the effect of hard-decision errors can be alleviated. We first demonstrate the effectiveness of our self-learning methods through ASR experiments in an accent adaptation task between US and British English speech. Our experimental results show that our method can reduce the WER on the British speech data from 14.55% to 10.36% compared to the baseline model trained with the US English data only. Moreover, we investigate the effect of our proposed methods in a federated learning scenario.
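
The WER figures quoted above can be put in context with a short calculation. The sketch below assumes the standard word-level edit-distance definition of word error rate; the function name `word_error_rate` and the example sentences are made up for illustration, and only the two percentages come from the abstract.

```python
def word_error_rate(reference, hypothesis):
    """Word-level edit distance divided by the number of reference words."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")
    # prev[j] holds the edit distance between the processed reference
    # prefix and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if r == h else 1)
            deletion = prev[j] + 1       # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr.append(min(substitution, deletion, insertion))
        prev = curr
    return prev[-1] / len(ref)

# Made-up example: one substitution and one deletion over six reference words.
wer = word_error_rate("the cat sat on the mat", "the cat sit on mat")

# Relative WER reduction implied by the figures in the abstract.
baseline_wer, adapted_wer = 14.55, 10.36
relative_reduction = (baseline_wer - adapted_wer) / baseline_wer
```

Under this definition, going from 14.55% to 10.36% corresponds to a relative WER reduction of about 28.8%.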

hypothesis, learning, seed model, (15 more...)

arXiv.org Artificial Intelligence

doi: 10.21437/Interspeech.2020-2020

2112.05826

Country:

North America > United States > Pennsylvania > Philadelphia County > Philadelphia (0.04)
North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
Europe > Greece > Attica > Athens (0.04)
(2 more...)

Genre: Research Report (0.84)

Technology:

Information Technology > Artificial Intelligence > Speech > Speech Recognition (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Unsupervised or Indirectly Supervised Learning (0.71)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.69)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models (0.68)

Edge-Enhanced Dual Discriminator Generative Adversarial Network for Fast MRI with Parallel Imaging Using Multi-view Information

Huang, Jiahao, Ding, Weiping, Lv, Jun, Yang, Jingwen, Dong, Hao, Del Ser, Javier, Xia, Jun, Ren, Tiaojuan, Wong, Stephen, Yang, Guang

arXiv.org Artificial Intelligence | Dec-10-2021

In clinical medicine, magnetic resonance imaging (MRI) is one of the most important tools for diagnosis, triage, prognosis, and treatment planning. However, MRI suffers from an inherently slow data acquisition process because data is collected sequentially in k-space. In recent years, most MRI reconstruction methods proposed in the literature have focused on holistic image reconstruction rather than on enhancing edge information. This work departs from this general trend by elaborating on the enhancement of edge information. Specifically, we introduce a novel parallel imaging coupled dual discriminator generative adversarial network (PIDD-GAN) for fast multi-channel MRI reconstruction by incorporating multi-view information. The dual discriminator design aims to improve the edge information in MRI reconstruction. One discriminator is used for holistic image reconstruction, whereas the other is responsible for enhancing edge information. An improved U-Net with local and global residual learning is proposed for the generator. Frequency channel attention blocks (FCA Blocks) are embedded in the generator to incorporate attention mechanisms. Content loss is introduced to train the generator for better reconstruction quality. We performed comprehensive experiments on the Calgary-Campinas public brain MR dataset and compared our method with state-of-the-art MRI reconstruction methods. Ablation studies of residual learning were conducted on the MICCAI13 dataset to validate the proposed modules. Results show that our PIDD-GAN provides high-quality reconstructed MR images with well-preserved edge information. The time of single-image reconstruction is below 5 ms, which meets the demand for faster processing.

dual discriminator generative adversarial network, multi-view information, parallel imaging, (1 more...)

arXiv.org Artificial Intelligence

2112.05758

Country: North America > Canada > Alberta > Census Division No. 6 > Calgary Metropolitan Region > Calgary (0.24)

Genre: Research Report (1.00)

Industry: Health & Medicine (0.53)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Unsupervised or Indirectly Supervised Learning (0.60)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.60)
