Goto

Collaborating Authors

 Unsupervised or Indirectly Supervised Learning


CCGL: Contrastive Cascade Graph Learning

arXiv.org Artificial Intelligence

Supervised learning, while prevalent for information cascade modeling, often requires abundant labeled data in training, and the trained model is not easy to generalize across tasks and datasets. Semi-supervised learning facilitates unlabeled data for cascade understanding in pre-training. It often learns fine-grained feature-level representations, which can easily result in overfitting for downstream tasks. Recently, contrastive self-supervised learning is designed to alleviate these two fundamental issues in linguistic and visual tasks. However, its direct applicability for cascade modeling, especially graph cascade related tasks, remains underexplored. In this work, we present Contrastive Cascade Graph Learning (CCGL), a novel framework for cascade graph representation learning in a contrastive, self-supervised, and task-agnostic way. In particular, CCGL first designs an effective data augmentation strategy to capture variation and uncertainty. Second, it learns a generic model for graph cascade tasks via self-supervised contrastive pre-training using both unlabeled and labeled data. Third, CCGL learns a task-specific cascade model via fine-tuning using labeled data. Finally, to make the model transferable across datasets and cascade applications, CCGL further enhances the model via distillation using a teacher-student architecture. We demonstrate that CCGL significantly outperforms its supervised and semi-supervised counterpartsfor several downstream tasks.


A Review of Generative Adversarial Networks in Cancer Imaging: New Applications, New Solutions

#artificialintelligence

Despite technological and medical advances, the detection, interpretation, and treatment of cancer based on imaging data continue to pose significant challenges. These include high inter-observer variability, difficulty of small-sized lesion detection, nodule interpretation and malignancy determination, inter- and intra-tumour heterogeneity, class imbalance, segmentation inaccuracies, and treatment effect uncertainty. The recent advancements in Generative Adversarial Networks (GANs) in computer vision as well as in medical imaging may provide a basis for enhanced capabilities in cancer detection and analysis. In this review, we assess the potential of GANs to address a number of key challenges of cancer imaging, including data scarcity and imbalance, domain and dataset shifts, data access and privacy, data annotation and quantification, as well as cancer detection, tumour profiling and treatment planning. We provide a critical appraisal of the existing literature of GANs applied to cancer imagery, together with suggestions on future research directions to address these challenges.


A Review of Generative Adversarial Networks in Cancer Imaging: New Applications, New Solutions

arXiv.org Artificial Intelligence

Despite technological and medical advances, the detection, interpretation, and treatment of cancer based on imaging data continue to pose significant challenges. These include high inter-observer variability, difficulty of small-sized lesion detection, nodule interpretation and malignancy determination, inter- and intra-tumour heterogeneity, class imbalance, segmentation inaccuracies, and treatment effect uncertainty. The recent advancements in Generative Adversarial Networks (GANs) in computer vision as well as in medical imaging may provide a basis for enhanced capabilities in cancer detection and analysis. In this review, we assess the potential of GANs to address a number of key challenges of cancer imaging, including data scarcity and imbalance, domain and dataset shifts, data access and privacy, data annotation and quantification, as well as cancer detection, tumour profiling and treatment planning. We provide a critical appraisal of the existing literature of GANs applied to cancer imagery, together with suggestions on future research directions to address these challenges. We analyse and discuss 163 papers that apply adversarial training techniques in the context of cancer imaging and elaborate their methodologies, advantages and limitations. With this work, we strive to bridge the gap between the needs of the clinical cancer imaging community and the current and prospective research on GANs in the artificial intelligence community.


Machine Learning: Unsupervised Learning

#artificialintelligence

This is the second course in the 3-course Machine Learning Series and is offered at Georgia Tech as CS7641. Taking this class here does not earn Georgia Tech credit. Ever wonder how Netflix can predict what movies you'll like? Or how Amazon knows what you want to buy before you do? The answer can be found in Unsupervised Learning!


Semi-supervised Learning for Marked Temporal Point Processes

arXiv.org Artificial Intelligence

Temporal Point Processes (TPPs) are often used to represent the sequence of events ordered as per the time of occurrence. Owing to their flexible nature, TPPs have been used to model different scenarios and have shown applicability in various real-world applications. While TPPs focus on modeling the event occurrence, Marked Temporal Point Process (MTPP) focuses on modeling the category/class of the event as well (termed as the marker). Research in MTPP has garnered substantial attention over the past few years, with an extensive focus on supervised algorithms. Despite the research focus, limited attention has been given to the challenging problem of developing solutions in semi-supervised settings, where algorithms have access to a mix of labeled and unlabeled data. This research proposes a novel algorithm for Semi-supervised Learning for Marked Temporal Point Processes (SSL-MTPP) applicable in such scenarios. The proposed SSL-MTPP algorithm utilizes a combination of labeled and unlabeled data for learning a robust marker prediction model. The proposed algorithm utilizes an RNN-based Encoder-Decoder module for learning effective representations of the time sequence. The efficacy of the proposed algorithm has been demonstrated via multiple protocols on the Retweet dataset, where the proposed SSL-MTPP demonstrates improved performance in comparison to the traditional supervised learning approach.


Visa on using advanced AI such as unsupervised learning to fight fraud

#artificialintelligence

Join executive leaders at the Data, Analytics, & Intelligent Automation Summit, presented by Accenture. The thing about fraud is that it's constantly changing -- looking at a past attack doesn't guarantee the next attack will look the same or target the same kind of victim -- and defenders have to continuously adapt. Visa utilizes artificial intelligence to analyze all of the transactions that go across the network and track large-scale transactional changes as part of its fraud detection efforts, Melissa McSherry, Visa's senior VP and global head of data, security, and identity products, said at VentureBeat's Transform 2021 virtual conference on Monday. Visa scores all of the transactions that go across the Visa network, which allows the company to define a set of behaviors that would be considered "normal." The team is "constantly" updating the model's view of history and updates the model itself to reflect the data on a fairly regular basis, McSherry said.


Less Labeled Data? Here's the Solution: The SimCLRv2

#artificialintelligence

The complication of learning information from only a few labeled data has troubled machine learning researchers for a long time, especially in applications of computer vision. To tackle this problem, a new research shows promising solution. What's new: A Google Brain Team led by Ting Chen and other fellow colleagues have formulated a simple framework for semi-supervised learning, which utilises very few labeled data and a large amount of unlabeled data to perform classification on the ImageNet database with an accuracy that outperforms the standard supervised training. Key insight: Semi-supervised learning which involves unsupervised pretraining followed by supervised fine-tuning has been copiously used for natural language processing, however, their application in computer vision has shown propitious results only very recently. The researchers carried forward this idea for use in computer vision by developing an improved variant of a previously proposed contrastive learning framework, SimCLR.


Semi-Supervised Object Detection with Adaptive Class-Rebalancing Self-Training

arXiv.org Artificial Intelligence

This study delves into semi-supervised object detection (SSOD) to improve detector performance with additional unlabeled data. State-of-the-art SSOD performance has been achieved recently by self-training, in which training supervision consists of ground truths and pseudo-labels. In current studies, we observe that class imbalance in SSOD severely impedes the effectiveness of self-training. To address the class imbalance, we propose adaptive class-rebalancing self-training (ACRST) with a novel memory module called CropBank. ACRST adaptively rebalances the training data with foreground instances extracted from the CropBank, thereby alleviating the class imbalance. Owing to the high complexity of detection tasks, we observe that both self-training and data-rebalancing suffer from noisy pseudo-labels in SSOD. Therefore, we propose a novel two-stage filtering algorithm to generate accurate pseudo-labels. Our method achieves satisfactory improvements on MS-COCO and VOC benchmarks. When using only 1\% labeled data in MS-COCO, our method achieves 17.02 mAP improvement over supervised baselines, and 5.32 mAP improvement compared with state-of-the-art methods.


Machine Learning Bootcamp: SVM,Kmeans,KNN,LinReg,PCA,DBS

#artificialintelligence

The course covers Machine Learning in exhaustive way. The presentations and hands-on practical are made such that it's made easy. The knowledge gained through this tutorial series can be applied to various real world scenarios. UnSupervised learning does not require to supervise the model. Instead, it allows the model to work on its own to discover patterns and information that was previously undetected. It mainly deals with the unlabeled data.


Microsoft's Project Alexandria parses documents using unsupervised learning

#artificialintelligence

Where does your enterprise stand on the AI adoption curve? Take our AI survey to find out. In 2014, Microsoft launched Project Alexandria, a research effort within its Cambridge research division dedicated to discovering entities -- topics of information -- and their associated properties. Building on the research lab's work in knowledge mining research using probabilistic programming, the aim of Alexandria was to construct a full knowledge base from a set of documents automatically. Alexandria technology powers the recently announced Microsoft Viva Topics, which automatically organizes large amounts of content and expertise in an organization.