Goto

Collaborating Authors

 Transfer Learning


A Survey on Deep Transfer Learning

arXiv.org Machine Learning

As a new classification platform, deep learning has recently received increasing attention from researchers and has been successfully applied to many domains. In some domains, like bioinformatics and robotics, it is very difficult to construct a large-scale well-annotated dataset due to the expense of data acquisition and costly annotation, which limits its development. Transfer learning relaxes the hypothesis that the training data must be independent and identically distributed (i.i.d.) with the test data, which motivates us to use transfer learning to solve the problem of insufficient training data. This survey focuses on reviewing the current researches of transfer learning by using deep neural network and its applications. We defined deep transfer learning, category and review the recent research works based on the techniques used in deep transfer learning.


Smart City Development with Urban Transfer Learning

arXiv.org Artificial Intelligence

The rapid development of big data techniques has offered great opportunities to develop smart city services in public safety, transportation management, city planning, etc. Meanwhile, the smart city development levels of different cities are still unbalanced. For a large of number of cities which just start development, the governments will face a critical cold-start problem, 'how to develop a new smart city service suffering from data scarcity?'. To address this problem, transfer learning is recently leveraged to accelerate the smart city development, which we term the urban transfer learning paradigm. This article investigates the common process of urban transfer learning, aiming to provide city governors and relevant practitioners with guidelines of applying this novel learning paradigm. Our guidelines include common transfer strategies to take, general steps to follow, and case studies to refer. We also summarize a few future research opportunities in urban transfer learning, and expect this article can attract more researchers into this promising area.


Transfer Learning in Tensorflow: Part 2 – Towards Data Science

#artificialintelligence

This is the second part of the Transfer Learning in Tensorflow (VGG19 on CIFAR-10). The first part can be found here. The previous article has given descriptions about'Transfer Learning', 'Choice of Model', 'Choice of the Model Implementation', 'Know How to Create the Model', and'Know About the Last Layer'. In short, the Part 1 is a kind of preparational step before training and prediction. In this article (Part 2), I will go over how to load pre-trained parameters, how to re-scale input images, how to choose batch-size, and then we will look into the result.


Visual Analogies between Atari Games for Studying Transfer Learning in RL

arXiv.org Machine Learning

In this work, we ask the following question: Can visual analogies, learned in an unsupervised way, be used in order to transfer knowledge between pairs of games and even play one game using an agent trained for another game? We attempt to answer this research question by creating visual analogies between a pair of games: a source game and a target game. For example, given a video frame in the target game, we map it to an analogous state in the source game and then attempt to play using a trained policy learned for the source game. We demonstrate convincing visual mapping between four pairs of games (eight mappings), which are used to evaluate three transfer learning approaches.


Transfer Learning Overview -- Episode 1 – Above Intelligent (AI)

#artificialintelligence

The great strength of CNN architectures is their capability to automatically learn a hierarchy of feature detectors in order to solve some task. What it has been observed is that regardless of the architecture, the dataset and the target semantic space (and of course the initialization assuming it's random), the first layers seem to always converge to specific kinds of feature detectors: the Gabor Filters. This is actually a very interesting and important phenomenon as it seems to suggest the Gabor Filters are the most efficient way to start the semantic extraction process from an image. It would mean Gabor Filters block is a sort of "generic building block" which could be used to design NN aimed at solving computer vision problems This is one of the main goal of Transfer Learning: finding "building blocks" which can be composed to build a NN and fine tuned, instead of trained from scratch, on the Dataset Being able to properly understand how the CNN specializes while training is important to get to Transfer Learning: "transferring" the network "capability of solving a problem" which means basically adapting its weights properly, to another similar problem in a Data Efficient Way The data efficiency is in fact one of the most important aspects of transfer learning: it is well known that Supervised Learning is an effective way to make a certain, typically big, NN become able to solve a problem but it scales badly in terms of data as it typically requires A LOT OF supervision signal which, in case of manual annotation, is expensive to collect as it relies on humans to provide it. Furthermore the more the task is difficult, the more the annotations need to be provided by human experts instead of normal people and the former one's time is more expensive than the latter ones


Learning to Learn

#artificialintelligence

A key aspect of intelligence is versatility – the capability of doing many different things. Current AI systems excel at mastering a single skill, such as Go, Jeopardy, or even helicopter aerobatics. But, when you instead ask an AI system to do a variety of seemingly simple problems, it will struggle. A champion Jeopardy program cannot hold a conversation, and an expert helicopter controller for aerobatics cannot navigate in new, simple situations such as locating, navigating to, and hovering over a fire to put it out. In contrast, a human can act and adapt intelligently to a wide variety of new, unseen situations.


Multi-Task Learning with Incomplete Data for Healthcare

arXiv.org Machine Learning

Multi-task learning is a type of transfer learning that trains multiple tasks simultaneously and leverages the shared information between related tasks to improve the generalization performance. However, missing features in the input matrix is a much more difficult problem which needs to be carefully addressed. Removing records with missing values can significantly reduce the sample size, which is impractical for datasets with large percentage of missing values. Popular imputation methods often distort the covariance structure of the data, which causes inaccurate inference. In this paper we propose using plug-in covariance matrix estimators to tackle the challenge of missing features. Specifically, we analyze the plug-in estimators under the framework of robust multi-task learning with LASSO and graph regularization, which captures the relatedness between tasks via graph regularization. We use the Alzheimer's disease progression dataset as an example to show how the proposed framework is effective for prediction and model estimation when missing data is present.


Towards more Reliable Transfer Learning

arXiv.org Machine Learning

Multi-source transfer learning has been proven effective when within-target labeled data is scarce. Previous work focuses primarily on exploiting domain similarities and assumes that source domains are richly or at least comparably labeled. While this strong assumption is never true in practice, this paper relaxes it and addresses challenges related to sources with diverse labeling volume and diverse reliability. The first challenge is combining domain similarity and source reliability by proposing a new transfer learning method that utilizes both source-target similarities and inter-source relationships. The second challenge involves pool-based active learning where the oracle is only available in source domains, resulting in an integrated active transfer learning framework that incorporates distribution matching and uncertainty sampling. Extensive experiments on synthetic and two real-world datasets clearly demonstrate the superiority of our proposed methods over several baselines including state-of-the-art transfer learning methods.


Balanced Distribution Adaptation for Transfer Learning

arXiv.org Machine Learning

Transfer learning has achieved promising results by leveraging knowledge from the source domain to annotate the target domain which has few or none labels. Existing methods often seek to minimize the distribution divergence between domains, such as the marginal distribution, the conditional distribution or both. However, these two distances are often treated equally in existing algorithms, which will result in poor performance in real applications. Moreover, existing methods usually assume that the dataset is balanced, which also limits their performances on imbalanced tasks that are quite common in real problems. To tackle the distribution adaptation problem, in this paper, we propose a novel transfer learning approach, named as Balanced Distribution \underline{A}daptation~(BDA), which can adaptively leverage the importance of the marginal and conditional distribution discrepancies, and several existing methods can be treated as special cases of BDA. Based on BDA, we also propose a novel Weighted Balanced Distribution Adaptation~(W-BDA) algorithm to tackle the class imbalance issue in transfer learning. W-BDA not only considers the distribution adaptation between domains but also adaptively changes the weight of each class. To evaluate the proposed methods, we conduct extensive experiments on several transfer learning tasks, which demonstrate the effectiveness of our proposed algorithms over several state-of-the-art methods.


A probabilistic constrained clustering for transfer learning and image category discovery

arXiv.org Artificial Intelligence

Neural network-based clustering has recently gained popularity, and in particular a constrained clustering formulation has been proposed to perform transfer learning and image category discovery using deep learning. The core idea is to formulate a clustering objective with pairwise constraints that can be used to train a deep clustering network; therefore the cluster assignments and their underlying feature representations are jointly optimized end-to-end. In this work, we provide a novel clustering formulation to address scalability issues of previous work in terms of optimizing deeper networks and larger amounts of categories. The proposed objective directly minimizes the negative log-likelihood of cluster assignment with respect to the pairwise constraints, has no hyper-parameters, and demonstrates improved scalability and performance on both supervised learning and unsupervised transfer learning.