AITopics | Transfer Learning

Collaborating Authors

Transfer Learning

Transfer Learning is the reuse of a pre-trained model on a new problem. (Towards Data Science)

News Overviews Instructional Materials AI-Alerts Classics

3 Pre-Trained Model Series to Use for NLP with Transfer Learning

#artificialintelligenceDec-6-2020, 08:05:40 GMT

Before we start, if you are reading this article, I am sure that we share similar interests and are/will be in similar industries. So let's connect via Linkedin! Please do not hesitate to send a contact request! If you have been trying to build machine learning models with high accuracy; but never tried Transfer Learning, this article will change your life. At least, it did mine!

knowledge, pre-trained model, transfer learning, (11 more...)

#artificialintelligence

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Transfer Learning (0.87)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.32)

Add feedback

SB-MTL: Score-based Meta Transfer-Learning for Cross-Domain Few-Shot Learning

Cai, John, Cai, Bill, Shen, Sheng Mei

arXiv.org Artificial IntelligenceDec-3-2020

While many deep learning methods have seen significant success in tackling the problem of domain adaptation and few-shot learning separately, far fewer methods are able to jointly tackle both problems in Cross-Domain Few-Shot Learning (CD-FSL). This problem is exacerbated under sharp domain shifts that typify common computer vision applications. In this paper, we present a novel, flexible and effective method to address the CD-FSL problem. Our method, called Score-based Meta Transfer-Learning (SB-MTL), combines transfer-learning and meta-learning by using a MAML-optimized feature encoder and a score-based Graph Neural Network. First, we have a feature encoder with specific layers designed to be fine-tuned. To do so, we apply a first-order MAML algorithm to find good initializations. Second, instead of directly taking the classification scores after fine-tuning, we interpret the scores as coordinates by mapping the pre-softmax classification scores onto a metric space. Subsequently, we apply a Graph Neural Network to propagate label information from the support set to the query set in our score-based metric space. We test our model on the Broader Study of Cross-Domain Few-Shot Learning (BSCD-FSL) benchmark, which includes a range of target domains with highly varying dissimilarity to the miniImagenet source domain. We observe significant improvements in accuracy across 5, 20 and 50 shot, and on the four target domains. In terms of average accuracy, our model outperforms previous transfer-learning methods by 5.93% and previous meta-learning methods by 14.28%.

data augmentation, feature encoder, metric space, (13 more...)

arXiv.org Artificial Intelligence

2012.01784

Country: North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)

Genre: Research Report (0.64)

Industry: Health & Medicine > Therapeutic Area (0.47)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Transfer Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Transfer Learning in Action: From ImageNet to Tiny-ImageNet

#artificialintelligenceDec-2-2020, 10:20:06 GMT

Transfer learning is an important topic. As a civilization, we have been passing on the knowledge from one generation to the other, enabling the technological advancement that we enjoy today. It's the edifice that supports most of the state-of-the-art models that are blowing steam, empowering many services that we take for granted. Transfer learning is about having a good starting point for the downstream task we're interested in solving. In this article, we're going to discuss how to piggyback on transfer learning to get a warm start to solve an image classification task. The content of this article is based on "TensorFlow 2 in Action" by Manning and on TensorFlow 2.2.

dataset, tensorflow 2, tiny-imagenet, (13 more...)

#artificialintelligence

Technology: Information Technology > Artificial Intelligence > Machine Learning > Transfer Learning (1.00)

Add feedback

Transfer Learning

#artificialintelligenceNov-30-2020, 11:45:11 GMT

The concept of transfer learning lies in imparting knowledge learned for performing a task to another task that is different but similar. How is Transfer Learning Useful to Me? In the context of humans, transfer learning is crucial to our lives. Let us use the CIFAR-10 dataset that contains 10 categories of images -- airplane, automobile, bird, cat, deer, dog, frog, horse, ship and truck. Our task of interest is to classify every image to its corresponding category.

category, knowledge, transfer learning, (6 more...)

#artificialintelligence

Industry: Transportation (0.88)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Transfer Learning (1.00)

Add feedback

Predicting S&P500 Index direction with Transfer Learning and a Causal Graph as main Input

Romain, Djoumbissie David

arXiv.org Artificial IntelligenceNov-25-2020

We propose a unified multi-tasking framework to represent the complex and uncertain causal process of financial market dynamics, and then to predict the movement of any type of index with an application on the monthly direction of the S&P500 index. our solution is based on three main pillars: (i) the use of transfer learning to share knowledge and feature (representation, learning) between all financial markets, increase the size of the training sample and preserve the stability between training, validation and test sample. (ii) The combination of multidisciplinary knowledge (Financial economics, behavioral finance, market microstructure and portfolio construction theories) to represent a global top-down dynamics of any financial market, through a graph. (iii) The integration of forward looking unstructured data, different types of contexts (long, medium and short term) through latent variables/nodes and then, use a unique VAE network (parameter sharing) to learn simultaneously their distributional representation. We obtain Accuracy, F1-score, and Matthew Correlation of 74.3 %, 67 % and 0.42 above the industry and other benchmark on 12 years test period which include three unstable and difficult sub-period to predict.

layer 10, market index, prediction, (17 more...)

arXiv.org Artificial Intelligence

2011.13113

Country:

North America > United States > New York > New York County > New York City (0.04)
North America > Canada > Quebec > Montreal (0.04)
North America > United States > Illinois > Cook County > Chicago (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Genre: Research Report (0.82)

Industry: Banking & Finance > Trading (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Transfer Learning (0.72)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.35)

Add feedback

Planar 3D Transfer Learning for End to End Unimodal MRI Unbalanced Data Segmentation

#artificialintelligenceNov-24-2020, 20:04:51 GMT

We present a novel approach of 2D to 3D transfer learning based on mapping pre-trained 2D convolutional neural network weights into planar 3D kernels. The method is validated by the proposed planar 3D res-u-net network with encoder transferred from the 2D VGG-16, which is applied for a single-stage unbalanced 3D image data segmentation. In particular, we evaluate the method on the MICCAI 2016 MS lesion segmentation challenge dataset utilizing solely fluid-attenuated inversion recovery (FLAIR) sequence without brain extraction for training and inference to simulate real medical praxis. The planar 3D res-u-net network performed the best both in sensitivity and Dice score amongst end to end methods processing raw MRI scans and achieved comparable Dice score to a state-of-the-art unimodal not end to end approach. Complete source code was released under the open-source license, and this paper complies with the Machine learning reproducibility checklist.

artificial intelligence, machine learning, unimodal mri unbalanced data segmentation, (4 more...)

#artificialintelligence

Genre: Research Report > Promising Solution (0.41)

Industry: Health & Medicine > Diagnostic Medicine > Imaging (0.82)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.98)
Information Technology > Artificial Intelligence > Machine Learning > Transfer Learning (0.67)

Add feedback

Learning to learn Artificial Intelligence

#artificialintelligenceNov-20-2020, 08:45:06 GMT

In traditional Machine Learning domains, we usually take a huge dataset which is specific to a particular task and wish to train a model for regression/classification purposes using this dataset. That's radically far from how humans take advantage of their past experiences to learn very quickly a new task from only a handset of examples. Meta-Learning is essentially learning to learn. Formally, it can be defined as using metadata of an algorithm or a model to understand how automatic learning can become flexible in solving learning problems, hence to improve the performance of existing learning algorithms or to learn (induce) the learning algorithm itself. Each learning algorithm is based on a set of assumptions about the data, which is called its inductive bias.

algorithm, category, learn artificial intelligence, (9 more...)

#artificialintelligence

Industry: Education (0.37)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Transfer Learning (0.63)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.31)

Add feedback

Robotic self-representation improves manipulation skills and transfer learning

Nguyen, Phuong D. H., Eppe, Manfred, Wermter, Stefan

arXiv.org Artificial IntelligenceNov-13-2020

Cognitive science suggests that the self-representation is critical for learning and problem-solving. However, there is a lack of computational methods that relate this claim to cognitively plausible robots and reinforcement learning. In this paper, we bridge this gap by developing a model that learns bidirectional action-effect associations to encode the representations of body schema and the peripersonal space from multisensory information, which is named multimodal BidAL. Through three different robotic experiments, we demonstrate that this approach significantly stabilizes the learning-based problem-solving under noisy conditions and that it improves transfer learning of robotic manipulation skills.

international conference, representation, robot, (15 more...)

arXiv.org Artificial Intelligence

2011.06985

Country:

Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)
Europe > Germany > Hamburg (0.04)

Genre: Research Report (0.50)

Industry: Health & Medicine (0.47)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.70)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Transfer Learning (0.61)

Add feedback

Deep Learning :Adv. Computer Vision (object detection+more!)

#artificialintelligenceNov-12-2020, 10:36:06 GMT

Preview this course - GET COUPON CODE Latest update: I will show you both how to use a pretrained model and how to train one yourself with a custom dataset on Google Colab. This course is a complete guide for setting up TensorFlow object detection api, Transfer learning and a lot more I think what you'll find is that, this course is so entirely different from the previous one, you will be impressed at just how much material we have to cover. Here is the details about the project. Here we will star from colab understating because that will help to use free GPU provided by google to train up our model. We're going to bridge the gap between the basic CNN architecture you already know and love, to modern, novel architectures such as ResNet, and Inception.

algorithm, computer vision, preferably, (13 more...)

#artificialintelligence

Genre: Instructional Material > Course Syllabus & Notes (1.00)

Industry: Education > Educational Setting > Online (0.32)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.72)
Information Technology > Artificial Intelligence > Machine Learning > Transfer Learning (0.64)

Add feedback

FaceLeaks: Inference Attacks against Transfer Learning Models via Black-box Queries

Liew, Seng Pei, Takahashi, Tsubasa

arXiv.org Machine LearningOct-26-2020

Transfer learning is a useful machine learning framework that allows one to build task-specific models (student models) without significantly incurring training costs using a single powerful model (teacher model) pre-trained with a large amount of data. The teacher model may contain private data, or interact with private inputs. We investigate if one can leak or infer such private information without interacting with the teacher model directly. We describe such inference attacks in the context of face recognition, an application of transfer learning that is highly sensitive to personal privacy. Under black-box and realistic settings, we show that existing inference techniques are ineffective, as interacting with individual training instances through the student models does not reveal information about the teacher. We then propose novel strategies to infer from aggregate-level information. Consequently, membership inference attacks on the teacher model are shown to be possible, even when the adversary has access only to the student models. We further demonstrate that sensitive attributes can be inferred, even in the case where the adversary has limited auxiliary information. Finally, defensive strategies are discussed and evaluated. Our extensive study indicates that information leakage is a real privacy threat to the transfer learning framework widely used in real-life situations.

artificial intelligence, inductive learning, machine learning, (20 more...)

arXiv.org Machine Learning

2010.14023

Country:

North America > United States > California > San Francisco County > San Francisco (0.14)
North America > United States > California > Los Angeles County > Los Angeles (0.14)
Europe > Austria > Vienna (0.14)
(18 more...)

Genre: Research Report > New Finding (0.46)

Industry:

Law (1.00)
Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Artificial Intelligence > Vision > Face Recognition (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Transfer Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
(3 more...)

Add feedback