AITopics | Transfer Learning

Collaborating Authors

Transfer Learning

Transfer Learning is the reuse of a pre-trained model on a new problem. (Towards Data Science)

News Overviews Instructional Materials AI-Alerts Classics

Transfer Learning of Artist Group Factors to Musical Genre Classification

Kim, Jaehun, Won, Minz, Serra, Xavier, Liem, Cynthia C. S.

arXiv.org Machine LearningMay-5-2018

The automated recognition of music genres from audio information is a challenging problem, as genre labels are subjective and noisy. Artist labels are less subjective and less noisy, while certain artists may relate more strongly to certain genres. At the same time, at prediction time, it is not guaranteed that artist labels are available for a given audio segment. Therefore, in this work, we propose to apply the transfer learning framework, learning artist-related information which will be used at inference time for genre classification. We consider different types of artist-related information, expressed through artist group factors, which will allow for more efficient learning and stronger robustness to potential label noise. Furthermore, we investigate how to achieve the highest validation accuracy on the given FMA dataset, by experimenting with various kinds of transfer methods, including single-task transfer, multi-task transfer and finally multi-task learning.

artificial intelligence, artist, machine learning, (16 more...)

arXiv.org Machine Learning

doi: 10.1145/3184558.3191823

1805.02043

Country:

Europe > Netherlands (0.15)
Europe > Spain (0.14)
Europe > France (0.14)

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Transfer Learning (0.86)

Add feedback

Taskonomy: Disentangling Task Transfer Learning

Zamir, Amir, Sax, Alexander, Shen, William, Guibas, Leonidas, Malik, Jitendra, Savarese, Silvio

arXiv.org Artificial IntelligenceApr-23-2018

Do visual tasks have a relationship, or are they unrelated? For instance, could having surface normals simplify estimating the depth of an image? Intuition answers these questions positively, implying existence of a structure among visual tasks. Knowing this structure has notable values; it is the concept underlying transfer learning and provides a principled way for identifying redundancies across tasks, e.g., to seamlessly reuse supervision among related tasks or solve many tasks in one system without piling up the complexity. We proposes a fully computational approach for modeling the structure of space of visual tasks. This is done via finding (first and higher-order) transfer learning dependencies across a dictionary of twenty six 2D, 2.5D, 3D, and semantic tasks in a latent space. The product is a computational taxonomic map for task transfer learning. We study the consequences of this structure, e.g. nontrivial emerged relationships, and exploit them to reduce the demand for labeled data. For example, we show that the total number of labeled datapoints needed for solving a set of 10 tasks can be reduced by roughly 2/3 (compared to training independently) while keeping the performance nearly the same. We provide a set of tools for computing and probing this taxonomical structure including a solver that users can employ to devise efficient supervision policies for their use cases.

artificial intelligence, machine learning, segm, (17 more...)

arXiv.org Artificial Intelligence

1804.08328

Country:

Asia > China (0.04)
North America > United States > New York (0.04)
North America > United States > California > Alameda County > Berkeley (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Research Report (1.00)

Industry:

Education (0.46)
Information Technology (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Transfer Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
(2 more...)

Add feedback

Learning to Learn Deep Learning E-Learning

#artificialintelligenceApr-20-2018, 12:41:33 GMT

Welcome to this e-learning course developed and produced by Dr Neil Thompson and hosted by Simpliv. Neil is a well-published author in the people professions field, an international conference speaker and sought-after consultant.The overall aim of this course is to help you broaden and deepen your understanding of what is involved in learning, what can prevent it from happening and what you can do to maximize your learning. Learning is part of everyday life and something we are very familiar with. But, that does not mean that we are making the most of the learning opportunities we encounter. Indeed, it is fair to say that, despite the emphasis on the importance of learning, relatively few people achieve optimal learning.

artificial intelligence, learn deep learning e-learning, machine learning, (1 more...)

#artificialintelligence

Genre: Instructional Material > Course Syllabus & Notes (1.00)

Industry:

Education > Educational Technology > Educational Software > Computer Based Training (0.63)
Education > Educational Setting > Online (0.63)

Technology:

Information Technology > Enterprise Applications > Human Resources > Learning Management (0.63)
Information Technology > Artificial Intelligence > Machine Learning > Transfer Learning (0.40)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.40)

Add feedback

Demis Hassabis: Transfer learning is key to AGI

#artificialintelligenceApr-18-2018, 21:06:16 GMT

I think transfer learning is the key to general intelligence. And I think the key to doing transfer learning will be the acquisition of conceptual knowledge that is abstracted away from perceptual details of where you learned it from.

agi, demis hassabis

#artificialintelligence

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Transfer Learning (1.00)
Information Technology > Artificial Intelligence > Cognitive Science (1.00)

Add feedback

[D]What makes "Meta-SGD: Learning to Learn Quickly for Few-Shot Learning" to work so good? • r/MachineLearning

@machinelearnbotApr-12-2018, 08:57:57 GMT

I'm interested in Few-Shot-Learning, so this paper is really intriguing for me either. I think that I still don't get paper (I'm not familiar with Meta-Learning), but learning algorithm look completely different than in normal supervised learning. So for weight update they use test set (which could be also a part of train set, not sure of proper name, but it would be better if we call it train-test and second one train-train). Do you see the difference? Why they use such idea?

few-shot learning, learning, machinelearning, (5 more...)

@machinelearnbot

Industry: Media > News (0.40)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Transfer Learning (0.40)

Add feedback

Transfer Learning for Traffic Speed Prediction: A Preliminary Study

Lin, Bill Y. (Shanghai Jiao Tong University) | Xu, Frank F. (Shanghai Jiao Tong University) | Liao, Eve Q. (Shanghai Jiao Tong University) | Zhu, Kenny Q. (Shanghai Jiao Tong University)

AAAI ConferencesApr-6-2018

Traffic speed prediction can benefit a wide range of IoT applications in intelligent transportation and smart city. Recent supervised machine learning approaches heavily leverage vast amount of historical time-series data. Consequently, they degrade dramatically in the areas where collecting a large traffic data is not quite feasible. With the aim of predicting the traffic speed of such urban areas, we propose a transfer learning framework that exploits historical data of some other data abundant areas by utilizing various spatio-temporal semantic features. Experimental results show that classic regression models and our proposed kernel regression model can achieve competitive performance comparing to baseline methods that heavily rely on the historical data of target areas.

artificial intelligence, machine learning, traffic speed prediction, (2 more...)

AAAI Conferences

Workshops at the Thirty-Second AAAI Conference on Artificial Intelligence

Technology: Information Technology > Artificial Intelligence > Machine Learning > Transfer Learning (0.60)

Add feedback

Incremental Learning-to-Learn with Statistical Guarantees

Denevi, Giulia, Ciliberto, Carlo, Stamos, Dimitris, Pontil, Massimiliano

arXiv.org Machine LearningMar-21-2018

In learning-to-learn the goal is to infer a learning algorithm that works well on a class of tasks sampled from an unknown meta distribution. In contrast to previous work on batch learning-to-learn, we consider a scenario where tasks are presented sequentially and the algorithm needs to adapt incrementally to improve its performance on future tasks. Key to this setting is for the algorithm to rapidly incorporate new observations into the model as they arrive, without keeping them in memory. We focus on the case where the underlying algorithm is ridge regression parameterized by a positive semidefinite matrix. We propose to learn this matrix by applying a stochastic strategy to minimize the empirical error incurred by ridge regression on future tasks sampled from the meta distribution. We study the statistical properties of the proposed algorithm and prove non-asymptotic bounds on its excess transfer risk, that is, the generalization performance on new tasks from the same meta distribution. We compare our online learning-to-learn approach with a state of the art batch method, both theoretically and empirically.

algorithm, artificial intelligence, machine learning, (14 more...)

arXiv.org Machine Learning

1803.08089

Country:

North America > United States (0.14)
Europe > Italy (0.04)
Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)
(3 more...)

Genre: Research Report (0.50)

Industry: Education > Educational Setting > Online (0.48)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Transfer Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.93)

Add feedback

Pseudo-task Augmentation: From Deep Multitask Learning to Intratask Sharing---and Back

Meyerson, Elliot, Miikkulainen, Risto

arXiv.org Machine LearningMar-11-2018

Deep multitask learning boosts performance by sharing learned structure across related tasks. This paper adapts ideas from deep multitask learning to the setting where only a single task is available. The method is formalized as pseudo-task augmentation, in which models are trained with multiple decoders for each task. Pseudo-tasks simulate the effect of training towards closely-related tasks drawn from the same universe. In a suite of experiments, pseudo-task augmentation is shown to improve performance on single-task learning problems. When combined with multitask learning, further improvements are achieved, including state-of-the-art performance on the CelebA dataset, showing that pseudo-task augmentation and multitask learning have complementary value. All in all, pseudo-task augmentation is a broadly applicable and efficient way to boost performance in deep learning systems.

artificial intelligence, decoder, machine learning, (15 more...)

arXiv.org Machine Learning

1803.04062

Country:

North America > United States > Texas > Travis County > Austin (0.14)
North America > United States > California > San Francisco County > San Francisco (0.14)

Genre: Research Report (1.00)

Industry: Education (0.34)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Transfer Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Learning and Transferring IDs Representation in E-commerce

Zhao, Kui, Li, Yuechuan, Shuai, Zhaoqian, Yang, Cheng

arXiv.org Machine LearningFeb-26-2018

Many machine intelligence techniques are developed in E-commerce and one of the most essential components is the representation of IDs, including user ID, item ID, product ID, store ID, brand ID, category ID etc. The classical encoding based methods (like one-hot encoding) are inefficient in that it suffers sparsity problems due to its high dimension, and it cannot reflect the relationships among IDs, either homogeneous or heterogeneous ones. In this paper, we propose an embedding based framework to learn and transfer the representation of IDs. As the the implicit feedbacks of users, a tremendous amount of item ID sequences can be easily collected from the interactive sessions. By jointly using these informative sequences and the structural connections among IDs, all types of IDs can be embedded into one low-dimensional semantic space. Subsequently, the learned representations are utilized and transferred in four scenarios: (i) measuring the similarity between items, (ii) transferring from seen items to unseen items, (iii) transferring across different domains, (iv) transferring across different tasks. We deploy and evaluate the proposed approach in Hema App and the results validate its effectiveness.

artificial intelligence, machine learning, natural language, (20 more...)

arXiv.org Machine Learning

1712.08289

Country: Asia > China > Zhejiang Province > Hangzhou (0.04)

Genre: Research Report (0.64)

Industry: Information Technology > Services > e-Commerce Services (0.86)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
(2 more...)

Add feedback

Not to Cry Wolf: Distantly Supervised Multitask Learning in Critical Care

Schwab, Patrick, Keller, Emanuela, Muroi, Carl, Mack, David J., Strässle, Christian, Karlen, Walter

arXiv.org Artificial IntelligenceFeb-14-2018

Patients in the intensive care unit (ICU) require constant and close supervision. To assist clinical staff in this task, hospitals use monitoring systems that trigger audiovisual alarms if their algorithms indicate that a patient's condition may be worsening. However, current monitoring systems are extremely sensitive to movement artefacts and technical errors. As a result, they typically trigger hundreds to thousands of false alarms per patient per day - drowning the important alarms in noise and adding to the exhaustion of clinical staff. In this setting, data is abundantly available, but obtaining trustworthy annotations by experts is laborious and expensive. We frame the problem of false alarm reduction from multivariate time series as a machine-learning task and address it with a novel multitask network architecture that utilises distant supervision through multiple related auxiliary tasks in order to reduce the number of expensive labels required for training. We show that our approach leads to significant improvements over several state-of-the-art baselines on real-world ICU data and provide new insights on the importance of task selection and architectural choices in distantly supervised multitask learning.

artificial intelligence, auxiliary task, machine learning, (15 more...)

arXiv.org Artificial Intelligence

1802.05027

Country:

Europe > Switzerland > Zürich > Zürich (0.15)
North America > United States (0.14)

Genre: Research Report > Promising Solution (0.46)

Industry:

Health & Medicine > Therapeutic Area > Cardiology/Vascular Diseases (1.00)
Health & Medicine > Diagnostic Medicine (1.00)
Health & Medicine > Health Care Technology (0.93)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Transfer Learning (0.63)

Add feedback