AITopics | Transfer Learning

Collaborating Authors

Transfer Learning

Transfer Learning is the reuse of a pre-trained model on a new problem. (Towards Data Science)

News Overviews Instructional Materials AI-Alerts Classics

Transfer Learning on Heterogeneous Feature Spaces for Treatment Effects Estimation

arXiv.org Artificial IntelligenceOct-8-2022

Consider the problem of improving the estimation of conditional average treatment effects (CATE) for a target domain of interest by leveraging related information from a source domain with a different feature space. This heterogeneous transfer learning problem for CATE estimation is ubiquitous in areas such as healthcare where we may wish to evaluate the effectiveness of a treatment for a new patient population for which different clinical covariates and limited data are available. In this paper, we address this problem by introducing several building blocks that use representation learning to handle the heterogeneous feature spaces and a flexible multi-task architecture with shared and private layers to transfer information between potential outcome functions across domains. Then, we show how these building blocks can be used to recover transfer learning equivalents of the standard CATE learners. On a new semi-synthetic data simulation benchmark for heterogeneous transfer learning we not only demonstrate performance improvements of our heterogeneous transfer causal effect learners across datasets, but also provide insights into the differences between these learners from a transfer perspective.

artificial intelligence, latexit sha1, machine learning, (19 more...)

arXiv.org Artificial Intelligence

2210.06183

Country:

North America > United States > California > Los Angeles County > Los Angeles (0.14)
Europe > United Kingdom > England > Oxfordshire > Oxford (0.14)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.14)
Europe > United Kingdom > England > Greater London > London (0.04)

Genre:

Research Report > Experimental Study (0.67)
Research Report > Strength High (0.46)

Industry: Health & Medicine > Therapeutic Area (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Transfer Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.68)

Add feedback

The Power of Transfer Learning in Agricultural Applications: AgriNet

Sahili, Zahraa Al, Awad, Mariette

arXiv.org Artificial IntelligenceOct-6-2022

Advances in deep learning and transfer learning have paved the way for various automation classification tasks in agriculture, including plant diseases, pests, weeds, and plant species detection. However, agriculture automation still faces various challenges, such as the limited size of datasets and the absence of plant-domain-specific pretrained models. Domain specific pretrained models have shown state of art performance in various computer vision tasks including face recognition and medical imaging diagnosis. In this paper, we propose AgriNet dataset, a collection of 160k agricultural images from more than 19 geographical locations, several images captioning devices, and more than 423 classes of plant species and diseases. We also introduce AgriNet models, a set of pretrained models on five ImageNet architectures: VGG16, VGG19, Inception-v3, InceptionResNet-v2, and Xception. AgriNet-VGG19 achieved the highest classification accuracy of 94 % and the highest F1-score of 92%. Additionally, all proposed models were found to accurately classify the 423 classes of plant species, diseases, pests, and weeds with a minimum accuracy of 87% for the Inception-v3 model.Finally, experiments to evaluate of superiority of AgriNet models compared to ImageNet models were conducted on two external datasets: pest and plant diseases dataset from Bangladesh and a plant diseases dataset from Kashmir.

accuracy, artificial intelligence, machine learning, (17 more...)

arXiv.org Artificial Intelligence

2207.03881

Country:

Asia > Bangladesh (0.24)
North America > United States (0.04)
Europe > Denmark (0.04)
(9 more...)

Genre: Research Report (0.50)

Industry: Food & Agriculture > Agriculture (1.00)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Transfer Learning (0.92)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.69)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.48)

Add feedback

Transfer Learning Framework for Low-Resource Text-to-Speech using a Large-Scale Unlabeled Speech Corpus

Kim, Minchan, Jeong, Myeonghun, Choi, Byoung Jin, Ahn, Sunghwan, Lee, Joun Yeop, Kim, Nam Soo

arXiv.org Artificial IntelligenceOct-6-2022

Training a text-to-speech (TTS) model requires a large scale text labeled speech corpus, which is troublesome to collect. In this paper, we propose a transfer learning framework for TTS that utilizes a large amount of unlabeled speech dataset for pre-training. By leveraging wav2vec2.0 representation, unlabeled speech can highly improve performance, especially in the lack of labeled speech. We also extend the proposed method to zero-shot multi-speaker TTS (ZS-TTS). The experimental results verify the effectiveness of the proposed method in terms of naturalness, intelligibility, and speaker generalization. We highlight that the single speaker TTS model fine-tuned on the only 10 minutes of labeled dataset outperforms the other baselines, and the ZS-TTS model fine-tuned on the only 30 minutes of single speaker dataset can generate the voice of the arbitrary speaker, by pre-training on unlabeled multi-speaker speech corpus.

artificial intelligence, machine learning, preprint arxiv, (16 more...)

arXiv.org Artificial Intelligence

doi: 10.21437/Interspeech.2022-225

2203.15447

Country: Asia > South Korea > Seoul > Seoul (0.04)

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Speech > Speech Synthesis (0.73)
Information Technology > Artificial Intelligence > Vision > Optical Character Recognition (0.63)
Information Technology > Artificial Intelligence > Machine Learning > Transfer Learning (0.62)

Add feedback

Transfer Learning for Time Series Forecasting

#artificialintelligenceOct-5-2022, 05:30:17 GMT

In this article, we will see how transfer learning can be applied to time series forecasting, and how forecasting models can be trained once on a diverse time series dataset and used later on to obtain forecasts on different datasets without training. We will use the open-source Darts library to do all this with in a few lines of code. A self-contained notebook containing everything needed to reproduce the results is available here. Time series forecasting has numerous applications in supply chain, energy, agriculture, control, IT operations, finance and other domains. For a long time, the best-performing approaches were relatively sophisticated statistical methods such as Exponential Smoothing or ARIMA. However, since recently, machine learning and deep learning have started to outperform these classical approaches on a number of forecasting tasks and competitions.

dataset, forecast, time sery, (11 more...)

#artificialintelligence

Country: North America > Trinidad and Tobago > Trinidad > Arima > Arima (0.27)

Industry:

Transportation > Passenger (0.32)
Transportation > Air (0.30)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Transfer Learning (0.71)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.52)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.51)

Add feedback

Star-Graph Multimodal Matching Component Analysis for Data Fusion and Transfer Learning

Lorenzo, Nick

arXiv.org Artificial IntelligenceOct-5-2022

The matching component analysis (MCA) technique for transfer learning [1] finds two maps - one from each of two data domains to a lower-dimensional, common domain - using only a small number of matched data pairs, where each matched data pair is comprised of one point from each data domain. These maps minimize the expected distance between mapped data pairs within the common domain, subject to an identity matrix covariance constraint and an affine linear structure. Learning techniques can then be applied to matched data points after they are mapped to the common domain, where each such point is encoded with information from both data domains via its respective optimal affine linear transformation. In [2], the covariance-generalized MCA (CGMCA) technique was developed in order to allow for the encoding of additional statistical information into the MCA maps. This was done by generalizing the identity matrix covariance constraint of MCA to accommodate any covariance matrix (compare Figures 1a and 1b). We are interested in extending the application space of CGMCA to accommodate three or more data domains simultaneously.

artificial intelligence, machine learning, modality, (18 more...)

arXiv.org Artificial Intelligence

2210.0259

Country:

North America > United States > Ohio > Montgomery County > Dayton (0.04)
North America > United States > New York > Rensselaer County > Troy (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Genre: Research Report (0.82)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Transfer Learning (0.61)
Information Technology > Artificial Intelligence > Representation & Reasoning > Information Fusion (0.41)

Add feedback

Neural Style Transfer -- A practice in transfer learning

#artificialintelligenceOct-3-2022, 11:40:21 GMT

The picture above is photoed by me in London. There is always an idea that pops up in my mind when I looked at it -- what if I make this picture into an oil painting? It must be a masterpiece! Thanks to Gatys et al. their article helped me dive into the beauty of Deep learning, and this whole article is based on their paper. Before we begin, let's talk something interesting: What are deep convnets really learning?

cost function, neural style transfer, output layer, (9 more...)

#artificialintelligence

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.42)
Information Technology > Artificial Intelligence > Vision > Image Understanding (0.40)
Information Technology > Artificial Intelligence > Machine Learning > Transfer Learning (0.40)

Add feedback

Feature-based Transfer Learning vs Fine Tuning?

#artificialintelligenceOct-3-2022, 04:35:16 GMT

You can follow me on Linkedin! Note: There are different angles to answer an interview question. The author of this newsletter does not try to find a reference that answers a question exhaustively. Rather, the author would like to share some quick insights and help the readers to think, practice and do further research as necessary.

deeplearning, newsletter, transfer learning vs fine tuning

#artificialintelligence

Technology:

Information Technology > Communications > Social Media (0.74)
Information Technology > Artificial Intelligence > Machine Learning > Transfer Learning (0.71)

Add feedback

Transfer Learning with Pre-trained Conditional Generative Models

Yamaguchi, Shin'ya, Kanai, Sekitoshi, Kumagai, Atsutoshi, Chijiwa, Daiki, Kashima, Hisashi

arXiv.org Artificial IntelligenceSep-29-2022

Transfer learning is crucial in training deep neural networks on new target tasks. Current transfer learning methods always assume at least one of (i) source and target task label spaces overlap, (ii) source datasets are available, and (iii) target network architectures are consistent with source ones. However, holding these assumptions is difficult in practical settings because the target task rarely has the same labels as the source task, the source dataset access is restricted due to storage costs and privacy, and the target architecture is often specialized to each task. To transfer source knowledge without these assumptions, we propose a transfer learning method that uses deep generative models and is composed of the following two stages: pseudo pre-training (PP) and pseudo semi-supervised learning (P-SSL). PP trains a target architecture with an artificial dataset synthesized by using conditional source generative models. P-SSL applies SSL algorithms to labeled target data and unlabeled pseudo samples, which are generated by cascading the source classifier and generative models to condition them with target samples. Our experimental results indicate that our method can outperform the baselines of scratch training and knowledge distillation. For training deep neural networks on new tasks, transfer learning is essential, which leverages the knowledge of related (source) tasks to the new (target) tasks via the joint-or pre-training of source models. There are many transfer learning methods for deep models under various conditions (Pan & Yang, 2010; Wang & Deng, 2018). For instance, domain adaptation leverages source knowledge to the target task by minimizing the domain gaps (Ganin et al., 2016), and fine-tuning uses the pre-trained weights on source tasks as the initial weights of the target models (Yosinski et al., 2014).

artificial intelligence, deep learning, machine learning, (19 more...)

arXiv.org Artificial Intelligence

2204.12833

Country:

Europe > United Kingdom > England > Staffordshire (0.04)
North America > United States > California > Santa Clara County > Palo Alto (0.04)
Oceania > New Zealand > South Island > Marlborough District > Blenheim (0.04)
(8 more...)

Genre: Research Report > New Finding (0.46)

Industry:

Transportation > Passenger (1.00)
Transportation > Ground > Road (1.00)
Automobiles & Trucks (1.00)
(3 more...)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Transfer Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.34)

Add feedback

An Efficient Multitask Learning Architecture for Affective Vocal Burst Analysis

Hallmen, Tobias, Mertes, Silvan, Schiller, Dominik, André, Elisabeth

arXiv.org Artificial IntelligenceSep-28-2022

Affective speech analysis is an ongoing topic of research. A relatively new problem in this field is the analysis of vocal bursts, which are nonverbal vocalisations such as laughs or sighs. Current state-of-the-art approaches to address affective vocal burst analysis are mostly based on wav2vec2 or HuBERT features. In this paper, we investigate the use of the wav2vec successor data2vec in combination with a multitask learning pipeline to tackle different analysis problems at once. To assess the performance of our efficient multitask learning architecture, we participate in the 2022 ACII Affective Vocal Burst Challenge, showing that our approach substantially outperforms the baseline established there in three different subtasks.

artificial intelligence, machine learning, vocal burst, (14 more...)

arXiv.org Artificial Intelligence

2209.13914

Country:

Europe > Germany (0.05)
Europe > Portugal (0.04)
Asia > India > Telangana > Hyderabad (0.04)
Africa > Guinea-Bissau (0.04)

Genre:

Research Report > New Finding (0.46)
Research Report > Promising Solution (0.34)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Transfer Learning (0.81)
Information Technology > Artificial Intelligence > Speech > Speech Recognition (0.68)
Information Technology > Artificial Intelligence > Systems & Languages > Problem-Independent Architectures (0.61)

Add feedback

SHiFT: An Efficient, Flexible Search Engine for Transfer Learning

Renggli, Cedric, Yao, Xiaozhe, Kolar, Luka, Rimanic, Luka, Klimovic, Ana, Zhang, Ce

arXiv.org Artificial IntelligenceSep-28-2022

Transfer learning can be seen as a data- and compute-efficient alternative to training models from scratch. The emergence of rich model repositories, such as TensorFlow Hub, enables practitioners and researchers to unleash the potential of these models across a wide range of downstream tasks. As these repositories keep growing exponentially, efficiently selecting a good model for the task at hand becomes paramount. By carefully comparing various selection and search strategies, we realize that no single method outperforms the others, and hybrid or mixed strategies can be beneficial. Therefore, we propose SHiFT, the first downstream task-aware, flexible, and efficient model search engine for transfer learning. These properties are enabled by a custom query language SHiFT-QL together with a cost-based decision maker, which we empirically validate. Motivated by the iterative nature of machine learning development, we further support efficient incremental executions of our queries, which requires a careful implementation when jointly used with our optimizations.

information retrieval, machine learning, natural language, (17 more...)

arXiv.org Artificial Intelligence

2204.01457

Country:

Europe > Switzerland > Zürich > Zürich (0.04)
Europe > Romania > Sud - Muntenia Development Region > Giurgiu County > Giurgiu (0.04)

Genre: Research Report > New Finding (0.45)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Transfer Learning (0.83)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

Add feedback