AITopics | Transfer Learning

Transfer Learning

Transfer Learning is the reuse of a pre-trained model on a new problem. (Towards Data Science)

Grounding Foundation Models through Federated Transfer Learning: A General Framework

Kang, Yan, Fan, Tao, Gu, Hanlin, Zhang, Xiaojin, Fan, Lixin, Yang, Qiang

arXiv.org Artificial Intelligence, Feb-6-2024

Foundation Models (FMs) such as GPT-4 encoded with vast knowledge and powerful emergent abilities have achieved remarkable success in various natural language processing and computer vision tasks. Grounding FMs by adapting them to domain-specific tasks or augmenting them with domain-specific knowledge enables us to exploit the full potential of FMs. However, grounding FMs faces several challenges, stemming primarily from constrained computing resources, data privacy, model heterogeneity, and model ownership. Federated Transfer Learning (FTL), the combination of federated learning and transfer learning, provides promising solutions to address these challenges. In recent years, the need for grounding FMs leveraging FTL, coined FTL-FM, has arisen strongly in both academia and industry. Motivated by the strong growth in FTL-FM research and the potential impact of FTL-FM on industrial applications, we propose an FTL-FM framework that formulates problems of grounding FMs in the federated learning setting, construct a detailed taxonomy based on the FTL-FM framework to categorize state-of-the-art FTL-FM works, and comprehensively overview FTL-FM works based on the proposed taxonomy. We also establish correspondences between FTL-FM and conventional phases of adapting FM so that FM practitioners can align their research works with FTL-FM. In addition, we overview advanced efficiency-improving and privacy-preserving techniques because efficiency and privacy are critical concerns in FTL-FM. Last, we discuss opportunities and future research directions of FTL-FM.

arxiv preprint arxiv, knowledge, server, (12 more...)

arXiv.org Artificial Intelligence

2311.17431

Country:

Asia > China > Hong Kong (0.04)
North America > Canada > Ontario > Toronto (0.04)
Asia > Myanmar > Tanintharyi Region > Dawei (0.04)
(6 more...)

Genre:

Overview (1.00)
Research Report > Promising Solution (0.87)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Transfer Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Transfer Learning for the Prediction of Entity Modifiers in Clinical Text: Application to Opioid Use Disorder Case Detection

Almudaifer, Abdullateef I., Covington, Whitney, Hairston, JaMor, Deitch, Zachary, Anand, Ankit, Carroll, Caleb M., Crisan, Estera, Bradford, William, Walter, Lauren, Ellen, Eaton, Feldman, Sue S., Osborne, John D.

arXiv.org Artificial Intelligence, Feb-5-2024

Background: The semantics of entities extracted from a clinical text can be dramatically altered by modifiers, including entity negation, uncertainty, conditionality, severity, and subject. Existing models for determining modifiers of clinical entities involve regular expressions or feature weights that are trained independently for each modifier. Methods: We develop and evaluate a multi-task transformer architecture design where modifiers are learned and predicted jointly using the publicly available SemEval 2015 Task 14 corpus and a new Opioid Use Disorder (OUD) data set that contains modifiers shared with SemEval as well as novel modifiers specific to OUD. We evaluate the effectiveness of our multi-task learning approach versus previously published systems and assess the feasibility of transfer learning for clinical entity modifiers when only a portion of clinical modifiers are shared. Results: Our approach achieved state-of-the-art results on the ShARe corpus from SemEval 2015 Task 14, showing an increase of 1.1% on weighted accuracy, 1.7% on unweighted accuracy, and 10% on micro F1 scores. Conclusions: We show that learned weights from our shared model can be effectively transferred to a new partially matched data set, validating the use of transfer learning for clinical text modifiers.

corpus, modifier, share corpus, (17 more...)

arXiv.org Artificial Intelligence

2401.15222

Country:

North America > United States > Alabama (0.05)
North America > United States > Colorado > Denver County > Denver (0.04)
Europe > Ireland > Leinster > County Dublin > Dublin (0.04)
(2 more...)

Genre: Research Report (1.00)

Industry:

Health & Medicine > Therapeutic Area > Psychiatry/Psychology > Addiction Disorder (1.00)
Health & Medicine > Pharmaceuticals & Biotechnology (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Transfer Learning (0.92)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.88)

Survival and grade of the glioma prediction using transfer learning

Rubio, Santiago Valbuena, García-Ordás, María Teresa, Olivera, Oscar García-Olalla, Alaiz-Moretón, Héctor, González-Alonso, Maria-Inmaculada, Benítez-Andrades, José Alberto

arXiv.org Artificial Intelligence, Feb-4-2024

Glioblastoma is a highly malignant brain tumor with a life expectancy of only 3 to 6 months without treatment. Detecting and predicting its survival and grade accurately are crucial. This study introduces a novel approach using transfer learning techniques. Various pre-trained networks, including EfficientNet, ResNet, VGG16, and Inception, were tested through exhaustive optimization to identify the most suitable architecture. Transfer learning was applied to fine-tune these models on a glioblastoma image dataset, aiming to achieve two objectives: survival and tumor grade prediction. The experimental results show 65% accuracy in survival prediction, classifying patients into short, medium, or long survival categories. Additionally, the prediction of tumor grade achieved an accuracy of 97%, accurately differentiating low-grade gliomas (LGG) and high-grade gliomas (HGG). The success of the approach is attributed to the effectiveness of transfer learning, surpassing the current state-of-the-art methods. In conclusion, this study presents a promising method for predicting the survival and grade of glioblastoma. Transfer learning demonstrates its potential in enhancing prediction models, particularly in scenarios with limited large datasets. These findings hold promise for improving diagnostic and treatment approaches for glioblastoma patients.

classification, doi 10, valbuena rubio, (15 more...)

arXiv.org Artificial Intelligence

doi: 10.7717/peerj-cs.1723

2402.03384

Country:

Europe > Spain > Castile and León > León Province > León (0.04)
South America > Chile > Santiago Metropolitan Region > Santiago Province > Santiago (0.04)
Asia > Middle East > Jordan (0.04)
North America > United States (0.04)

Genre:

Research Report > Promising Solution (1.00)
Research Report > New Finding (1.00)

Industry:

Health & Medicine > Therapeutic Area > Oncology > Brain Cancer (1.00)
Health & Medicine > Therapeutic Area > Neurology (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Transfer Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Online Transfer Learning for RSV Case Detection

Sun, Yiming, Gao, Yuhe, Bao, Runxue, Cooper, Gregory F., Espino, Jessi, Hochheiser, Harry, Michaels, Marian G., Aronis, John M., Ye, Ye

arXiv.org Artificial Intelligence, Feb-2-2024

Machine learning has made substantial advancements in recent decades, with its applications spanning a wide range of fields such as image and speech recognition, natural language processing, and autonomous driving. Despite these achievements, machine learning in biomedicine faces significant challenges, particularly in data collection. The acquisition of labeled data can be very costly or even unfeasible due to factors like ethical considerations, patient privacy, and the scarcity of certain diseases. These challenges have led researchers to increasingly rely on utilizing data from related domains that have a more abundant supply of data. In such cases, transferring knowledge from the source domain becomes crucial, particularly because the limited initial data in the target domain may be insufficient for effective learning. The extensive and diverse information available from the source domains can significantly compensate for this shortfall, providing a foundational knowledge base that the model can build upon as more target domain data becomes available. Therefore, the efficiency and effectiveness of learning in the target domain are greatly enhanced by the transferred knowledge from the source domains. Online transfer learning entails leveraging knowledge from a static source domain and applying it to an ongoing, evolving ...

classifier, ensemble model, target domain, (17 more...)

arXiv.org Artificial Intelligence

2402.01987

Country:

North America > United States (0.29)
Asia > Singapore (0.04)
Europe > Greece (0.04)
Asia > Middle East > Israel > Haifa District > Haifa (0.04)

Genre:

Research Report > New Finding (0.95)
Instructional Material > Online (0.72)
Research Report > Experimental Study (0.69)

Industry:

Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (1.00)
Health & Medicine > Therapeutic Area > Immunology (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Transfer Learning (0.66)
Information Technology > Artificial Intelligence > Representation & Reasoning > Expert Systems (0.48)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.46)

Exploring transfer learning for pathological speech feature prediction: Impact of layer selection

Wiepert, Daniela A., Utianski, Rene L., Duffy, Joseph R., Stricker, John L., Barnard, Leland R., Jones, David T., Botha, Hugo

arXiv.org Artificial Intelligence, Feb-2-2024

There is interest in leveraging AI to conduct automatic, objective assessments of clinical speech, in turn facilitating diagnosis and treatment of speech disorders. We explore transfer learning, focusing on the impact of layer selection, for the downstream task of predicting the presence of pathological speech. We find that selecting an optimal layer offers large performance improvements (12.4% average increase in balanced accuracy), though the best layer varies by predicted feature and does not always generalize well to unseen data. A ...

One approach to address this is to focus on the pathological speech features that characterize speech disorders and create a model that predicts them instead of predicting disease [9]. The key insight behind this approach is that there are predictable mappings between groupings of features and disorder, and not all features are unique with respect to disease [10, 8]. Because of this, a model could learn the information necessary to recognize a specific type of dysarthria using recordings from a cohort with varying neurological diseases which can cause that dysarthria.
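
To make the layer-selection idea concrete, here is a minimal sketch of picking the layer whose predictions score the highest balanced accuracy (the mean of per-class recall) on held-out data. It is not the authors' pipeline: the function names, the labels, and the per-layer predictions are all hypothetical, and the predictions are hard-coded rather than produced by classifiers trained on real layer representations.

```python
def balanced_accuracy(y_true, y_pred):
    """Mean of per-class recall, computed over the classes present in y_true."""
    recalls = []
    for cls in sorted(set(y_true)):
        indices = [i for i, label in enumerate(y_true) if label == cls]
        correct = sum(1 for i in indices if y_pred[i] == cls)
        recalls.append(correct / len(indices))
    return sum(recalls) / len(recalls)


def select_best_layer(y_true, predictions_by_layer):
    """Return (layer_index, score) for the layer with the highest balanced accuracy."""
    scores = {
        layer: balanced_accuracy(y_true, preds)
        for layer, preds in predictions_by_layer.items()
    }
    best = max(scores, key=scores.get)  # on a tie, the first layer inserted wins
    return best, scores[best]


# Hypothetical validation labels (1 = feature present, 0 = absent).
y_val = [1, 1, 1, 1, 0, 0]

# Hypothetical predictions from classifiers trained on three different layers.
predictions_by_layer = {
    0: [1, 1, 1, 1, 1, 1],  # always predicts the majority class
    1: [1, 1, 1, 0, 0, 0],
    2: [1, 1, 0, 0, 0, 1],
}

best_layer, best_score = select_best_layer(y_val, predictions_by_layer)
print(best_layer, best_score)
```

Balanced accuracy is used here because plain accuracy would reward layer 0 for always predicting the majority class. Consistent with the abstract's caveat, a layer chosen this way on one validation split may not be the best layer on unseen data.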

best layer, classifier, representation, (14 more...)

arXiv.org Artificial Intelligence

2402.01796

Country: North America > United States (0.04)

Genre: Research Report > New Finding (1.00)

Industry: Health & Medicine > Therapeutic Area > Neurology (1.00)

Technology:

Information Technology > Artificial Intelligence > Speech > Speech Recognition (0.46)
Information Technology > Artificial Intelligence > Machine Learning > Transfer Learning (0.41)

DTL: Disentangled Transfer Learning for Visual Recognition

Fu, Minghao, Zhu, Ke, Wu, Jianxin

arXiv.org Artificial Intelligence, Feb-2-2024

When pre-trained models become rapidly larger, the cost of fine-tuning on downstream tasks steadily increases, too. To economically fine-tune these models, parameter-efficient transfer learning (PETL) is proposed, which only tunes a tiny subset of trainable parameters to efficiently learn quality representations. However, current PETL methods are facing the dilemma that during training the GPU memory footprint is not effectively reduced as trainable parameters. PETL will likely fail, too, if the full fine-tuning encounters the out-of-GPU-memory issue. This phenomenon happens because trainable parameters from these methods are generally entangled with the backbone, such that a lot of intermediate states have to be stored in GPU memory for gradient propagation. To alleviate this problem, we introduce Disentangled Transfer Learning (DTL), which disentangles the trainable parameters from the backbone using a lightweight Compact Side Network (CSN). By progressively extracting task-specific information with a few low-rank linear mappings and appropriately adding the information back to the backbone, CSN effectively realizes knowledge transfer in various downstream tasks. We conducted extensive experiments to validate the effectiveness of our method. The proposed method not only reduces a large amount of GPU memory usage and trainable parameters, but also outperforms existing PETL methods by a significant margin in accuracy, achieving new state-of-the-art on several standard benchmarks. The code is available at https://github.com/heekhero/DTL.
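
As a rough illustration of why low-rank linear mappings keep the number of trainable parameters small, the sketch below applies a d -> r -> d mapping to a feature vector and adds the result back to a frozen feature. It is not the Compact Side Network from the linked repository: the dimensions, weights, and names (`low_rank_map`, `down`, `up`, `backbone_out`) are assumptions chosen only for illustration.

```python
def matvec(matrix, vector):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(w * v for w, v in zip(row, vector)) for row in matrix]


def low_rank_map(x, down, up):
    """Apply up @ (down @ x): project d -> r, then back r -> d, with r < d."""
    return matvec(up, matvec(down, x))


d, r = 8, 2                            # hypothetical feature width and rank
down = [[0.1] * d for _ in range(r)]   # r x d trainable projection
up = [[0.5] * r for _ in range(d)]     # d x r trainable projection

backbone_out = [1.0] * d               # stand-in for a frozen backbone feature
side = low_rank_map(backbone_out, down, up)

# Add the side-path information back to the (unchanged) backbone feature.
adapted = [b + s for b, s in zip(backbone_out, side)]

full_params = d * d                    # a dense d x d mapping
low_rank_params = 2 * d * r            # the two low-rank factors
print(low_rank_params, full_params)
```

With these toy sizes the low-rank pair needs 32 weights against 64 for a dense mapping. The gap widens as d grows while r stays small.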

backbone, fine-tuning, trainable parameter, (12 more...)

arXiv.org Artificial Intelligence

2312.07856

Country:

Asia > China > Jiangsu Province > Nanjing (0.04)
Europe > Romania > Sud - Muntenia Development Region > Giurgiu County > Giurgiu (0.04)

Genre: Research Report (0.82)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Transfer Learning (0.81)

Survey of Privacy Threats and Countermeasures in Federated Learning

Hayashitani, Masahiro, Mori, Junki, Teranishi, Isamu

arXiv.org Artificial Intelligence, Feb-1-2024

Federated learning is widely considered to be a privacy-aware learning method because no training data is exchanged directly between clients. Nevertheless, there are threats to privacy in federated learning, and privacy countermeasures have been studied. However, we note that common and unique privacy threats among typical types of federated learning have not been categorized and described in a comprehensive and specific way. In this paper, we describe privacy threats and countermeasures for the typical types of federated learning: horizontal federated learning, vertical federated learning, and transfer federated learning.

federated learning, learning, privacy, (15 more...)

arXiv.org Artificial Intelligence

2402.00342

Country:

North America > United States > New York > New York County > New York City (0.05)
Asia > Japan (0.04)

Genre: Research Report (0.50)

Industry: Information Technology > Security & Privacy (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Transfer Learning (0.36)

Enhancing Blood Flow Assessment in Diffuse Correlation Spectroscopy: A Transfer Learning Approach with Noise Robustness Analysis

Chen, Xi, Li, Xingda

arXiv.org Artificial Intelligence, Feb-1-2024

Diffuse correlation spectroscopy (DCS) is an emerging noninvasive technique that measures tissue blood flow by using near-infrared coherent point-source illumination to detect spectral changes. While machine learning has demonstrated significant potential for measuring blood flow index (BFi), an open question concerning the success of this approach pertains to its robustness in scenarios involving deviations between datasets with varying Signal-to-Noise Ratios (SNRs) originating from diverse clinical applications and various setups. This study proposes a transfer learning approach, aiming to assess the influence of SNRs on the generalization ability of learned features and to demonstrate the robustness of transfer learning. A synthetic dataset with varying levels of added noise is utilized to simulate different SNRs. The proposed network takes a 1x64 autocorrelation curve as input and generates BFi and the correlation parameter beta. The proposed model demonstrates excellent performance across different SNRs, exhibiting enhanced fitting accuracy, particularly for low SNR datasets when compared with other fitting methods. This highlights its potential for clinical diagnosis and treatment across various scenarios under different clinical setups.
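
One common way to simulate different SNRs is to add zero-mean Gaussian noise whose power is set by a target SNR in decibels. The sketch below does that under stated assumptions: the abstract does not say how the authors define SNR or generate noise, so the power-ratio definition, the decaying-exponential stand-in for an autocorrelation curve, and all function names are hypothetical.

```python
import math
import random


def add_noise_at_snr(signal, snr_db, rng):
    """Return the signal plus zero-mean Gaussian noise scaled to a target SNR (dB)."""
    signal_power = sum(x * x for x in signal) / len(signal)
    noise_power = signal_power / (10 ** (snr_db / 10))
    sigma = math.sqrt(noise_power)
    return [x + rng.gauss(0.0, sigma) for x in signal]


def measured_snr_db(clean, noisy):
    """Empirical SNR in dB, treating (noisy - clean) as the noise."""
    signal_power = sum(x * x for x in clean) / len(clean)
    noise_power = sum((n - c) ** 2 for c, n in zip(clean, noisy)) / len(clean)
    return 10 * math.log10(signal_power / noise_power)


rng = random.Random(0)  # fixed seed so the run is repeatable

# Hypothetical stand-in data: 200 decaying curves of 64 samples each, flattened.
clean = [math.exp(-0.05 * (i % 64)) for i in range(64 * 200)]
noisy = add_noise_at_snr(clean, snr_db=10.0, rng=rng)

print(round(measured_snr_db(clean, noisy), 2))
```

Calling `add_noise_at_snr` with several `snr_db` values yields matched datasets that differ only in noise level, which is the kind of setup needed to compare a model's behaviour across SNRs.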

diffuse correlation spectroscopy, enhancing blood flow assessment, transfer learning approach, (1 more...)

arXiv.org Artificial Intelligence

2401.0558

Genre: Research Report (0.66)

Industry: Health & Medicine (0.80)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Transfer Learning (0.80)

Graph Domain Adaptation: Challenges, Progress and Prospects

Shi, Boshen, Wang, Yongqing, Guo, Fangda, Xu, Bingbing, Shen, Huawei, Cheng, Xueqi

arXiv.org Artificial Intelligence, Jan-31-2024

As graph representation learning often suffers from label scarcity problems in real-world applications, researchers have proposed graph domain adaptation (GDA) as an effective knowledge-transfer paradigm across graphs. In particular, to enhance model performance on target graphs with specific tasks, GDA introduces a bunch of task-related graphs as source graphs and adapts the knowledge learnt from source graphs to the target graphs. Since GDA combines the advantages of graph representation learning and domain adaptation, it has become a promising direction of transfer learning on graphs and has attracted an increasing amount of research interest in recent years. In this paper, we comprehensively overview the studies of GDA and present a detailed survey of recent advances. Specifically, we outline the research status and challenges, propose a taxonomy, introduce the details of representative works, and discuss the prospects. To the best of our knowledge, this paper is the first survey for graph domain adaptation. A detailed paper list is available at https://github.com/Skyorca/Awesome-Graph-Domain-Adaptation-Papers.

adaptation, graph, node, (16 more...)

arXiv.org Artificial Intelligence

2402.00904

Country: Asia > Myanmar > Tanintharyi Region > Dawei (0.04)

Genre: Overview (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Data Science > Data Mining (0.95)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.46)
Information Technology > Artificial Intelligence > Machine Learning > Transfer Learning (0.35)

Variational Transfer Learning using Cross-Domain Latent Modulation

Hou, Jinyong, Deng, Jeremiah D., Cranefield, Stephen, Din, Xuejie

arXiv.org Artificial Intelligence, Jan-31-2024

To successfully apply trained neural network models to new domains, powerful transfer learning solutions are essential. We propose to introduce a novel cross-domain latent modulation mechanism to a variational autoencoder framework so as to achieve effective transfer learning. Our key idea is to procure deep representations from one data domain and use it to influence the reparameterization of the latent variable of another domain. Specifically, deep representations of the source and target domains are first extracted by a unified inference model and aligned by employing gradient reversal. The learned deep representations are then cross-modulated to the latent encoding of the alternative domain, where consistency constraints are also applied. In the empirical validation that includes a number of transfer learning benchmark tasks for unsupervised domain adaptation and image-to-image translation, our model demonstrates competitive performance, which is also supported by evidence obtained from visualization.
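
Gradient reversal, in its general form, is a layer that passes values through unchanged in the forward pass and flips the sign of the gradient in the backward pass, so that the feature extractor is pushed to confuse a domain classifier. The sketch below writes that behaviour by hand without any autograd framework. It is not the authors' implementation, and the class name and the scaling factor `lam` are assumptions.

```python
class GradientReversal:
    """Identity in the forward pass; negated, scaled gradient in the backward pass."""

    def __init__(self, lam=1.0):
        self.lam = lam  # hypothetical strength of the reversed gradient

    def forward(self, features):
        # Features reach the domain classifier unchanged.
        return list(features)

    def backward(self, grad_output):
        # The gradient flowing back to the feature extractor has its sign flipped.
        return [-self.lam * g for g in grad_output]


layer = GradientReversal(lam=1.0)

features = [0.2, -1.5, 3.0]
out = layer.forward(features)

# Hypothetical gradient of the domain-classification loss w.r.t. `out`.
grad_from_domain_classifier = [0.5, -2.0, 0.0]
grad_to_extractor = layer.backward(grad_from_domain_classifier)
print(out, grad_to_extractor)
```

In a real training loop this logic would sit inside an automatic-differentiation framework as a custom operation, and `lam` is often scheduled over training rather than fixed.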

domain adaptation, latent space, representation, (14 more...)

arXiv.org Artificial Intelligence

2205.15523

Country:

Oceania > New Zealand (0.04)
Europe > Netherlands > South Holland > Leiden (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Research Report (0.82)

Industry: Education (0.68)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Transfer Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
