AITopics | Transfer Learning

Collaborating Authors

Transfer Learning

Transfer Learning is the reuse of a pre-trained model on a new problem. (Towards Data Science)

News Overviews Instructional Materials AI-Alerts Classics

P2W: From Power Traces to Weights Matrix -- An Unconventional Transfer Learning Approach

Siyadatzadeh, Roozbeh, Mehrafrooz, Fatemeh, Mentens, Nele, Stefanov, Todor

arXiv.org Artificial IntelligenceFeb-20-2025

The rapid growth of deploying machine learning (ML) models within embedded systems on a chip (SoCs) has led to transformative shifts in fields like healthcare and autonomous vehicles. One of the primary challenges for training such embedded ML models is the lack of publicly available high-quality training data. Transfer learning approaches address this challenge by utilizing the knowledge encapsulated in an existing ML model as a starting point for training a new ML model. However, existing transfer learning approaches require direct access to the existing model which is not always feasible, especially for ML models deployed on embedded SoCs. Therefore, in this paper, we introduce a novel unconventional transfer learning approach to train a new ML model by extracting and using weights from an existing ML model running on an embedded SoC without having access to the model within the SoC. Our approach captures power consumption measurements from the SoC while it is executing the ML model and translates them to an approximated weights matrix used to initialize the new ML model. This improves the learning efficiency and predictive performance of the new model, especially in scenarios with limited data available to train the model. Our novel approach can effectively increase the accuracy of the new ML model up to 3 times compared to classical training methods using the same amount of limited training data.

accuracy, ml model, new ml model, (15 more...)

arXiv.org Artificial Intelligence

2502.14968

Country:

North America > United States > California > San Francisco County > San Francisco (0.14)
North America > United States > New York > New York County > New York City (0.04)
Europe > Netherlands > South Holland > Leiden (0.04)
(2 more...)

Genre:

Overview > Innovation (0.48)
Research Report > New Finding (0.46)

Industry:

Information Technology > Security & Privacy (0.93)
Health & Medicine > Therapeutic Area (0.69)
Health & Medicine > Diagnostic Medicine (0.68)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Transfer Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.68)

Add feedback

Distribution Matching for Self-Supervised Transfer Learning

Jiao, Yuling, Ma, Wensen, Sun, Defeng, Wang, Hansheng, Wang, Yang

arXiv.org Machine LearningFeb-20-2025

In this paper, we propose a novel self-supervised transfer learning method called Distribution Matching (DM), which drives the representation distribution toward a predefined reference distribution while preserving augmentation invariance. The design of DM results in a learned representation space that is intuitively structured and offers easily interpretable hyperparameters. Experimental results across multiple real-world datasets and evaluation metrics demonstrate that DM performs competitively on target classification tasks compared to existing self-supervised transfer learning methods. Additionally, we provide robust theoretical guarantees for DM, including a population theorem and an end-to-end sample theorem. The population theorem bridges the gap between the self-supervised learning task and target classification accuracy, while the sample theorem shows that, even with a limited number of samples from the target domain, DM can deliver exceptional classification performance, provided the unlabeled sample size is sufficiently large.

augmentation, learning, representation space, (14 more...)

arXiv.org Machine Learning

2502.14424

Country:

Asia > China > Hubei Province > Wuhan (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Asia > China > Hong Kong (0.04)
(9 more...)

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Transfer Learning (0.81)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.34)

Add feedback

Enhancing Machine Learning Potentials through Transfer Learning across Chemical Elements

Röcken, Sebastien, Zavadlav, Julija

arXiv.org Artificial IntelligenceFeb-19-2025

Machine Learning Potentials (MLPs) can enable simulations of ab initio accuracy at orders of magnitude lower computational cost. However, their effectiveness hinges on the availability of considerable datasets to ensure robust generalization across chemical space and thermodynamic conditions. The generation of such datasets can be labor-intensive, highlighting the need for innovative methods to train MLPs in data-scarce scenarios. Here, we introduce transfer learning of potential energy surfaces between chemically similar elements. Specifically, we leverage the trained MLP for silicon to initialize and expedite the training of an MLP for germanium. Utilizing classical force field and ab initio datasets, we demonstrate that transfer learning surpasses traditional training from scratch in force prediction, leading to more stable simulations and improved temperature transferability. These advantages become even more pronounced as the training dataset size decreases. The out-of-target property analysis shows that transfer learning leads to beneficial but sometimes adversarial effects. Our findings demonstrate that transfer learning across chemical elements is a promising technique for developing accurate and numerically stable MLPs, particularly in a data-scarce regime.

chemical element, dataset, mlp, (13 more...)

arXiv.org Artificial Intelligence

2502.13522

Country:

Europe > Germany > Bavaria > Upper Bavaria > Munich (0.05)
Africa > Togo (0.04)
Asia > Japan (0.04)

Genre: Research Report > New Finding (1.00)

Industry: Materials > Chemicals (0.74)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Transfer Learning (1.00)

Add feedback

Universal Embedding Function for Traffic Classification via QUIC Domain Recognition Pretraining: A Transfer Learning Success

Luxemburk, Jan, Hynek, Karel, Plný, Richard, Čejka, Tomáš

arXiv.org Artificial IntelligenceFeb-18-2025

Encrypted traffic classification (TC) methods must adapt to new protocols and extensions as well as to advancements in other machine learning fields. In this paper, we follow a transfer learning setup best known from computer vision. We first pretrain an embedding model on a complex task with a large number of classes and then transfer it to five well-known TC datasets. The pretraining task is recognition of SNI domains in encrypted QUIC traffic, which in itself is a problem for network monitoring due to the growing adoption of TLS Encrypted Client Hello. Our training pipeline -- featuring a disjoint class setup, ArcFace loss function, and a modern deep learning architecture -- aims to produce universal embeddings applicable across tasks. The proposed solution, based on nearest neighbors search in the embedding space, surpasses SOTA performance on four of the five TC datasets. A comparison with a baseline method utilizing raw packet sequences revealed unexpected findings with potential implications for the broader TC field. We published the model architecture, trained weights, and transfer learning experiments.

artificial intelligence, deep learning, machine learning, (19 more...)

arXiv.org Artificial Intelligence

2502.1293

Country:

Europe (0.46)
North America > United States (0.14)

Genre: Research Report > New Finding (1.00)

Industry:

Telecommunications > Networks (0.67)
Information Technology > Networks (0.48)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Transfer Learning (0.81)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.48)

Add feedback

Transfer Learning of CATE with Kernel Ridge Regression

Kim, Seok-Jin, Liu, Hongjie, Liu, Molei, Wang, Kaizheng

arXiv.org Machine LearningFeb-16-2025

The proliferation of data has sparked significant interest in leveraging findings from one study to estimate treatment effects in a different target population without direct outcome observations. However, the transfer learning process is frequently hindered by substantial covariate shift and limited overlap between (i) the source and target populations, as well as (ii) the treatment and control groups within the source. We propose a novel method for overlap-adaptive transfer learning of conditional average treatment effect (CATE) using kernel ridge regression (KRR). Our approach involves partitioning the labeled source data into two subsets. The first one is used to train candidate CATE models based on regression adjustment and pseudo-outcomes. An optimal model is then selected using the second subset and unlabeled target data, employing another pseudo-outcome-based strategy. We provide a theoretical justification for our method through sharp non-asymptotic MSE bounds, highlighting its adaptivity to both weak overlaps and the complexity of CATE function. Extensive numerical studies confirm that our method achieves superior finite-sample efficiency and adaptability. We conclude by demonstrating the effectiveness of our approach using a 401(k) eligibility dataset.

artificial intelligence, inequality, machine learning, (16 more...)

arXiv.org Machine Learning

2502.11331

Genre:

Research Report > Experimental Study (1.00)
Research Report > New Finding (0.93)

Industry: Health & Medicine (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.84)
Information Technology > Artificial Intelligence > Machine Learning > Transfer Learning (0.81)

Add feedback

SPIRIT: Short-term Prediction of solar IRradIance for zero-shot Transfer learning using Foundation Models

Mishra, Aditya, T, Ravindra, Iyengar, Srinivasan, Kalyanaraman, Shivkumar, Kumaraguru, Ponnurangam

arXiv.org Artificial IntelligenceFeb-14-2025

Traditional solar forecasting models are based on several years of site-specific historical irradiance data, often spanning five or more years, which are unavailable for newer photovoltaic farms. As renewable energy is highly intermittent, building accurate solar irradiance forecasting systems is essential for efficient grid management and enabling the ongoing proliferation of solar energy, which is crucial to achieve the United Nations' net zero goals. In this work, we propose SPIRIT, a novel approach leveraging foundation models for solar irradiance forecasting, making it applicable to newer solar installations. Our approach outperforms state-of-the-art models in zero-shot transfer learning by about 70%, enabling effective performance at new locations without relying on any historical data. Further improvements in performance are achieved through fine-tuning, as more location-specific data becomes available. These findings are supported by statistical significance, further validating our approach. SPIRIT represents a pivotal step towards rapid, scalable, and adaptable solar forecasting solutions, advancing the integration of renewable energy into global power systems.

large language model, machine learning, skipp, (18 more...)

arXiv.org Artificial Intelligence

2502.10307

Country:

Europe (0.67)
North America > United States > California (0.28)

Genre:

Research Report > Promising Solution (0.68)
Research Report > Experimental Study (0.48)
Research Report > New Finding (0.46)

Industry: Energy > Renewable > Solar (1.00)

Technology:

Information Technology > Modeling & Simulation (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.94)
(2 more...)

Add feedback

Revisiting Euclidean Alignment for Transfer Learning in EEG-Based Brain-Computer Interfaces

Wu, Dongrui

arXiv.org Artificial IntelligenceFeb-13-2025

Due to the non-stationarity and large individual differences of EEG signals, EEG-based brain-computer interfaces (BCIs) usually need subject-specific calibration to tailor the decoding algorithm for each new subject, which is time-consuming and user-unfriendly, hindering their real-world applications. Transfer learning (TL) has been extensively used to expedite the calibration, by making use of EEG data from other subjects/sessions. An important consideration in TL for EEG-based BCIs is to reduce the data distribution discrepancies among different subjects/session, to avoid negative transfer. Euclidean alignment (EA) was proposed in 2020 to address this challenge. Numerous experiments from 10 different BCI paradigms demonstrated its effectiveness and efficiency. This paper revisits the EA, explaining its procedure and correct usage, introducing its applications and extensions, and pointing out potential new research directions. It should be very helpful to BCI researchers, especially those who are working on EEG signal decoding.

artificial intelligence, brain-computer interface, machine learning, (11 more...)

arXiv.org Artificial Intelligence

2502.09203

Country:

Europe (1.00)
Asia > China (0.47)

Genre: Research Report (1.00)

Industry:

Information Technology > Security & Privacy (1.00)
Health & Medicine > Therapeutic Area > Neurology (1.00)
Health & Medicine > Health Care Technology (0.69)
Government (0.68)

Technology:

Information Technology > Artificial Intelligence > Cognitive Science > Neuroscience (0.87)
Information Technology > Artificial Intelligence > Machine Learning > Transfer Learning (0.65)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

Knowledge-Guided Wasserstein Distributionally Robust Optimization

Wang, Zitao, Wang, Ziyuan, Liu, Molei, Si, Nian

arXiv.org Machine LearningFeb-12-2025

Transfer learning is a popular strategy to leverage external knowledge and improve statistical efficiency, particularly with a limited target sample. We propose a novel knowledge-guided Wasserstein Distributionally Robust Optimization (KG-WDRO) framework that adaptively incorporates multiple sources of external knowledge to overcome the conservativeness of vanilla WDRO, which often results in overly pessimistic shrinkage toward zero. Our method constructs smaller Wasserstein ambiguity sets by controlling the transportation along directions informed by the source knowledge. This strategy can alleviate perturbations on the predictive projection of the covariates and protect against information loss. Theoretically, we establish the equivalence between our WDRO formulation and the knowledge-guided shrinkage estimation based on collinear similarity, ensuring tractability and geometrizing the feasible set. This also reveals a novel and general interpretation for recent shrinkage-based transfer learning approaches from the perspective of distributional robustness. In addition, our framework can adjust for scaling differences in the regression models between the source and target and accommodates general types of regularization such as lasso and ridge. Extensive simulations demonstrate the superior performance and adaptivity of KG-WDRO in enhancing small-sample transfer learning.

artificial intelligence, convex conjugate, machine learning, (15 more...)

arXiv.org Machine Learning

2502.08146

Country:

North America > United States > New York > New York County > New York City (0.04)
North America > United States > New Jersey > Mercer County > Princeton (0.04)
Asia > China > Hong Kong (0.04)

Genre:

Overview (0.67)
Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Transfer Learning (0.75)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.49)

Add feedback

Reviews: Transfer Learning via Minimizing the Performance Gap Between Domains

Neural Information Processing SystemsFeb-11-2025, 23:16:46 GMT

After rebuttal I thank the authors for their reply, they have managed to clarify some of my concerns and overall I vote for acceptance of the paper. This paper introduces a boosting method for transfer learning with instance re-weighting in the setting where labeled data are available in both training and target tasks. Theorem 1 provides a bound for the population error on the target task, and motivates four instance re-weighting principles''. A practical procedure is introduced, which achieves competitive results on two standard datasets for transfer learning. Novelty: To my knowledge, the theoretical analysis carried out by the authors in the context of fully labeled data is novel.

algorithm, performance gap, transfer learning, (6 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Transfer Learning (0.84)

Add feedback

Reviews: Transfer Learning via Minimizing the Performance Gap Between Domains

Neural Information Processing SystemsFeb-11-2025, 23:16:35 GMT

The paper presents a novel theoretical analysis in the context of fully labeled data that is novel and sound. The methodological and algorithmic contributions based on a boosting strategy for a reweighed scheme is novel and shows good results. The experimental study can be improved with more baselines and datasets.

minimizing, performance gap, transfer learning

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Transfer Learning (0.40)

Add feedback