AITopics

2508.17567

Country:

Europe > United Kingdom > England (0.28)
Europe > Austria (0.28)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)

Industry: Health & Medicine > Diagnostic Medicine > Imaging (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Transfer Learning (0.72)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.68)

arXiv.org Artificial Intelligence, Aug-22-2025

ITL-LIME: Instance-Based Transfer Learning for Enhancing Local Explanations in Low-Resource Data Settings

Raza, Rehan, Wang, Guanjin, Wong, Kok Wai, Laga, Hamid, Fisichella, Marco

Explainable Artificial Intelligence (XAI) methods, such as Local Interpretable Model-Agnostic Explanations (LIME), have advanced the interpretability of black-box machine learning models by approximating their behavior locally using interpretable surrogate models. However, LIME's inherent randomness in perturbation and sampling can lead to locality and instability issues, especially in scenarios with limited training data. In such cases, data scarcity can result in the generation of unrealistic variations and samples that deviate from the true data manifold. Consequently, the surrogate model may fail to accurately approximate the complex decision boundary of the original model. To address these challenges, we propose a novel Instance-based Transfer Learning LIME framework (ITL-LIME) that enhances explanation fidelity and stability in data-constrained environments. ITL-LIME introduces instance transfer learning into the LIME framework by leveraging relevant real instances from a related source domain to aid the explanation process in the target domain. Specifically, we employ clustering to partition the source domain into clusters with representative prototypes. Instead of generating random perturbations, our method retrieves pertinent real source instances from the source cluster whose prototype is most similar to the target instance. These are then combined with the target instance's neighboring real instances. To define a compact locality, we further construct a contrastive learning-based encoder as a weighting mechanism to assign weights to the instances from the combined set based on their proximity to the target instance. Finally, these weighted source and target instances are used to train the surrogate model for explanation purposes.
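The pipeline described in this abstract (cluster the source domain, pick the cluster whose prototype is closest to the target instance, pool its real instances with the target's real neighbours, weight by proximity, fit a surrogate) can be sketched roughly as follows. This is an illustrative sketch only, not the authors' implementation: the function names `kmeans` and `explain_instance`, all parameter names and defaults, and the toy data are hypothetical. In particular, the abstract's contrastive learning-based encoder is replaced here by a plain exponential kernel on Euclidean distance, and the surrogate is assumed to be a weighted ridge regression.

```python
import numpy as np

def kmeans(X, k, iters=20, seed=0):
    """Tiny k-means; the centers stand in for the cluster prototypes."""
    rng = np.random.default_rng(seed)
    centers = X[rng.choice(len(X), size=k, replace=False)].astype(float)
    for _ in range(iters):
        dists = np.linalg.norm(X[:, None, :] - centers[None, :, :], axis=2)
        labels = dists.argmin(axis=1)
        for j in range(k):
            if np.any(labels == j):
                centers[j] = X[labels == j].mean(axis=0)
    # Recompute labels so they match the final centers.
    dists = np.linalg.norm(X[:, None, :] - centers[None, :, :], axis=2)
    return centers, dists.argmin(axis=1)

def explain_instance(x, X_source, X_target, black_box,
                     k_clusters=3, n_neighbors=10,
                     kernel_width=1.0, ridge=1e-3):
    """Fit a weighted linear surrogate around x using real instances only."""
    # 1. Partition the source domain and find the closest prototype.
    centers, labels = kmeans(X_source, k_clusters)
    nearest = np.linalg.norm(centers - x, axis=1).argmin()
    source_pool = X_source[labels == nearest]

    # 2. Add the target instance's nearest real neighbours.
    order = np.linalg.norm(X_target - x, axis=1).argsort()
    target_pool = X_target[order[:n_neighbors]]
    Z = np.vstack([source_pool, target_pool])

    # 3. Proximity weights. ASSUMPTION: a distance kernel replaces the
    #    contrastive encoder described in the abstract.
    dist = np.linalg.norm(Z - x, axis=1)
    w = np.exp(-(dist ** 2) / kernel_width ** 2)

    # 4. Weighted ridge regression on the black-box outputs.
    y = black_box(Z)
    A = np.hstack([Z, np.ones((len(Z), 1))])  # last column is the intercept
    gram = A.T @ (w[:, None] * A) + ridge * np.eye(A.shape[1])
    coef = np.linalg.solve(gram, A.T @ (w * y))
    return coef[:-1], coef[-1]

# Toy usage with a hypothetical black box (a known linear function).
rng = np.random.default_rng(0)
X_source = rng.normal(size=(200, 2))
X_target = rng.normal(size=(30, 2))
black_box = lambda Z: 2.0 * Z[:, 0] - 3.0 * Z[:, 1] + 1.0
coefs, intercept = explain_instance(X_target[0], X_source, X_target, black_box)
print(coefs, intercept)
```

The returned coefficients play the role of the local feature attributions. Because every row of the pooled set is a real instance, no random perturbation is generated at any step, which is the point the abstract emphasises.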

artificial intelligence, explanation, machine learning, (15 more...)

2508.13672

Country:

North America > United States (0.67)
Europe (0.46)
Oceania > Australia > Western Australia (0.14)

Genre: Research Report > New Finding (0.68)

Industry: Health & Medicine (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Transfer Learning (0.82)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (0.46)

Neural Information Processing Systems, Aug-20-2025, 01:49:30 GMT

Catastrophic Forgetting Meets Negative Transfer: Batch Spectral Shrinkage for Safe Transfer Learning

Xinyang Chen, Sinan Wang, Bo Fu, Mingsheng Long, Jianmin Wang

Neural Information Processing Systems http://nips.cc/

negative transfer, singular value, weight parameter, (14 more...)

Country:

Asia > China (0.05)
Oceania > Australia > New South Wales > Sydney (0.04)
North America > United States > Colorado > El Paso County > Colorado Springs (0.04)
(3 more...)

Genre: Research Report (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Transfer Learning (0.87)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)

Neural Information Processing Systems, Aug-20-2025, 01:48:55 GMT

Transfer Learning via Minimizing the Performance Gap Between Domains

Boyu Wang, Jorge Mendez, Mingbo Cai, Eric Eaton

Neural Information Processing Systems http://nips.cc/

algorithm, artificial intelligence, machine learning, (14 more...)

Country: North America > Canada (0.28)

Industry: Education (0.47)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Transfer Learning (0.58)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.46)

Steve Hanneke, Samory Kpotufe

On the Value of Target Data in Transfer Learning

Neural Information Processing Systems, Aug-20-2025, 00:17:38 GMT

Neural Information Processing Systems http://nips.cc/

classifier, discrepancy, unlabeled data, (15 more...)

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
North America > United States > Illinois > Cook County > Chicago (0.04)
North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.46)
Information Technology > Artificial Intelligence > Machine Learning > Transfer Learning (0.41)

Neural Information Processing Systems, Aug-19-2025, 05:46:42 GMT

Hub-Pathway: Transfer Learning from A Hub of Pre-trained Models

Prior transfer learning work mainly transfers from a single model.

artificial intelligence, machine learning, pre-trained model, (16 more...)

Country:

Asia > Middle East > Jordan (0.04)
Asia > China > Beijing > Beijing (0.04)

Genre: Research Report (0.68)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Transfer Learning (0.75)

Bonilla, Jose L., Graczyk, Krzysztof M., Ankowski, Artur M., Banerjee, Rwik Dharmapal, Kowal, Beata E., Prasad, Hemant, Sobczyk, Jan T.

Transfer Learning for Neutrino Scattering: Domain Adaptation with GANs

arXiv.org Artificial Intelligence, Aug-19-2025

Significant experimental efforts have been devoted to studying (anti)neutrino-nucleus interactions [1, 2] in the energy range relevant for next-generation neutrino oscillation experiments, such as Hyper-Kamiokande [3] and DUNE [4]. In parallel, theoretical models describing these interactions have been developed [2]. The outcomes of both experimental and theoretical advances are incorporated into Monte Carlo (MC) event generators, which simulate (anti)neutrino-nucleus collisions under realistic conditions [5-10]. MC generators are often tuned to reproduce experimental observations, relying on adjustable parameters that are fitted using available data [11]. However, this tuning process cannot fully compensate for the fundamental limitations of the underlying models, especially those relying on complex approximations, such as nuclear modeling. Consequently, there is a growing interest in alternatives to traditional MC event generation: methods that can learn directly from experimental data and dynamically refine their predictions.

artificial intelligence, arxiv, machine learning, (16 more...)

2508.12987

Country: North America > United States (0.14)

Genre: Research Report > New Finding (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Transfer Learning (0.44)

arXiv.org Artificial Intelligence, Aug-19-2025

SEDEG: Sequential Enhancement of Decoder and Encoder's Generality for Class Incremental Learning with Small Memory

Chen, Hongyang, Pu, Shaoling, Zheng, Lingyu, Sun, Zhongwu

In incremental learning, enhancing the generality of knowledge is crucial for adapting to dynamic data inputs. It can develop generalized representations or more balanced decision boundaries, preventing the degradation of long-term knowledge over time and thus mitigating catastrophic forgetting. Some emerging incremental learning methods adopt an encoder-decoder architecture and have achieved promising results. In the encoder-decoder architecture, improving the generalization capabilities of both the encoder and decoder is critical, as it helps preserve previously learned knowledge while ensuring adaptability and robustness to new, diverse data inputs. However, many existing continual learning methods focus solely on enhancing one of the two components, which limits their effectiveness in mitigating catastrophic forgetting. These methods perform even worse in small-memory scenarios, where only a limited number of historical samples can be stored. To mitigate this limitation, we introduce SEDEG, a two-stage training framework for vision transformers (ViT) that sequentially improves the generality of both the decoder and the encoder. Initially, SEDEG trains an ensembled encoder through feature boosting to learn generalized representations, which subsequently enhance the decoder's generality and balance the classifier. The next stage uses knowledge distillation (KD) strategies to compress the ensembled encoder and develop a new, more generalized encoder. This involves a balanced KD approach and feature KD for effective knowledge transfer. Extensive experiments on three benchmark datasets show SEDEG's superior performance, and ablation studies confirm the efficacy of its components. The code is available at https://github.com/ShaolingPu/CIL.
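The second stage described above relies on two kinds of distillation signal: one on the classifier outputs and one on the encoder features. The sketch below shows the generic textbook forms of both, assuming a temperature-scaled KL divergence for the logit term and a mean squared error for the feature term. It is not SEDEG's balanced KD, whose details the abstract does not give; the function names, the temperature `T`, and the toy arrays are all hypothetical.

```python
import numpy as np

def log_softmax(logits, T=1.0):
    """Numerically stable log-softmax over the last axis at temperature T."""
    z = logits / T
    z = z - z.max(axis=1, keepdims=True)
    return z - np.log(np.exp(z).sum(axis=1, keepdims=True))

def logit_kd_loss(student_logits, teacher_logits, T=2.0):
    """KL(teacher || student) on softened distributions, scaled by T^2."""
    log_p = log_softmax(teacher_logits, T)  # teacher (ensembled encoder)
    log_q = log_softmax(student_logits, T)  # student (compressed encoder)
    kl = np.sum(np.exp(log_p) * (log_p - log_q), axis=1)
    return float(T ** 2 * kl.mean())

def feature_kd_loss(student_feat, teacher_feat):
    """Mean squared error between student and teacher feature vectors."""
    return float(np.mean((student_feat - teacher_feat) ** 2))

# Toy usage with made-up logits and features for a batch of 4 samples.
rng = np.random.default_rng(0)
teacher_logits = rng.normal(size=(4, 5))
student_logits = rng.normal(size=(4, 5))
teacher_feat = rng.normal(size=(4, 16))
student_feat = rng.normal(size=(4, 16))
total = logit_kd_loss(student_logits, teacher_logits) \
    + feature_kd_loss(student_feat, teacher_feat)
print(total)
```

In practice the two terms are usually combined with tunable weights and added to the ordinary classification loss; the unweighted sum here is only for illustration.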

artificial intelligence, encoder, machine learning, (14 more...)

2508.12932

Country: Asia > China (0.15)

Genre: Research Report (0.64)

Industry: Education > Educational Setting (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.49)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.47)
Information Technology > Artificial Intelligence > Machine Learning > Transfer Learning (0.34)

Jayasundara, Ridma, Fernando, Ishan, Fernando, Adeepa, Ragel, Roshan, Thambawita, Vajira, Nawinne, Isuru

Inductive transfer learning from regression to classification in ECG analysis

arXiv.org Artificial Intelligence, Aug-19-2025

Cardiovascular diseases (CVDs) are the leading cause of mortality worldwide, accounting for over 30% of global deaths according to the World Health Organization (WHO). Importantly, one-third of these deaths are preventable with timely and accurate diagnosis. The electrocardiogram (ECG), a non-invasive method for recording the electrical activity of the heart, is crucial for diagnosing CVDs. However, privacy concerns surrounding the use of patient ECG data in research have spurred interest in synthetic data, which preserves the statistical properties of real data without compromising patient confidentiality. This study explores the potential of synthetic ECG data for training deep learning models from regression to classification tasks and evaluates the feasibility of transfer learning to enhance classification performance on real ECG data. We experimented with popular deep learning models to predict four key cardiac parameters, namely Heart Rate (HR), PR interval, QT interval, and QRS complex, using separate regression models. Subsequently, we leveraged these regression models for transfer learning to perform 5-class ECG signal classification. Our experiments systematically investigate whether transfer learning from regression to classification is viable, enabling better utilization of diverse open-access and synthetic ECG datasets. Our findings demonstrate that transfer learning from regression to classification improves classification performance, highlighting its potential to maximize the utility of available data and advance deep learning applications in this domain.

artificial intelligence, deep learning, machine learning, (19 more...)

2508.11656

Country: Europe (0.67)

Genre: Research Report > New Finding (1.00)

Industry:

Health & Medicine > Therapeutic Area > Cardiology/Vascular Diseases (1.00)
Health & Medicine > Diagnostic Medicine (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Transfer Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Neural Information Processing Systems, Aug-17-2025, 17:46:12 GMT

Zero-shot Transfer Learning within a Heterogeneous Graph via Knowledge Transfer Networks

State-of-the-art graph learning methods for heterogeneous graphs (HGs), known as heterogeneous graph neural networks (HGNNs), are applied to learn deep context-informed node representations.

large language model, machine learning, natural language, (16 more...)

Country:

North America > United States > California > Los Angeles County > Los Angeles (0.14)
Asia > Middle East > Jordan (0.04)

Industry: Information Technology (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.49)
Information Technology > Artificial Intelligence > Machine Learning > Transfer Learning (0.43)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.42)