AITopics

Technology: Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.60)

Jhajj, Gaganpreet, Lin, Fuhua

Elastic Weight Consolidation for Knowledge Graph Continual Learning: An Empirical Evaluation

arXiv.org Artificial IntelligenceDec-2-2025

Knowledge graphs (KGs) require continual updates as new information emerges, but neural embedding models suffer from catastrophic forgetting when learning new tasks sequentially. We evaluate Elastic Weight Consolidation (EWC), a regularization-based continual learning method, on KG link prediction using TransE embeddings on FB15k-237. Across multiple experiments with five random seeds, we find that EWC reduces catastrophic forgetting from 12.62% to 6.85%, a 45.7% reduction compared to naive sequential training. We observe that the task partitioning strategy affects the magnitude of forgetting: relation-based partitioning (grouping triples by relation type) exhibits 9.8 percentage points higher forgetting than randomly partitioned tasks (12.62% vs 2.81%), suggesting that task construction influences evaluation outcomes. While focused on a single embedding model and dataset, our results demonstrate that EWC effectively mitigates catastrophic forgetting in KG continual learning and highlight the importance of evaluation protocol design.

large language model, machine learning, natural language, (15 more...)

2512.0189

Country: North America > United States > New York > New York County > New York City (0.14)

Genre: Research Report > New Finding (0.86)

Industry: Education (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.69)
Information Technology > Artificial Intelligence > Representation & Reasoning > Semantic Networks (0.64)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.47)

Harit, Anoushka, Prew, William, Sun, Zhongtian, Markowetz, Florian

EWC-Guided Diffusion Replay for Exemplar-Free Continual Learning in Medical Imaging

arXiv.org Artificial IntelligenceSep-30-2025

Medical imaging foundation models must adapt over time, yet full retraining is often blocked by privacy constraints and cost. We present a continual learning framework that avoids storing patient exemplars by pairing class conditional diffusion replay with Elastic Weight Consolidation. Using a compact Vision Transformer backbone, we evaluate across eight MedMNIST v2 tasks and CheXpert. On CheXpert our approach attains 0.851 AUROC, reduces forgetting by more than 30\% relative to DER\texttt{++}, and approaches joint training at 0.869 AUROC, while remaining efficient and privacy preserving. Analyses connect forgetting to two measurable factors: fidelity of replay and Fisher weighted parameter drift, highlighting the complementary roles of replay diffusion and synaptic stability. The results indicate a practical route for scalable, privacy aware continual adaptation of clinical imaging models.

continual learning, data mining, machine learning, (17 more...)

2509.23906

Country:

North America > United States (0.68)
Europe > United Kingdom > England (0.28)

Genre: Research Report (0.82)

Industry:

Health & Medicine > Diagnostic Medicine > Imaging (1.00)
Government > Regional Government > North America Government > United States Government (0.46)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

arXiv.org Artificial IntelligenceSep-17-2025

Unbiased Online Curvature Approximation for Regularized Graph Continual Learning

Yin, Jie, Sun, Ke, Wu, Han

Graph continual learning (GCL) aims to learn from a continuous sequence of graph-based tasks. Regularization methods are vital for preventing catastrophic forgetting in GCL, particularly in the challenging replay-free, class-incremental setting, where each task consists of a set of unique classes. In this work, we first establish a general regularization framework for GCL based on the curved parameter space induced by the Fisher information matrix (FIM). We show that the dominant Elastic Weight Consolidation (EWC) and its variants are a special case within this framework, using a diagonal approximation of the empirical FIM based on parameters from previous tasks. To overcome their limitations, we propose a new unbiased online curvature approximation of the full FIM based on the model's current learning state. Our method directly estimates the regularization term in an online manner without explicitly evaluating and storing the FIM itself. This enables the model to better capture the loss landscape during learning new tasks while retaining the knowledge learned from previous tasks. Extensive experiments on three graph datasets demonstrate that our method significantly outperforms existing regularization-based methods, achieving a superior trade-off between stability (retaining old knowledge) and plasticity (acquiring new knowledge).

approximation, artificial intelligence, machine learning, (15 more...)

2509.12727

Genre: Research Report (0.64)

Industry: Education > Educational Setting (0.34)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.69)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.47)

Neural Information Processing SystemsAug-16-2025, 00:24:39 GMT

Few-shot Image Generation with Elastic Weight Consolidation

Crucially, we regularize the changes of the weights during this adaptation, in order to best preserve the "information" of the source dataset, while fitting the target. We demonstrate the effectiveness of our algorithm by generating high-quality results of different target domains, including those with extremely few examples (e.g.,

adaptation, source domain, target domain, (14 more...)

Country: North America > Canada (0.04)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Gaurav, Suyash, Heikkonen, Jukka, Chaudhary, Jatin

Pathway-based Progressive Inference (PaPI) for Energy-Efficient Continual Learning

arXiv.org Artificial IntelligenceJun-24-2025

Continual learning systems face the dual challenge of preventing catastrophic forgetting while maintaining energy efficiency, particularly in resource-constrained environments. This paper introduces Pathway-based Progressive Inference (PaPI), a novel theoretical framework that addresses these challenges through a mathematically rigorous approach to pathway selection and adaptation. We formulate continual learning as an energy-constrained optimization problem and provide formal convergence guarantees for our pathway routing mechanisms. Our theoretical analysis demonstrates that PaPI achieves an $\mathcal{O}(K)$ improvement in the stability-plasticity trade-off compared to monolithic architectures, where $K$ is the number of pathways. We derive tight bounds on forgetting rates using Fisher Information Matrix analysis and prove that PaPI's energy consumption scales with the number of active parameters rather than the total model size. Comparative theoretical analysis shows that PaPI provides stronger guarantees against catastrophic forgetting than Elastic Weight Consolidation (EWC) while maintaining better energy efficiency than both EWC and Gradient Episodic Memory (GEM). Our experimental validation confirms these theoretical advantages across multiple benchmarks, demonstrating PaPI's effectiveness for continual learning in energy-constrained settings. Our codes are available at https://github.com/zser092/PAPI_FILES.

artificial intelligence, machine learning, natural language, (15 more...)

2506.17848

Country:

Europe (0.28)
Asia > Japan (0.28)

Genre: Research Report (0.82)

Industry: Energy (0.35)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.95)

Neural Information Processing SystemsJan-27-2025, 20:34:55 GMT

Review for NeurIPS paper: Few-shot Image Generation with Elastic Weight Consolidation

Weaknesses: The quality of the paper is already great, but there are a few comments. In equation 3 (page 4), it is not clear whether you compute F of the generated source or target data. Also, I don't quite understand why the FI is computed for the difference between the pretrained and finetuned parameters, and not just for the pretrained parameters. Finally, I assume i in this equation is the layer index, but this should be clearly stated. Update: In the rebuttal, the authors kindly explained that the F is computed for each individual parameter in the network rather than for the entire layer.

elastic weight consolidation, few-shot image generation, neurips paper, (3 more...)

Technology:

Information Technology > Artificial Intelligence > Vision (0.85)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.40)

Neural Information Processing SystemsJan-27-2025, 20:34:48 GMT

Review for NeurIPS paper: Few-shot Image Generation with Elastic Weight Consolidation

This paper proposes an interesting framework for generating images from few-shot data. While some reviewers were concerned about the quality of generated images, the ideas in this paper is interesting enough to justify publicaition.

elastic weight consolidation, few-shot image generation, neurips paper

Technology:

Information Technology > Artificial Intelligence > Vision (0.85)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.40)

Neural Information Processing SystemsOct-11-2024, 04:37:39 GMT

Few-shot Image Generation with Elastic Weight Consolidation

elastic weight consolidation, few-shot image generation, target domain, (1 more...)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.65)
Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.64)

McAlister, Hayden, Robins, Anthony, Szymanski, Lech

Sequential Learning in the Dense Associative Memory

arXiv.org Artificial IntelligenceSep-24-2024

Sequential learning involves learning tasks in a sequence, and proves challenging for most neural networks. Biological neural networks regularly conquer the sequential learning challenge and are even capable of transferring knowledge both forward and backwards between tasks. Artificial neural networks often totally fail to transfer performance between tasks, and regularly suffer from degraded performance or catastrophic forgetting on previous tasks. Models of associative memory have been used to investigate the discrepancy between biological and artificial neural networks due to their biological ties and inspirations, of which the Hopfield network is perhaps the most studied model. The Dense Associative Memory, or modern Hopfield network, generalizes the Hopfield network, allowing for greater capacities and prototype learning behaviors, while still retaining the associative memory structure. We investigate the performance of the Dense Associative Memory in sequential learning problems, and benchmark various sequential learning techniques in the network. We give a substantial review of the sequential learning space with particular respect to the Hopfield network and associative memories, as well as describe the techniques we implement in detail. We also draw parallels between the classical and Dense Associative Memory in the context of sequential learning, and discuss the departures from biological inspiration that may influence the utility of the Dense Associative Memory as a tool for studying biological neural networks. We present our findings, and show that existing sequential learning methods can be applied to the Dense Associative Memory to improve sequential learning performance.

dense associative memory, interaction vertex, sequential, (11 more...)

2409.15729

Country:

North America > Canada > Ontario > Toronto (0.14)
Europe > France > Occitanie > Haute-Garonne > Toulouse (0.04)
Oceania > New Zealand > South Island > Otago > Dunedin (0.04)
(5 more...)

Genre: Research Report > New Finding (0.66)

Industry: Education (0.66)

Technology:

Information Technology > Artificial Intelligence > Systems & Languages > Programming Languages (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (1.00)