Flashbacks to Harmonize Stability and Plasticity in Continual Learning

Mahmoodi, Leila, Moghadam, Peyman, Hayat, Munawar, Simon, Christian, Harandi, Mehrtash

arXiv.org Machine Learning 

We introduce Flashback Learning (FL), a novel method designed to harmonize the stability and plasticity of models in Continual Learning (CL). Unlike prior approaches that primarily focus on regularizing model updates to preserve old information while learning new concepts, FL explicitly balances this trade-off through a bidirectional form of regularization. This approach effectively guides the model to swiftly incorporate new knowledge while actively retaining its old knowledge. FL operates through a two-phase training process and can be seamlessly integrated into various CL methods, including replay, parameter regularization, distillation, and dynamic architecture techniques. In designing FL, we use two distinct knowledge bases: one to enhance plasticity and another to improve stability. FL ensures a more balanced model by utilizing both knowledge bases to regularize model updates. Theoretically, we analyze how the FL mechanism enhances the stability-plasticity balance. Empirically, FL demonstrates tangible improvements over baseline methods within the same training budget. By integrating FL into at least one representative baseline from each CL category, we observed an average accuracy improvement of up to 4.91% in Class-Incremental and 3.51% in Task-Incremental settings on standard image classification benchmarks. Additionally, measurements of the stability-to-plasticity ratio confirm that FL effectively enhances this balance. FL also outperforms state-of-the-art CL methods on more challenging datasets like ImageNet. Introduction Our brain excels at learning new information without significantly disrupting or forgetting previously acquired knowledge. In contrast, Deep Neural Networks (DNNs) often struggle with Catastrophic Forgetting (CF) [1], where new information can overwrite and interfere with existing knowledge. This phenomenon poses a significant challenge when continuous learning of new tasks is required. Continual Learning (CL) methods have been developed to mitigate catastrophic forgetting by enabling DNNs to retain prior knowledge while acquiring new skills. Deploying AI models in realistic settings demands that models function effectively when encountering data with varying distributions over time.

Duplicate Docs Excel Report

Title
None found

Similar Docs  Excel Report  more

TitleSimilaritySource
None found