Make Continual Learning Stronger via C-Flat

Neural Information Processing Systems 

How to balance the 'sensitivity-stability' of learning across new-task training and memory preservation is critical in continual learning (CL) to resolve catastrophic forgetting. Improving model generalization within each learning phase is one way to help CL bridge the gap in the joint knowledge space. Zeroth-order loss-landscape sharpness-aware minimization is a strong training regime that improves model generalization in transfer learning compared with optimizers such as SGD, and it has also been introduced into CL to improve memory representation or learning efficiency. However, zeroth-order sharpness alone can favor sharper over flatter minima in certain scenarios, leading to a rather sensitive minimum instead of a global optimum. To further enhance learning stability, we propose a Continual Flatness (C-Flat) method featuring a flatter loss landscape tailored for CL.
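To make the zeroth-order sharpness-aware minimization baseline concrete, below is a minimal sketch of a standard SAM-style update (in the spirit of Foret et al.'s sharpness-aware minimization). This is an illustrative sketch only, not the C-Flat algorithm; the function name `sam_step` and the perturbation radius `rho` are assumptions introduced here for demonstration.

```python
import torch

def sam_step(model, loss_fn, data, target, optimizer, rho=0.05):
    """One SAM-style sharpness-aware update (illustrative sketch,
    not the C-Flat method itself)."""
    # First forward/backward: gradient at the current weights.
    loss = loss_fn(model(data), target)
    loss.backward()

    # Ascent step: perturb weights toward the locally worst-case
    # direction, epsilon = rho * grad / ||grad||.
    grads = [p.grad for p in model.parameters() if p.grad is not None]
    grad_norm = torch.norm(torch.stack([g.norm() for g in grads]))
    perturbations = []
    with torch.no_grad():
        for p in model.parameters():
            if p.grad is None:
                continue
            eps = rho * p.grad / (grad_norm + 1e-12)
            p.add_(eps)
            perturbations.append((p, eps))

    # Second forward/backward: gradient at the perturbed weights,
    # reflecting the zeroth-order sharpness of the loss landscape.
    optimizer.zero_grad()
    loss_fn(model(data), target).backward()

    # Restore the original weights, then descend with the
    # sharpness-aware gradient.
    with torch.no_grad():
        for p, eps in perturbations:
            p.sub_(eps)
    optimizer.step()
    optimizer.zero_grad()
    return loss.item()
```

Because the descent gradient is taken at a worst-case nearby point, this update penalizes sharp minima; per the abstract, C-Flat targets the failure mode where this zeroth-order criterion alone can still settle into sensitive minima.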