Bilevel Coreset Selection in Continual Learning: A New Formulation and Algorithm

May-25-2025, 06:59:02 GMT–Neural Information Processing Systems

Coreset is a small set that provides a data summary for a large dataset, such that training solely on the small set achieves competitive performance compared with a large dataset. In rehearsal-based continual learning, the coreset is typically used in the memory replay buffer to stand for representative samples in previous tasks, and the coreset selection procedure is typically formulated as a bilevel problem. However, the typical bilevel formulation for coreset selection explicitly performs optimization over discrete decision variables with greedy search, which is computationally expensive. Several works consider other formulations to address this issue, but they ignore the nested nature of bilevel optimization problems and may not solve the bilevel coreset selection problem accurately. To address these issues, we propose a new bilevel formulation, where the inner problem tries to find a model which minimizes the expected training error sampled from a given probability distribution, and the outer problem aims to learn the probability distribution with approximately K (coreset size) nonzero entries such that learned model in the inner problem minimizes the training error over the whole data. To ensure the learned probability has approximately K nonzero entries, we introduce a novel regularizer based on the smoothed top-K loss in the upper problem.

artificial intelligence, machine learning, optimization problem, (13 more...)

Neural Information Processing Systems

May-25-2025, 06:59:02 GMT

Conferences PDF

Add feedback

Genre:
- Research Report > New Finding (0.46)

Industry:
- Education (0.46)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning > Neural Networks
    - Deep Learning (0.93)
  - Representation & Reasoning > Optimization (1.00)

Duplicate Docs Excel Report

Title
Bilevel Coreset Selection in Continual Learning: A New Formulation and Algorithm Jie Hao

Similar Docs Excel Report more

Title	Similarity	Source
None found