Gradient based sample selection for online continual learning
–Neural Information Processing Systems
A continual learning agent learns online with a non-stationary and never-ending stream of data. The key to such learning process is to overcome the catastrophic forgetting of previously seen data, which is a well known problem of neural networks. To prevent forgetting, a replay buffer is usually employed to store the previous data for the purpose of rehearsal. Previous work often depend on task boundary and i.i.d. In this work, we formulate sample selection as a constraint reduction problem based on the constrained optimization view of continual learning.
Neural Information Processing Systems
Oct-11-2024, 03:48:57 GMT