Overcoming Recency Bias of Normalization Statistics in Continual Learning: Balance and Adaptation

Neural Information Processing Systems 

Continual learning entails learning a sequence of tasks and balancing their knowledge appropriately. With limited access to old training samples, much of the current work in deep neural networks has focused on overcoming catastrophic forgetting of old tasks in gradient-based optimization.