Remember the Past: Distilling Datasets into Addressable Memories for Neural Networks

Neural Information Processing Systems 

Deep learning methods have made large strides in building task-specific models, but are shown to easily forget past knowledge when learning new tasks [4, 5].