MemoryEfficientContinualLearningwith Transformers

Neural Information Processing Systems 

In many real-world scenarios, data to train machine learning models becomes availableovertime.