AlterSGD: Finding Flat Minima for Continual Learning by Alternative Training

Open in new window