AlterSGD: Finding Flat Minima for Continual Learning by Alternative Training