Soup to go: mitigating forgetting during continual learning with model averaging