Towards understanding epoch-wise double descent in two-layer linear neural networks