On a continuous time model of gradient descent dynamics and instability in deep learning
