Convergence of continuous-time stochastic gradient descent with applications to linear deep neural networks

Open in new window