Stochastic Gradient Descent-Induced Drift of Representation in a Two-Layer Neural Network

Open in new window