Unveiling the Training Dynamics of ReLU Networks through a Linear Lens

Open in new window