Unveiling the Training Dynamics of ReLU Networks through a Linear Lens