Understanding Multi-phase Optimization Dynamics and Rich Nonlinear Behaviors of ReLU Networks