Optimization Insights into Deep Diagonal Linear Networks