SGD vs GD: Rank Deficiency in Linear Networks