Gradient Descent Quantizes ReLU Network Features