Towards Understanding the Condensation of Two-layer Neural Networks at Initial Training