Beyond Signal Propagation: Is Feature Diversity Necessary in Deep Neural Network Initialization?