How Does Data Diversity Shape the Weight Landscape of Neural Networks?