Weight Initialization and Variance Dynamics in Deep Neural Networks and Large Language Models