Uncovering Symmetry Transfer in Large Language Models via Layer-Peeled Optimization
