Pruning via Merging: Compressing LLMs via Manifold Alignment Based Layer Merging

Open in new window