Layer Swapping for Zero-Shot Cross-Lingual Transfer in Large Language Models

Open in new window