Training with Multi-Layer Embeddings for Model Reduction