General Cross-Architecture Distillation of Pretrained Language Models into Matrix Embeddings

Open in new window