Fast Vocabulary Projection Method via Clustering for Multilingual Machine Translation on GPU

Open in new window