PyMarian: Fast Neural Machine Translation and Evaluation in Python
Gowda, Thamme, Grundkiewicz, Roman, Rippeth, Elijah, Post, Matt, Junczys-Dowmunt, Marcin
–arXiv.org Artificial Intelligence
The deep learning language of choice these days is Python; measured by factors such as available libraries and technical support, it is hard to beat. At the same time, software written in lower-level programming languages like C++ retain advantages in speed. We describe a Python interface to Marian NMT, a C++-based training and inference toolkit for sequence-to-sequence models, focusing on machine translation. This interface enables models trained with Marian to be connected to the rich, wide range of tools available in Python. A highlight of the interface is the ability to compute state-of-the-art COMET metrics from Python but using Marian's inference engine, with a speedup factor of up to 7.8$\times$ the existing implementations. We also briefly spotlight a number of other integrations, including Jupyter notebooks, connection with prebuilt models, and a web app interface provided with the package. PyMarian is available in PyPI via $\texttt{pip install pymarian}$.
arXiv.org Artificial Intelligence
Aug-14-2024
- Country:
- Asia
- Middle East > UAE
- Abu Dhabi Emirate > Abu Dhabi (0.04)
- Singapore (0.04)
- Middle East > UAE
- Europe
- North America > United States
- Maryland (0.04)
- Minnesota > Hennepin County
- Minneapolis (0.14)
- New York > New York County
- New York City (0.04)
- Oceania > Australia
- Asia
- Genre:
- Research Report (0.40)
- Industry:
- Information Technology > Software (0.34)
- Technology: