Binarized Neural Machine Translation

Neural Information Processing Systems 

The rapid scaling of language models is motivating research into low-bitwidth quantization. In this work, we propose BMT, a novel binarization technique for Transformers applied to machine translation, the first of its kind.
