AITopics | unsupervised machine translation

A transcompiler, transpiler, or source-to-source compiler, is a translator which converts between programming languages that operate at a similar level of abstraction.

machine learning, programming language, translation, (18 more...)

Neural Information Processing Systems

Country:

Oceania > Australia (0.04)
North America > Canada (0.04)
Europe > France (0.04)

Industry: Banking & Finance (0.68)

Technology:

Information Technology > Software > Programming Languages (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

Reviews: Cross-lingual Language Model Pretraining

Neural Information Processing SystemsJan-26-2025, 20:11:31 GMT

This paper uses three techniques for incorporating multi-lingual (rather than just mono-lingual) information for pretraining contextualised representations: (i) autoregressive language modelling objective (e.g. The methods are evaluated on four tasks: (i) cross-lingual classification (XNLI), (ii) unsupervised machine translation, (iii) supervised machine translation, and (iv) low-resourcce language modelling. These results are important as they showcase the strong benefit of multi-lingual (rather than just mono-lingual) pretraining for multiple important downstream tasks, and achieve new state of the art. Originality: while the methods are not particularly novel (autoregressive and masked language modelling pretraining have both been used before for ELMo and BERT; this work extends these objectives to the multi-lingual case), the performance gains on all four tasks are still very impressive. The empirical results are strong, and the methodology is sound and explained in sufficient technical details. - Clarity: The paper is well-written, makes the connections with the relevant earlier work, and includes important details that can facilitate reproducibility (e.g. the learning rate, number of layers, etc.).

cross-lingual language model pretraining, empirical result, machine translation, (8 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Natural Language (1.00)

Add feedback

Brief Review -- Unsupervised Machine Translation Using Monolingual Corpora Only

#artificialintelligenceOct-15-2022, 15:09:20 GMT

With the use of GAN idea, NMT model can be trained without parallel data, in which I think it is similar to the CycleGAN in image domain. 2013 … 2018 [UMNT] … 2020 [Batch Augment, BA] [GPT-3] [T5]…

brief review, monolingual corpora only, unsupervised machine translation

#artificialintelligence

Genre: Overview (0.40)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.53)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.40)

Add feedback

Refining Low-Resource Unsupervised Translation by Language Disentanglement of Multilingual Model

Nguyen, Xuan-Phi, Joty, Shafiq, Kui, Wu, Aw, Ai Ti

arXiv.org Artificial IntelligenceOct-1-2022

Numerous recent work on unsupervised machine translation (UMT) implies that competent unsupervised translations of low-resource and unrelated languages, such as Nepali or Sinhala, are only possible if the model is trained in a massive multilingual environment, where these low-resource languages are mixed with high-resource counterparts. Nonetheless, while the high-resource languages greatly help kick-start the target low-resource translation tasks, the language discrepancy between them may hinder their further improvement. In this work, we propose a simple refinement procedure to separate languages from a pre-trained multilingual UMT model for it to focus on only the target low-resource task. Our method achieves the state of the art in the fully unsupervised translation tasks of English to Nepali, Sinhala, Gujarati, Latvian, Estonian and Kazakh, with BLEU score gains of 3.5, 3.5, 3.3, 4.1, 4.2, and 3.3, respectively.

machine learning, natural language, translation, (17 more...)

arXiv.org Artificial Intelligence

2205.15544

Country:

Europe > Belgium > Brussels-Capital Region > Brussels (0.04)
Europe > Italy > Tuscany > Florence (0.04)
Asia > China > Hong Kong (0.04)
(7 more...)

Genre: Research Report (0.82)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.92)

Add feedback

Filters

Collaborating Authors

unsupervised machine translation

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

eb1a323fa10d4102ff13422476a744ff-Paper-Conference.pdf

1763ea5a7e72dd7ee64073c2dda7a7a8-Paper.pdf

1763ea5a7e72dd7ee64073c2dda7a7a8-AuthorFeedback.pdf

1763ea5a7e72dd7ee64073c2dda7a7a8-Paper.pdf

1763ea5a7e72dd7ee64073c2dda7a7a8-AuthorFeedback.pdf

Refining Low-Resource Unsupervised Translation by Language Disentanglement of Multilingual Model

Unsupervised Translation of Programming Languages

Reviews: Cross-lingual Language Model Pretraining

Brief Review -- Unsupervised Machine Translation Using Monolingual Corpora Only

Refining Low-Resource Unsupervised Translation by Language Disentanglement of Multilingual Model