AITopics | iterative self-supervised training

Cross-lingual Retrieval for Iterative Self-Supervised Training (supplementary materials) 1 Experiment details

Neural Information Processing SystemsFeb-7-2026, 14:57:07 GMT

Becauseof the file size limit, we will release the source code and pretrained checkpoints after the anonymity period. To be able to make a fair comparison,we followed the same preprocessingsteps as described in [13]. In each iteration, we mine all90 language pairs in parallel, using8 GPUs for each pair, each pair taking about15 30 hours to finish. We lightly tune the margin score threshold using validation BLEU (using threshold score between 1.04and1.07.) For all experiments, we use Transformerwith 12 layers of encoder and 12 layers of decoder with model dimension of1024 on 16 heads ( 680M parameters). 1 We trained for maximum20,000 steps using label-smoothed cross-entropy loss with 0.2 label smoothing,0.3

artificial intelligence, machine translation, natural language, (12 more...)

Neural Information Processing Systems

Country:

Europe > Bulgaria > Sofia City Province > Sofia (0.05)
Europe > Belgium (0.05)
Asia > Middle East > Saudi Arabia > Northern Borders Province > Arar (0.05)
Asia > China > Hong Kong (0.05)

Technology: Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)

Add feedback

Cross-lingual Retrieval for Iterative Self-Supervised Training

Neural Information Processing SystemsDec-23-2025, 19:26:05 GMT

Recent studies have demonstrated the cross-lingual alignment ability of multilingual pretrained language models. In this work, we found that the cross-lingual alignment can be further improved by training seq2seq models on sentence pairs mined using their own encoder outputs. We utilized these findings to develop a new approach --- cross-lingual retrieval for iterative self-supervised training (CRISS), where mining and training processes are applied iteratively, improving cross-lingual alignment and translation ability at the same time. Using this method, we achieved state-of-the-art unsupervised machine translation results on 9 language directions with an average improvement of 2.4 BLEU, and on the Tatoeba sentence retrieval task in the XTREME benchmark on 16 languages with an average improvement of 21.5% in absolute accuracy. Furthermore, CRISS also brings an additional 1.8 BLEU improvement on average compared to mBART, when finetuned on supervised machine translation downstream tasks.

cross-lingual retrieval, iterative self-supervised training, name change, (3 more...)

Neural Information Processing Systems

Genre: Research Report > New Finding (1.00)

Technology: Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.85)

Add feedback

Cross-lingual Retrieval for Iterative Self-Supervised Training (supplementary materials) 1 Experiment details

Neural Information Processing SystemsOct-2-2025, 06:16:51 GMT

In this section, we describe our experimental procedures in more details including hyperparameters, and intermediate results. For unsupervised machine translation task, we evaluate BLEU scores using multi-bleu.perl

artificial intelligence, machine translation, natural language, (14 more...)

Neural Information Processing Systems

Country:

Europe > Bulgaria (0.14)
Asia > China (0.14)

Technology: Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)

Add feedback

Review for NeurIPS paper: Cross-lingual Retrieval for Iterative Self-Supervised Training

Neural Information Processing SystemsJan-22-2025, 02:01:45 GMT

The paper proposes a novel approach for unsupervised parallel corpus mining and unsupervised machine translation, improving on the SoTA on both tasks by significant margins. Experiments are conducted on the Tatoeba retrieval task and a 25 language translation task based on a combination of a few academic benchmark datasets. Careful experiments to demonstrate how using parallel data from just one language pair significantly improves the cross-lingual embedding alignment in a multilingual de-noising auto-encoder. All reviewers support acceptance, as does the AC. Please make sure to incorporate the clarifications from the author response in the final version of the paper.

cross-lingual retrieval, iterative self-supervised training, neurips paper

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.73)

Add feedback

Cross-lingual Retrieval for Iterative Self-Supervised Training

Neural Information Processing SystemsOct-9-2024, 15:14:25 GMT

Recent studies have demonstrated the cross-lingual alignment ability of multilingual pretrained language models. In this work, we found that the cross-lingual alignment can be further improved by training seq2seq models on sentence pairs mined using their own encoder outputs. We utilized these findings to develop a new approach --- cross-lingual retrieval for iterative self-supervised training (CRISS), where mining and training processes are applied iteratively, improving cross-lingual alignment and translation ability at the same time. Using this method, we achieved state-of-the-art unsupervised machine translation results on 9 language directions with an average improvement of 2.4 BLEU, and on the Tatoeba sentence retrieval task in the XTREME benchmark on 16 languages with an average improvement of 21.5% in absolute accuracy. Furthermore, CRISS also brings an additional 1.8 BLEU improvement on average compared to mBART, when finetuned on supervised machine translation downstream tasks.

average improvement, cross-lingual retrieval, iterative self-supervised training

Neural Information Processing Systems

Genre: Research Report > New Finding (1.00)

Technology: Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.91)

Add feedback

Filters

Collaborating Authors

iterative self-supervised training

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

Cross-lingual Retrieval for Iterative Self-Supervised Training (supplementary materials) 1 Experiment details

Cross-lingual Retrieval for Iterative Self-Supervised Training

Cross-lingual Retrieval for Iterative Self-Supervised Training (supplementary materials) 1 Experiment details

Review for NeurIPS paper: Cross-lingual Retrieval for Iterative Self-Supervised Training

Cross-lingual Retrieval for Iterative Self-Supervised Training