AITopics | Machine Translation

Collaborating Authors

Machine Translation

"Machine translation (MT) is the application of computers to the task of translating texts from one natural language to another. One of the very earliest pursuits in computer science, MT has proved to be an elusive goal, but today a number of systems are available which produce output which, if not perfect, is of sufficient quality to be useful in a number of specific domains."
– Definition from the European Association for Machine Translation (EAMT).

You can translate text of your choice by using free translators such as: CAPITA, Google Translate, SDL International, SYSTRAN.

News Overviews Instructional Materials AI-Alerts Classics

4ca82782c5372a547c104929f03fe7a9-Paper.pdf

Neural Information Processing SystemsAug-14-2025, 08:49:36 GMT

nasty teacher, skeptical student, student, (15 more...)

Neural Information Processing Systems

Country:

North America > United States > California > Los Angeles County > Los Angeles (0.28)
Asia (0.04)

Genre: Research Report (0.46)

Industry: Education (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.68)

Add feedback

Efficient Speech Translation through Model Compression and Knowledge Distillation

Moslem, Yasmin

arXiv.org Artificial IntelligenceAug-14-2025

Efficient deployment of large audio-language models for speech translation remains challenging due to their significant computational requirements. In this paper, we address this challenge through our system submissions to the "Model Compression" track at the International Conference on Spoken Language Translation (IWSLT 2025). We experiment with a combination of approaches including iterative layer pruning based on layer importance evaluation, low-rank adaptation with 4-bit quantization (QLoRA), and knowledge distillation. In our experiments, we use Qwen2-Audio-7B-Instruct for speech translation into German and Chinese. Our pruned (student) models achieve up to a 50% reduction in both model parameters and storage footprint, while retaining 97-100% of the translation quality of the in-domain (teacher) models.

artificial intelligence, natural language, pruning, (16 more...)

arXiv.org Artificial Intelligence

doi: 10.18653/v1/2025.iwslt-1.40

2505.20237

Country:

Europe (1.00)
North America > United States (0.68)

Genre: Research Report (0.70)

Technology:

Information Technology > Artificial Intelligence > Speech (1.00)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)

Add feedback

Bemba Speech Translation: Exploring a Low-Resource African Language

Farouq, Muhammad Hazim Al, Wassie, Aman Kassahun, Moslem, Yasmin

arXiv.org Artificial IntelligenceAug-14-2025

This paper describes our system submission to the International Conference on Spoken Language Translation (IWSLT 2025), low-resource languages track, namely for Bemba-to-English speech translation. We built cascaded speech translation systems based on Whisper and NLLB-200, and employed data augmentation techniques, such as back-translation. We investigate the effect of using synthetic data and discuss our experimental setup.

artificial intelligence, computational linguistic, natural language, (13 more...)

arXiv.org Artificial Intelligence

doi: 10.18653/v1/2025.iwslt-1.37

2505.02518

Country:

Europe (1.00)
North America > United States (0.29)
Asia > Indonesia (0.28)

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Speech (1.00)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)

Add feedback

TopXGen: Topic-Diverse Parallel Data Generation for Low-Resource Machine Translation

Zebaze, Armel, Sagot, Benoît, Bawden, Rachel

arXiv.org Artificial IntelligenceAug-13-2025

LLMs have been shown to perform well in machine translation (MT) with the use of in-context learning (ICL), rivaling supervised models when translating into high-resource languages (HRLs). However, they lag behind when translating into low-resource language (LRLs). Example selection via similarity search and supervised fine-tuning help. However the improvements they give are limited by the size, quality and diversity of existing parallel datasets. A common technique in low-resource MT is synthetic parallel data creation, the most frequent of which is backtranslation, whereby existing target-side texts are automatically translated into the source language. However, this assumes the existence of good quality and relevant target-side texts, which are not readily available for many LRLs. In this paper, we present \textsc{TopXGen}, an LLM-based approach for the generation of high quality and topic-diverse data in multiple LRLs, which can then be backtranslated to produce useful and diverse parallel texts for ICL and fine-tuning. Our intuition is that while LLMs struggle to translate into LRLs, their ability to translate well into HRLs and their multilinguality enable them to generate good quality, natural-sounding target-side texts, which can be translated well into a high-resource source language. We show that \textsc{TopXGen} boosts LLM translation performance during fine-tuning and in-context learning. Code and outputs are available at https://github.com/ArmelRandy/topxgen.

computational linguistic, large language model, machine learning, (16 more...)

arXiv.org Artificial Intelligence

2508.0868

Country:

Europe (1.00)
North America > United States (0.94)

Genre: Research Report (1.00)

Industry:

Government (0.46)
Education (0.34)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.77)

Add feedback

Fine-grained Video Dubbing Duration Alignment with Segment Supervised Preference Optimization

Cui, Chaoqun, Huang, Liangbin, Wang, Shijing, Tong, Zhe, Huang, Zhaolong, Zeng, Xiao, Liu, Xiaofeng

arXiv.org Artificial IntelligenceAug-13-2025

Video dubbing aims to translate original speech in visual media programs from the source language to the target language, relying on neural machine translation and text-to-speech technologies. Due to varying information densities across languages, target speech often mismatches the source speech duration, causing audio-video synchronization issues that significantly impact viewer experience. In this study, we approach duration alignment in LLM-based video dubbing machine translation as a preference optimization problem. We propose the Segment Supervised Preference Optimization (SSPO) method, which employs a segment-wise sampling strategy and fine-grained loss to mitigate duration mismatches between source and target lines. Experimental results demonstrate that SSPO achieves superior performance in duration alignment tasks.

large language model, machine learning, translation, (18 more...)

arXiv.org Artificial Intelligence

doi: 10.18653/v1/2025.acl-long.227

2508.0855

Genre: Research Report > New Finding (1.00)

Industry: Law (0.67)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Tight Bounds for Schrödinger Potential Estimation in Unpaired Image-to-Image Translation Problems

Puchkin, Nikita, Suchkov, Denis, Naumov, Alexey, Belomestny, Denis

arXiv.org Machine LearningAug-12-2025

Modern methods of generative modelling and unpaired image-to-image translation based on Schrödinger bridges and stochastic optimal control theory aim to transform an initial density to a target one in an optimal way. In the present paper, we assume that we only have access to i.i.d. samples from initial and final distributions. This makes our setup suitable for both generative modelling and unpaired image-to-image translation. Relying on the stochastic optimal control approach, we choose an Ornstein-Uhlenbeck process as the reference one and estimate the corresponding Schrödinger potential. Introducing a risk function as the Kullback-Leibler divergence between couplings, we derive tight bounds on generalization ability of an empirical risk minimizer in a class of Schrödinger potentials including Gaussian mixtures. Thanks to the mixing properties of the Ornstein-Uhlenbeck process, we almost achieve fast rates of convergence up to some logarithmic factors in favourable scenarios. We also illustrate performance of the suggested approach with numerical experiments.

artificial intelligence, machine learning, natural language, (20 more...)

arXiv.org Machine Learning

2508.07392

Country:

Europe > Russia (0.04)
Asia > Russia (0.04)
Europe > Germany (0.04)

Genre: Research Report > New Finding (0.45)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.64)

Add feedback

9th Workshop on Sign Language Translation and Avatar Technologies (SLTAT 2025)

Nunnari, Fabrizio, Jiménez, Cristina Luna, Wolfe, Rosalee, McDonald, John C., Filhol, Michael, Efthimiou, Eleni, Fotinea, Evita, Hanke, Thomas

arXiv.org Artificial IntelligenceAug-12-2025

The Sign Language Translation and Avatar Technology (SLTAT) workshops continue a series of gatherings to share recent advances in improving deaf / human communication through non-invasive means. This 2025 edition, the 9th since its first appearance in 2011, is hosted by the International Conference on Intelligent Virtual Agents (IVA), giving the opportunity for contamination between two research communities, using digital humans as either virtual interpreters or as interactive conversational agents. As presented in this summary paper, SLTAT sees contributions beyond avatar technologies, with a consistent number of submissions on sign language recognition, and other work on data collection, data analysis, tools, ethics, usability, and affective computing.

artificial intelligence, natural language, workshop, (13 more...)

arXiv.org Artificial Intelligence

doi: 10.1145/3742886.3759656

2508.0805

Country:

Europe (1.00)
North America > United States (0.49)

Genre: Instructional Material > Course Syllabus & Notes (0.47)

Industry: Education > Curriculum > Subject-Specific Education (0.97)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.87)
Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (0.70)

Add feedback

Toward Machine Interpreting: Lessons from Human Interpreting Studies

Sperber, Matthias, de Seyssel, Maureen, Bao, Jiajun, Paulik, Matthias

arXiv.org Artificial IntelligenceAug-12-2025

Current speech translation systems, while having achieved impressive accuracies, are rather static in their behavior and do not adapt to real-world situations in ways human interpreters do. In order to improve their practical usefulness and enable interpreting-like experiences, a precise understanding of the nature of human interpreting is crucial. To this end, we discuss human interpreting literature from the perspective of the machine translation field, while considering both operational and qualitative aspects. We identify implications for the development of speech translation systems and argue that there is great potential to adopt many human interpreting principles using recent modeling techniques. We hope that our findings provide inspiration for closing the perceived usability gap, and can motivate progress toward true machine interpreting.

artificial intelligence, machine translation, natural language, (13 more...)

arXiv.org Artificial Intelligence

2508.07964

Country:

Europe (1.00)
Asia > Middle East > UAE (0.46)
North America > United States > Minnesota (0.28)
North America > United States > Florida > Miami-Dade County > Miami (0.14)

Genre: Research Report (1.00)

Industry:

Education (0.93)
Health & Medicine (0.68)

Technology:

Information Technology > Artificial Intelligence > Speech (1.00)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)

Add feedback

SASST: Leveraging Syntax-Aware Chunking and LLMs for Simultaneous Speech Translation

Yang, Zeyu, Wei, Lai, Koshkin, Roman, Chen, Xi, Nakamura, Satoshi

arXiv.org Artificial IntelligenceAug-12-2025

This work proposes a grammar-based chunking strategy that segments input streams into semantically complete units by parsing dependency relations (e.g., noun phrase boundaries, verb-object structures) and punctuation features. The method ensures chunk coherence and minimizes semantic fragmentation. Building on this mechanism, we present SASST (Syntax-Aware Simultaneous Speech Translation), an end-to-end framework integrating frozen Whisper encoder and decoder-only LLM. The unified architecture dynamically outputs translation tokens or symbols to jointly optimize translation timing and content, with target-side reordering addressing word-order divergence. Experiments on CoVoST2 multilingual corpus En-{De, Zh, Ja} demonstrate significant translation quality improvements across languages and validate the effectiveness of syntactic structures in LLM-driven SimulST systems.

large language model, natural language, translation, (16 more...)

arXiv.org Artificial Intelligence

2508.07781

Country: Asia > China (0.28)

Genre: Research Report (0.40)

Technology:

Information Technology > Artificial Intelligence > Speech > Speech Recognition (1.00)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (1.00)

Add feedback

BharatBBQ: A Multilingual Bias Benchmark for Question Answering in the Indian Context

Tomar, Aditya, Sahoo, Nihar Ranjan, Bhattacharyya, Pushpak

arXiv.org Artificial IntelligenceAug-12-2025

Evaluating social biases in language models (LMs) is crucial for ensuring fairness and minimizing the reinforcement of harmful stereotypes in AI systems. Existing benchmarks, such as the Bias Benchmark for Question Answering (BBQ), primarily focus on Western contexts, limiting their applicability to the Indian context. To address this gap, we introduce BharatBBQ, a culturally adapted benchmark designed to assess biases in Hindi, English, Marathi, Bengali, Tamil, Telugu, Odia, and Assamese. BharatBBQ covers 13 social categories, including 3 intersectional groups, reflecting prevalent biases in the Indian sociocultural landscape. Our dataset contains 49,108 examples in one language that are expanded using translation and verification to 392,864 examples in eight different languages. We evaluate five multilingual LM families across zero and few-shot settings, analyzing their bias and stereotypical bias scores. Our findings highlight persistent biases across languages and social categories and often amplified biases in Indian languages compared to English, demonstrating the necessity of linguistically and culturally grounded benchmarks for bias evaluation.

large language model, natural language, question answering, (19 more...)

arXiv.org Artificial Intelligence

2508.0709

Country:

Europe (1.00)
Asia > India (0.29)
North America > Mexico (0.28)

Genre: Research Report > New Finding (0.66)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.98)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.93)
Information Technology > Artificial Intelligence > Natural Language > Question Answering (0.61)

Add feedback