AITopics | Machine Translation

Collaborating Authors

Machine Translation

"Machine translation (MT) is the application of computers to the task of translating texts from one natural language to another. One of the very earliest pursuits in computer science, MT has proved to be an elusive goal, but today a number of systems are available which produce output which, if not perfect, is of sufficient quality to be useful in a number of specific domains."
– Definition from the European Association for Machine Translation (EAMT).

You can translate text of your choice by using free translators such as: CAPITA, Google Translate, SDL International, SYSTRAN.

News Overviews Instructional Materials AI-Alerts Classics

Improving Bilingual Lexicon Induction with Cross-Encoder Reranking

Li, Yaoyiran, Liu, Fangyu, Vulić, Ivan, Korhonen, Anna

arXiv.org Artificial IntelligenceOct-30-2022

Bilingual lexicon induction (BLI) with limited bilingual supervision is a crucial yet challenging task in multilingual NLP. Current state-of-the-art BLI methods rely on the induction of cross-lingual word embeddings (CLWEs) to capture cross-lingual word similarities; such CLWEs are obtained 1) via traditional static models (e.g., VecMap), or 2) by extracting type-level CLWEs from multilingual pretrained language models (mPLMs), or 3) through combining the former two options. In this work, we propose a novel semi-supervised post-hoc reranking method termed BLICEr (BLI with Cross-Encoder Reranking), applicable to any precalculated CLWE space, which improves their BLI capability. The key idea is to 'extract' cross-lingual lexical knowledge from mPLMs, and then combine it with the original CLWEs. This crucial step is done via 1) creating a word similarity dataset, comprising positive word pairs (i.e., true translations) and hard negative pairs induced from the original CLWE space, and then 2) fine-tuning an mPLM (e.g., mBERT or XLM-R) in a cross-encoder manner to predict the similarity scores. At inference, we 3) combine the similarity score from the original CLWE space with the score from the BLI-tuned cross-encoder. BLICEr establishes new state-of-the-art results on two standard BLI benchmarks spanning a wide spectrum of diverse languages: it substantially outperforms a series of strong baselines across the board. We also validate the robustness of BLICEr with different CLWEs.

computational linguistic, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

2210.16953

Country:

Asia > China > Hong Kong (0.04)
Europe > Italy > Tuscany > Florence (0.04)
Europe > Ireland > Leinster > County Dublin > Dublin (0.04)
(12 more...)

Genre:

Research Report (0.64)
Overview (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.68)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.46)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.46)

Add feedback

Leveraging Locality in Abstractive Text Summarization

Liu, Yixin, Ni, Ansong, Nan, Linyong, Deb, Budhaditya, Zhu, Chenguang, Awadallah, Ahmed H., Radev, Dragomir

arXiv.org Artificial IntelligenceOct-30-2022

Neural attention models have achieved significant improvements on many natural language processing tasks. However, the quadratic memory complexity of the self-attention module with respect to the input length hinders their applications in long text summarization. Instead of designing more efficient attention modules, we approach this problem by investigating if models with a restricted context can have competitive performance compared with the memory-efficient attention models that maintain a global context by treating the input as a single sequence. Our model is applied to individual pages which contain parts of inputs grouped by the principle of locality during both encoding and decoding. We empirically investigated three kinds of locality in text summarization at different levels of granularity, ranging from sentences to documents. Our experimental results show that our model has a better performance compared with strong baselines with efficient attention modules, and our analysis provides further insights into our locality-aware modeling strategy.

artificial intelligence, machine translation, natural language, (18 more...)

arXiv.org Artificial Intelligence

2205.12476

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
Europe > Ireland > Leinster > County Dublin > Dublin (0.04)
Asia > China > Hong Kong (0.04)
(18 more...)

Genre: Research Report > New Finding (0.48)

Industry:

Government > Regional Government > North America Government > United States Government (0.93)
Health & Medicine > Health Care Providers & Services (0.68)

Technology: Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.46)

Add feedback

Joint Pre-Training with Speech and Bilingual Text for Direct Speech to Speech Translation

Wei, Kun, Zhou, Long, Zhang, Ziqiang, Chen, Liping, Liu, Shujie, He, Lei, Li, Jinyu, Wei, Furu

arXiv.org Artificial IntelligenceOct-30-2022

Direct speech-to-speech translation (S2ST) is an attractive research topic with many advantages compared to cascaded S2ST. However, direct S2ST suffers from the data scarcity problem because the corpora from speech of the source language to speech of the target language are very rare. To address this issue, we propose in this paper a Speech2S model, which is jointly pre-trained with unpaired speech and bilingual text data for direct speech-to-speech translation tasks. By effectively leveraging the paired text data, Speech2S is capable of modeling the cross-lingual speech conversion from source to target language. We verify the performance of the proposed Speech2S on Europarl-ST and VoxPopuli datasets. Experimental results demonstrate that Speech2S gets an improvement of about 5 BLEU scores compared to encoder-only pre-training models, and achieves a competitive or even better performance than existing state-of-the-art models1.

machine learning, natural language, translation, (19 more...)

arXiv.org Artificial Intelligence

2210.17027

Country:

Asia > Japan > Honshū > Tōhoku > Iwate Prefecture > Morioka (0.04)
Asia > China (0.04)

Genre: Research Report > New Finding (0.48)

Technology:

Information Technology > Artificial Intelligence > Speech > Speech Recognition (1.00)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Artificial Intelligence and Localization

#artificialintelligenceOct-29-2022, 21:05:06 GMT

Artificial intelligence is a game changer for the content industry in general and the language industry in particular, in terms of both technology and services. Unsurprisingly, it has had a vast and continued impact on localization people, processes, and technology. In the early days of artificial intelligence (AI), machine learning (ML), and machine translation (MT) offered a basic and automatic conversion of text, such as glorified versions of online dictionaries and glossaries. ML and MT are now propelled by neural networks that bring them closer to functioning and behaving like human brains. Although AI is getting closer to human behavior, it cannot mimic it fully, and therefore, it has not yet replaced human beings.

artificial intelligence and localization, ml and mt

#artificialintelligence

Technology: Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)

Add feedback

HashFormers: Towards Vocabulary-independent Pre-trained Transformers

Xue, Huiyin, Aletras, Nikolaos

arXiv.org Artificial IntelligenceOct-29-2022

Transformer-based pre-trained language models are vocabulary-dependent, mapping by default each token to its corresponding embedding. This one-to-one mapping results into embedding matrices that occupy a lot of memory (i.e. millions of parameters) and grow linearly with the size of the vocabulary. Previous work on on-device transformers dynamically generate token embeddings on-the-fly without embedding matrices using locality-sensitive hashing over morphological information. These embeddings are subsequently fed into transformer layers for text classification. However, these methods are not pre-trained. Inspired by this line of work, we propose HashFormers, a new family of vocabulary-independent pre-trained transformers that support an unlimited vocabulary (i.e. all possible tokens in a corpus) given a substantially smaller fixed-sized embedding matrix. We achieve this by first introducing computationally cheap hashing functions that bucket together individual tokens to embeddings. We also propose three variants that do not require an embedding matrix at all, further reducing the memory requirements. We empirically demonstrate that HashFormers are more memory efficient compared to standard pre-trained transformers while achieving comparable predictive performance when fine-tuned on multiple text classification tasks. For example, our most efficient HashFormer variant has a negligible performance degradation (0.4\% on GLUE) using only 99.1K parameters for representing the embeddings compared to 12.3-38M parameters of state-of-the-art models.

computational linguistic, machine learning, natural language, (21 more...)

arXiv.org Artificial Intelligence

2210.07904

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
North America > Dominican Republic (0.04)
Europe > Italy > Tuscany > Florence (0.04)
(7 more...)

Genre: Research Report (0.70)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.34)

Add feedback

Improving Natural-Language-based Audio Retrieval with Transfer Learning and Audio & Text Augmentations

Primus, Paul, Widmer, Gerhard

arXiv.org Artificial IntelligenceOct-29-2022

The absence of large labeled datasets remains a significant challenge in many application areas of deep learning. Researchers and practitioners typically resort to transfer learning and data augmentation to alleviate this issue. We study these strategies in the context of audio retrieval with natural language queries (Task 6b of the DCASE 2022 Challenge). Our proposed system uses pre-trained embedding models to project recordings and textual descriptions into a shared audio-caption space in which related examples from different modalities are close. We employ various data augmentation techniques on audio and text inputs and systematically tune their corresponding hyperparameters with sequential model-based optimization. Our results show that the used augmentations strategies reduce overfitting and improve retrieval performance.

artificial intelligence, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

2208.1146

Country:

Europe > France > Grand Est > Meurthe-et-Moselle > Nancy (0.05)
Europe > Austria > Upper Austria (0.04)

Genre: Research Report > New Finding (0.68)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.69)

Add feedback

Meta AI powers spoken-only language translation

#artificialintelligenceOct-28-2022, 17:10:12 GMT

After plans to break physical barriers with his metaverse initiative, Meta CEO Mark Zuckerberg revealed plans for another globe-spanning artificial intelligence (AI) project earlier this year, this time a universal translation tool unlike any other. At the same time, the company that made itself famous (and notorious) for its social media networks also introduced another AI-powered tool, a virtual assistant. Both of these intelligent applications were intended to have practical use cases in Zuckerberg's metaverse, those were their intended uses but they will also have wider business applications that Meta is all too aware of. AI virtual assistants, of course, are already in wider use by organizations as chatbots to handle basic customer requests and interactions across a variety of digital services– including Meta's own popular platforms like Facebook Messenger, Instagram, and WhatsApp Business. The other, less well-known AI use case(s) is the language and translation exercises that provide alternatives to relying on human translators to provide accurate, expert-quality translations in real-time.

ai power spoken-only language translation, meta, translation, (12 more...)

#artificialintelligence

Country:

North America > United States > California (0.05)
Europe > France (0.05)
Asia > Taiwan (0.05)

Genre: Personal > Honors (0.32)

Industry:

Information Technology > Services (0.59)
Government > Regional Government > North America Government > United States Government (0.32)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (0.78)

Add feedback

DiMBERT: Learning Vision-Language Grounded Representations with Disentangled Multimodal-Attention

Liu, Fenglin, Wu, Xian, Ge, Shen, Ren, Xuancheng, Fan, Wei, Sun, Xu, Zou, Yuexian

arXiv.org Artificial IntelligenceOct-28-2022

Vision-and-language (V-L) tasks require the system to understand both vision content and natural language, thus learning fine-grained joint representations of vision and language (a.k.a. V-L representations) is of paramount importance. Recently, various pre-trained V-L models are proposed to learn V-L representations and achieve improved results in many tasks. However, the mainstream models process both vision and language inputs with the same set of attention matrices. As a result, the generated V-L representations are entangled in one common latent space. To tackle this problem, we propose DiMBERT (short for Disentangled Multimodal-Attention BERT), which is a novel framework that applies separated attention spaces for vision and language, and the representations of multi-modalities can thus be disentangled explicitly. To enhance the correlation between vision and language in disentangled spaces, we introduce the visual concepts to DiMBERT which represent visual information in textual format. In this manner, visual concepts help to bridge the gap between the two modalities. We pre-train DiMBERT on a large amount of image-sentence pairs on two tasks: bidirectional language modeling and sequence-to-sequence language modeling. After pre-train, DiMBERT is further fine-tuned for the downstream tasks. Experiments show that DiMBERT sets new state-of-the-art performance on three tasks (over four datasets), including both generation tasks (image captioning and visual storytelling) and classification tasks (referring expressions). The proposed DiM (short for Disentangled Multimodal-Attention) module can be easily incorporated into existing pre-trained V-L models to boost their performance, up to a 5% increase on the representative task. Finally, we conduct a systematic analysis and demonstrate the effectiveness of our DiM and the introduced visual concepts.

artificial intelligence, machine learning, natural language, (14 more...)

arXiv.org Artificial Intelligence

2210.16431

Country:

Asia > China > Heilongjiang Province > Daqing (0.04)
Asia > China > Guangdong Province > Shenzhen (0.04)
Asia > China > Beijing > Beijing (0.04)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

CONQRR: Conversational Query Rewriting for Retrieval with Reinforcement Learning

Wu, Zeqiu, Luan, Yi, Rashkin, Hannah, Reitter, David, Hajishirzi, Hannaneh, Ostendorf, Mari, Tomar, Gaurav Singh

arXiv.org Artificial IntelligenceOct-28-2022

Compared to standard retrieval tasks, passage retrieval for conversational question answering (CQA) poses new challenges in understanding the current user question, as each question needs to be interpreted within the dialogue context. Moreover, it can be expensive to re-train well-established retrievers such as search engines that are originally developed for non-conversational queries. To facilitate their use, we develop a query rewriting model CONQRR that rewrites a conversational question in the context into a standalone question. It is trained with a novel reward function to directly optimize towards retrieval using reinforcement learning and can be adapted to any off-the-shelf retriever. CONQRR achieves state-of-the-art results on a recent open-domain CQA dataset containing conversations from three different sources, and is effective for two different off-the-shelf retrievers. Our extensive analysis also shows the robustness of CONQRR to out-of-domain dialogues as well as to zero query rewriting supervision.

information retrieval, machine learning, reinforcement learning, (21 more...)

arXiv.org Artificial Intelligence

2112.08558

Country:

North America > United States > New York > New York County > New York City (0.04)
North America > United States > Tennessee > Davidson County > Nashville (0.04)
North America > United States > Louisiana > Orleans Parish > New Orleans (0.04)
(8 more...)

Genre: Research Report (0.82)

Industry:

Media (0.46)
Leisure & Entertainment (0.46)

Technology:

Information Technology > Information Management > Search (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (0.88)
(2 more...)

Add feedback

Twist Decoding: Diverse Generators Guide Each Other

Kasai, Jungo, Sakaguchi, Keisuke, Bras, Ronan Le, Peng, Hao, Lu, Ximing, Radev, Dragomir, Choi, Yejin, Smith, Noah A.

arXiv.org Artificial IntelligenceOct-28-2022

Many language generation models are now available for a wide range of generation tasks, including machine translation and summarization. Combining such diverse models may lead to further progress, but ensembling generation models is challenging during inference: conventional ensembling methods (e.g., shallow fusion) require that the models share vocabulary/tokenization schemes. We introduce Twist decoding, a simple and general text generation algorithm that benefits from diverse models at inference time. Our method does not assume the vocabulary, tokenization or even generation order is shared. Our extensive evaluations on machine translation and scientific paper summarization demonstrate that Twist decoding substantially outperforms each model decoded in isolation over various scenarios, including cases where domain-specific and general-purpose models are both available. Twist decoding also consistently outperforms the popular reranking heuristic where output candidates from one model are rescored by another. We hope that our work will encourage researchers and practitioners to examine generation models collectively, not just independently, and to seek out models with complementary strengths to the currently available models. Our code is available at https://github.com/jungokasai/twist_decoding.

machine learning, natural language, translation, (19 more...)

arXiv.org Artificial Intelligence

2205.09273

Country:

North America > United States (0.14)
Asia > Japan > Honshū > Tōhoku (0.04)
Asia > India > Karnataka > Bengaluru (0.04)
Europe > Spain (0.04)

Genre:

Research Report > Strength High (0.68)
Research Report > Experimental Study (0.68)

Industry: Health & Medicine (0.96)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.93)

Add feedback