AITopics

2209.12325

Country:

Asia > India (0.05)
North America > Dominican Republic (0.04)
Europe > Switzerland > Zürich > Zürich (0.04)
(11 more...)

Genre: Research Report > New Finding (0.68)

Industry:

Law > Statutes (0.46)
Law > Criminal Law (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.67)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.46)

#artificialintelligence, Sep-23-2022, 07:45:22 GMT

Artificial intelligence is a discipline that attempts to simulate human intelligence. The field covers technologies ranging from the relatively simple to the more complex, which lets AI address problems from automated machine translation to high-level reasoning. These technologies include Machine Learning, Natural Language Processing, Knowledge Representation, Probabilistic Reasoning, Logic Programming, Expert Systems, and Genetic Programming. The complexity of AI is often misunderstood by those who have never worked in the field.

social media platform, user experience, wide range, (11 more...)

#artificialintelligence

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.39)

Xia, Youya, Monica, Josephine, Chao, Wei-Lun, Hariharan, Bharath, Weinberger, Kilian Q, Campbell, Mark

Image-to-Image Translation for Autonomous Driving from Coarsely-Aligned Image Pairs

arXiv.org Artificial Intelligence, Sep-23-2022

A self-driving car must be able to reliably handle adverse weather conditions (e.g., snowy) to operate safely. In this paper, we investigate the idea of turning sensor inputs (i.e., images) captured in an adverse condition into a benign one (i.e., sunny), upon which the downstream tasks (e.g., semantic segmentation) can attain high accuracy. Prior work primarily formulates this as an unpaired image-to-image translation problem due to the lack of paired images captured under the exact same camera poses and semantic layouts. While perfectly-aligned images are not available, one can easily obtain coarsely-paired images. For instance, many people drive the same routes daily in both good and adverse weather; thus, images captured at close-by GPS locations can form a pair. Though data from repeated traversals are unlikely to capture the same foreground objects, we posit that they provide rich contextual information to supervise the image translation model. To this end, we propose a novel training objective leveraging coarsely-aligned image pairs. We show that our coarsely-aligned training scheme leads to better image translation quality and improves downstream tasks such as semantic segmentation, monocular depth estimation, and visual localization.
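The pairing idea in this abstract (images taken at close-by GPS locations form a coarse pair) can be illustrated with a small sketch. This is not the authors' pipeline: the record layout `(image_id, lat, lon)`, the function names, and the 5-metre threshold are all assumptions made for illustration.

```python
import math

def haversine_m(lat1, lon1, lat2, lon2):
    """Great-circle distance in metres between two (lat, lon) points in degrees."""
    r = 6371000.0  # mean Earth radius in metres
    p1, p2 = math.radians(lat1), math.radians(lat2)
    dphi = p2 - p1
    dlmb = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(p1) * math.cos(p2) * math.sin(dlmb / 2) ** 2
    return 2 * r * math.asin(math.sqrt(a))

def coarse_pairs(adverse, benign, max_dist_m=5.0):
    """Pair each adverse-weather image with its nearest benign-weather image.

    `adverse` and `benign` are lists of (image_id, lat, lon) tuples
    (a hypothetical layout). A pair is kept only if the nearest benign
    image lies within `max_dist_m` metres (an assumed threshold).
    """
    pairs = []
    for a_id, a_lat, a_lon in adverse:
        best_id, best_d = None, float("inf")
        for b_id, b_lat, b_lon in benign:
            d = haversine_m(a_lat, a_lon, b_lat, b_lon)
            if d < best_d:
                best_id, best_d = b_id, d
        if best_id is not None and best_d <= max_dist_m:
            pairs.append((a_id, best_id))
    return pairs
```

A real system would likely also use heading and a spatial index rather than a quadratic scan; the sketch only shows why such pairs are cheap to collect from repeated traversals.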

machine learning, natural language, translation, (19 more...)

2209.11673

Country:

North America > United States > Ohio (0.04)
Europe > Italy > Calabria > Catanzaro Province > Catanzaro (0.04)
Europe > Germany > Saarland > Saarbrücken (0.04)

Genre: Research Report (0.40)

Industry:

Transportation > Ground > Road (0.71)
Information Technology > Robotics & Automation (0.71)
Automobiles & Trucks (0.71)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.68)

arXiv.org Artificial Intelligence, Sep-23-2022

Zero-shot Domain Adaptation for Neural Machine Translation with Retrieved Phrase-level Prompts

Sun, Zewei, Jiang, Qingnan, Huang, Shujian, Cao, Jun, Cheng, Shanbo, Wang, Mingxuan

Domain adaptation is an important challenge for neural machine translation. However, the traditional fine-tuning solution requires multiple rounds of extra training and incurs a high cost. In this paper, we propose a non-tuning paradigm, resolving domain adaptation with a prompt-based method. Specifically, we construct a bilingual phrase-level database and retrieve relevant pairs from it as a prompt for the input sentences. By utilizing Retrieved Phrase-level Prompts (RePP), we effectively boost the translation quality. Experiments show that our method improves domain-specific machine translation by 6.2 BLEU and improves the accuracy of translation constraints by 11.5%, without additional training.
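As a rough illustration of retrieving phrase pairs and using them as a prompt, the sketch below matches source phrases by substring and prepends the matched pairs to the input. The abstract does not describe the retrieval method or the prompt format, so the substring matching, the `=` and `|||` separators, and all function names here are assumptions rather than the RePP implementation.

```python
def retrieve_phrase_pairs(sentence, phrase_db, max_pairs=3):
    """Return up to `max_pairs` (source, target) phrase pairs whose source
    side occurs in `sentence`, longest source phrase first.

    `phrase_db` is a plain dict from source phrase to target phrase
    (a stand-in for the bilingual phrase-level database).
    """
    lowered = sentence.lower()
    hits = [(src, tgt) for src, tgt in phrase_db.items() if src.lower() in lowered]
    hits.sort(key=lambda pair: len(pair[0]), reverse=True)
    return hits[:max_pairs]

def build_prompted_input(sentence, phrase_db, sep=" ||| "):
    """Prepend retrieved phrase pairs to the sentence as a textual prompt.

    The prompt layout is hypothetical; a sentence with no matches is
    returned unchanged.
    """
    pairs = retrieve_phrase_pairs(sentence, phrase_db)
    if not pairs:
        return sentence
    prompt = " ; ".join(f"{src} = {tgt}" for src, tgt in pairs)
    return f"{prompt}{sep}{sentence}"
```

The prompted string would then be fed to a translation model that has learned to read such prompts; that model is outside the scope of this sketch.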

artificial intelligence, natural language, translation, (15 more...)

2209.11409

Country:

Europe (0.04)
Asia > China > Jiangsu Province > Nanjing (0.04)

Genre: Research Report (0.50)

Industry: Information Technology > Security & Privacy (0.46)

Technology: Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)

Semantically Consistent Data Augmentation for Neural Machine Translation via Conditional Masked Language Model

Cheng, Qiao, Huang, Jin, Duan, Yitao

This paper introduces a new data augmentation method for neural machine translation that can enforce stronger semantic consistency both within and across languages. Our method is based on a Conditional Masked Language Model (CMLM), which is bidirectional and can be conditioned on both left and right context, as well as the label. We demonstrate that CMLM is a good technique for generating context-dependent word distributions. In particular, we show that CMLM is capable of enforcing semantic consistency by conditioning on both source and target during substitution. In addition, to enhance diversity, we incorporate the idea of soft word substitution for data augmentation, which replaces a word with a probability distribution over the vocabulary. Experiments on four translation datasets of different scales show that the overall solution results in more realistic data augmentation and better translation quality. Our approach consistently achieves the best performance in comparison with strong and recent works and yields improvements of up to 1.90 BLEU points over the baseline.
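Soft word substitution, as described in this abstract, replaces a word with a distribution over the vocabulary. One common way to realize this is to feed the model the expected embedding under that distribution; the sketch below shows only that step. In the paper the distribution would come from the CMLM; here it is supplied by hand, and the function name and toy embedding table are assumptions.

```python
def soft_embedding(probs, embedding_table):
    """Expected embedding under a distribution over the vocabulary.

    `probs[i]` is the probability of vocabulary item i and
    `embedding_table[i]` is its embedding vector (a list of floats).
    Returns sum_i probs[i] * embedding_table[i].
    """
    if len(probs) != len(embedding_table):
        raise ValueError("probs and embedding_table must have the same length")
    if abs(sum(probs) - 1.0) > 1e-6:
        raise ValueError("probs must sum to 1")
    dim = len(embedding_table[0])
    out = [0.0] * dim
    for p, vec in zip(probs, embedding_table):
        for i in range(dim):
            out[i] += p * vec[i]
    return out
```

A one-hot `probs` recovers the ordinary embedding lookup, so hard substitution is a special case of this soft form.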

artificial intelligence, machine translation, natural language, (13 more...)

2209.10875

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
Europe > Belgium (0.04)
Asia > Thailand > Bangkok > Bangkok (0.04)
(3 more...)

Genre: Research Report (1.00)

Technology: Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)

Approaching English-Polish Machine Translation Quality Assessment with Neural-based Methods

Nowakowski, Artur

This paper presents our contribution to the PolEval 2021 Task 2: Evaluation of translation quality assessment metrics. We describe experiments with pre-trained language models and state-of-the-art frameworks for translation quality assessment in both nonblind and blind versions of the task. Our solutions ranked second in the nonblind version and third in the blind version.

artificial intelligence, computational linguistic, natural language, (11 more...)

2209.11016

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
Oceania > Australia > Victoria > Melbourne (0.05)
Europe > Portugal > Lisbon > Lisbon (0.05)
(7 more...)

Genre: Research Report (0.50)

Technology: Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)

Wei, Yizhen, Utsuro, Takehito, Nagata, Masaaki

Extending Word-Level Quality Estimation for Post-Editing Assistance

We define a novel concept called extended word alignment in order to improve post-editing assistance efficiency. Based on extended word alignment, we further propose a novel task called refined word-level QE that outputs refined tags and word-level correspondences. Compared to original word-level QE, the new task is able to directly point out editing operations, thus improving efficiency. To extract extended word alignment, we adopt a supervised method based on mBERT. To solve refined word-level QE, we first predict original QE tags by training a regression model for sequence tagging based on mBERT and XLM-R. Then, we refine original word tags with extended word alignment. In addition, we extract source-gap correspondences and, in doing so, obtain gap tags. Experiments on two language pairs show the feasibility of our method and suggest directions for further improvement.

artificial intelligence, machine learning, natural language, (18 more...)

2209.11378

Country:

North America > United States (0.04)
Asia > Japan > Honshū > Kantō > Ibaraki Prefecture > Tsukuba (0.04)
Asia > Japan > Honshū > Kansai > Kyoto Prefecture > Kyoto (0.04)
Africa > Middle East > Egypt > Giza Governorate > Giza (0.04)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.67)

Sagare, Shivprasad, Abhishek, Tushar, Singh, Bhavyajeet, Sharma, Anubhav, Gupta, Manish, Varma, Vasudeva

XF2T: Cross-lingual Fact-to-Text Generation for Low-Resource Languages

Multiple business scenarios require an automated generation of descriptive human-readable text from structured input data. Hence, fact-to-text generation systems have been developed for various downstream tasks like generating soccer reports, weather and financial reports, medical reports, person biographies, etc. Unfortunately, previous work on fact-to-text (F2T) generation has focused primarily on English, mainly due to the high availability of relevant datasets. Only recently, the problem of cross-lingual fact-to-text (XF2T) was proposed for generation across multiple languages, along with a dataset, XALIGN, for eight languages. However, there has been no rigorous work on the actual XF2T generation problem. We extend the XALIGN dataset with annotated data for four more languages: Punjabi, Malayalam, Assamese and Oriya. We conduct an extensive study using popular Transformer-based text generation models on our extended multi-lingual dataset, which we call XALIGNV2. Further, we investigate the performance of different text generation strategies: multiple variations of pretraining, fact-aware embeddings and structure-aware input encoding. Our extensive experiments show that a multi-lingual mT5 model which uses fact-aware embeddings with structure-aware input encoding leads to the best results on average across the twelve languages. We make our code, dataset and model publicly available, and hope that this will help advance further research in this critical area.

artificial intelligence, machine learning, natural language, (21 more...)

2209.11252

Country:

Asia > India > Uttar Pradesh (0.04)
Asia > India > Gujarat (0.04)
Africa > South Africa (0.04)
(8 more...)

Genre: Research Report (0.50)

Industry:

Leisure & Entertainment > Sports (1.00)
Government (0.94)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.68)
Information Technology > Artificial Intelligence > Natural Language > Generation (0.49)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.35)

Slate, Sep-21-2022, 09:50:00 GMT

Too Much Trust in Machine Translation Could Have Deadly Consequences

Imagine you are in a foreign country where you don't speak the language and your small child unexpectedly starts to have a fever seizure. You take them to the hospital, and the doctors use an online translator to let you know that your kid is going to be OK. But "your child is having a seizure" accidentally comes up in your mother tongue as "your child is dead." This specific example is a very real possibility, according to a 2014 study published in the British Medical Journal about the limited usefulness of AI-powered machine translation in communications between patients and doctors.

google translate, machine translation, translation, (14 more...)

Slate

Country:

North America > United States > Kansas (0.05)
North America > United States > Arizona (0.05)
Europe > Denmark > Capital Region > Copenhagen (0.05)
(2 more...)

Genre: Research Report > New Finding (0.49)

Industry:

Information Technology (0.96)
Government (0.96)
Law Enforcement & Public Safety (0.69)
Health & Medicine > Health Care Providers & Services (0.35)

Technology: Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)

Khanmohammadi, Reza, Mirshafiee, Mitra Sadat, Jouryabi, Yazdan Rezaee, Mirroshandel, Seyed Abolghasem

Prose2Poem: The Blessing of Transformers in Translating Prose to Persian Poetry

arXiv.org Artificial Intelligence, Sep-21-2022

Persian Poetry has consistently expressed its philosophy, wisdom, speech, and rationale on the basis of its couplets, making it an enigmatic language on its own to both native and non-native speakers. Nevertheless, the noticeable gap between Persian prose and poem has left the two pieces of literature medium-less. Having curated a parallel corpus of prose and their equivalent poems, we introduce a novel Neural Machine Translation (NMT) approach to translate prose to ancient Persian poetry using transformer-based Language Models in an extremely low-resource setting. More specifically, we trained a Transformer model from scratch to obtain initial translations and pretrained different variations of BERT to obtain final translations. To address the challenge of using masked language modelling under poeticness criteria, we heuristically joined the two models and generated valid poems in terms of automatic and human assessments. Final results demonstrate the eligibility and creativity of our novel heuristically aided approach among Literature professionals and non-professionals in generating novel Persian poems.

artificial intelligence, machine learning, natural language, (15 more...)

2109.14934

Country:

Asia > Middle East > Iran (0.05)
North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)
Oceania > Australia > Victoria > Melbourne (0.04)
(8 more...)

Genre: Research Report > New Finding (0.66)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)