AITopics

Technology: Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)

Ghai, Bhavya, Hoque, Md Naimul, Mueller, Klaus

WordBias: An Interactive Visual Tool for Discovering Intersectional Biases Encoded in Word Embeddings

arXiv.org Artificial IntelligenceMar-5-2021

Intersectional bias is a bias caused by an overlap of multiple social factors like gender, sexuality, race, disability, religion, etc. A recent study has shown that word embedding models can be laden with biases against intersectional groups like African American females, etc. The first step towards tackling such intersectional biases is to identify them. However, discovering biases against different intersectional groups remains a challenging task. In this work, we present WordBias, an interactive visual tool designed to explore biases against intersectional groups encoded in static word embeddings. Given a pretrained static word embedding, WordBias computes the association of each word along different groups based on race, age, etc. and then visualizes them using a novel interactive interface. Using a case study, we demonstrate how WordBias can help uncover biases against intersectional groups like Black Muslim Males, Poor Females, etc. encoded in word embedding. In addition, we also evaluate our tool using qualitative feedback from expert interviews. The source code for this tool can be publicly accessed for reproducibility at github.com/bhavyaghai/WordBias.

bias score, bias type, subgroup, (10 more...)

2103.03598

Country:

North America > United States > New York > Suffolk County > Stony Brook (0.05)
South America > Chile > Santiago Metropolitan Region > Santiago Province > Santiago (0.04)
Africa > Eswatini > Manzini > Manzini (0.04)

Genre:

Research Report (0.70)
Personal > Interview (0.34)

Industry:

Law Enforcement & Public Safety (0.68)
Law (0.68)
Health & Medicine (0.67)

Technology:

Information Technology > Human Computer Interaction (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Visualization (0.93)
(2 more...)

arXiv.org Artificial IntelligenceMar-4-2021

IOT: Instance-wise Layer Reordering for Transformer Structures

Zhu, Jinhua, Wu, Lijun, Xia, Yingce, Xie, Shufang, Qin, Tao, Zhou, Wengang, Li, Houqiang, Liu, Tie-Yan

With sequentially stacked self-attention, (optional) encoder-decoder attention, and feed-forward layers, Transformer achieves big success in natural language processing (NLP), and many variants have been proposed. Currently, almost all these models assume that the layer order is fixed and kept the same across data samples. We observe that different data samples actually favor different orders of the layers. Based on this observation, in this work, we break the assumption of the fixed layer order in the Transformer and introduce instance-wise layer reordering into the model structure. Our Instance-wise Ordered Transformer (IOT) can model variant functions by reordered layers, which enables each sample to select the better one to improve the model performance under the constraint of almost the same number of parameters. To achieve this, we introduce a light predictor with negligible parameter and inference cost to decide the most capable and favorable layer order for any input sequence. Experiments on 3 tasks (neural machine translation, abstractive summarization, and code generation) and 9 datasets demonstrate consistent improvements of our method. We further show that our method can also be applied to other architectures beyond Transformer. Our code is released at Github.

decoder, iot, transformer, (15 more...)

2103.03457

Country: Asia > China (0.04)

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

arXiv.org Artificial IntelligenceMar-4-2021

An empirical analysis of phrase-based and neural machine translation

Ghader, Hamidreza

Two popular types of machine translation (MT) are phrase-based and neural machine translation systems. Both of these types of systems are composed of multiple complex models or layers. Each of these models and layers learns different linguistic aspects of the source language. However, for some of these models and layers, it is not clear which linguistic phenomena are learned or how this information is learned. For phrase-based MT systems, it is often clear what information is learned by each model, and the question is rather how this information is learned, especially for its phrase reordering model. For neural machine translation systems, the situation is even more complex, since for many cases it is not exactly clear what information is learned and how it is learned. To shed light on what linguistic phenomena are captured by MT systems, we analyze the behavior of important models in both phrase-based and neural MT systems. We consider phrase reordering models from phrase-based MT systems to investigate which words from inside of a phrase have the biggest impact on defining the phrase reordering behavior. Additionally, to contribute to the interpretability of neural MT systems we study the behavior of the attention model, which is a key component in neural MT systems and the closest model in functionality to phrase reordering models in phrase-based systems. The attention model together with the encoder hidden state representations form the main components to encode source side linguistic information in neural MT. To this end, we also analyze the information captured in the encoder hidden state representations of a neural MT system. We investigate the extent to which syntactic and lexical-semantic information from the source side is captured by hidden state representations of different neural MT architectures.

empirical methods, phrase-based machine translation, semantic information, (17 more...)

2103.03108

Country:

North America > United States > Texas > Travis County > Austin (0.14)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.14)
North America > United States > California > Los Angeles County > Los Angeles (0.13)
(36 more...)

Genre: Research Report > New Finding (1.00)

Industry: Government (0.93)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.67)

#artificialintelligenceMar-2-2021, 21:00:01 GMT

AbbVie Accelerates Natural Language Processing

AbbVie is a research-based biopharmaceutical company that serves more than 30 million patients in 175 countries. With its global scale, AbbVie partnered with Intel to optimize processes for its more than 47,000 employees. This whitepaper highlights two use cases that are important to AbbVie's research. The first is Abbelfish Machine Translation, AbbVie's language translation service based on the Transformer NLP model, that leverages second-generation Intel Xeon Scalable processors and the Intel Optimization for TensorFlow with Intel oneAPI Deep Neural Network Library (oneDNN). AbbVie was able to achieve a 1.9x improvement in throughput for Abbelfish language translation using Intel Optimization for TensorFlow 1.15 with oneAPI Deep Neural Network Library when compared to TensorFlow 1.15 without oneDNN.1

abbvie accelerate natural language processing, oneapi deep neural network library, tensorflow 1, (5 more...)

Industry: Health & Medicine > Pharmaceuticals & Biotechnology (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.86)

#artificialintelligenceMar-1-2021, 20:45:39 GMT

LTI Value Cast: Reshaping Remote Work & Meetings with Linguistic AI

The abrupt move to an almost exclusive home-based working environment at the start of the Covid19 crisis resulted in a new work ethic: back-to-back web calls from your living room or kitchen, across various web conferencing systems, and requiring to handle multilanguage interactions. The onsite in-person meetings were facilitated by the help of interpreters, traditional meeting notes redaction, and lengthy post-meeting analysis and review. In the new virtual environment, it is up to advanced language technologies powered by Artificial Intelligence to solve these issues. Speech to text, neural machine translation and hybrid natural language understanding will automate complex human tasks and replace the more repetitive processes, creating a „digital work companion" that can assist in the next fast-paced remote working environment challenges.

linguistic ai, lti value, reshaping remote work

Industry: Health & Medicine > Therapeutic Area > Psychiatry/Psychology > Mental Health (0.40)

Technology: Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)

arXiv.org Artificial IntelligenceMar-1-2021

OmniNet: Omnidirectional Representations from Transformers

Tay, Yi, Dehghani, Mostafa, Aribandi, Vamsi, Gupta, Jai, Pham, Philip, Qin, Zhen, Bahri, Dara, Juan, Da-Cheng, Metzler, Donald

This paper proposes Omnidirectional Representations from Transformers (OmniNet). In OmniNet, instead of maintaining a strictly horizontal receptive field, each token is allowed to attend to all tokens in the entire network. This process can also be interpreted as a form of extreme or intensive attention mechanism that has the receptive field of the entire width and depth of the network. To this end, the omnidirectional attention is learned via a meta-learner, which is essentially another self-attention based model. In order to mitigate the computationally expensive costs of full receptive field attention, we leverage efficient self-attention models such as kernel-based (Choromanski et al.), low-rank attention (Wang et al.) and/or Big Bird (Zaheer et al.) as the meta-learner. Extensive experiments are conducted on autoregressive language modeling (LM1B, C4), Machine Translation, Long Range Arena (LRA), and Image Recognition. The experiments show that OmniNet achieves considerable improvements across these tasks, including achieving state-of-the-art performance on LM1B, WMT'14 En-De/En-Fr, and Long Range Arena. Moreover, using omnidirectional representation in Vision Transformers leads to significant improvements on image recognition tasks on both few-shot learning and fine-tuning setups.

omnidirectional representation, omninet, representation, (13 more...)

2103.01075

Country:

North America > United States > California > Santa Clara County > Stanford (0.04)
Europe > Netherlands > North Holland > Amsterdam (0.04)
Europe > Belgium > Brussels-Capital Region > Brussels (0.04)

Genre: Research Report (1.00)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Pattern Recognition (0.55)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.50)

#artificialintelligenceFeb-26-2021, 13:25:09 GMT

AI Incident Database Spotlights Worst Machine Translation Fails

In the ongoing popular (albeit shallow) debate pitting human translators against machine translation (MT), one constant is the question of quality -- how to define it, how to measure it, and how to improve it. Now, a new website, the AI Incident Database (AIID), aims to quantify the risks presented, and actual harm caused, by AI. Sean McGregor, ML architect at Syntiant and developer of the AIID, described the "collective memory of [AI systems'] failings" in a November 2020 paper. As McGregor explained, the AIID is a project of the Partnership on AI (PAI), an organization funded by tech companies and governed by a board comprising corporate partners and non-profits. The AIID is modeled on incident databases in other industries, namely aviation and cybersecurity, which promote transparency.

incident, spotlight worst machine translation fail, translation, (7 more...)

Country: North America > Mexico (0.06)

Industry:

Information Technology (0.72)
Law Enforcement & Public Safety > Crime Prevention & Enforcement (0.32)

Technology: Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)

Jiang, Nan, Lutellier, Thibaud, Tan, Lin

CURE: Code-Aware Neural Machine Translation for Automatic Program Repair

arXiv.org Artificial IntelligenceFeb-26-2021

Automatic program repair (APR) is crucial to improve software reliability. Recently, neural machine translation (NMT) techniques have been used to fix software bugs automatically. While promising, these approaches have two major limitations. Their search space often does not contain the correct fix, and their search strategy ignores software knowledge such as strict code syntax. Due to these limitations, existing NMT-based techniques underperform the best template-based approaches. We propose CURE, a new NMT-based APR technique with three major novelties. First, CURE pre-trains a programming language (PL) model on a large software codebase to learn developer-like source code before the APR task. Second, CURE designs a new code-aware search strategy that finds more correct fixes by focusing on compilable patches and patches that are close in length to the buggy code. Finally, CURE uses a subword tokenization technique to generate a smaller search space that contains more correct fixes. Our evaluation on two widely-used benchmarks shows that CURE correctly fixes 57 Defects4J bugs and 26 QuixBugs bugs, outperforming all existing APR techniques on both benchmarks.

buggy line, correct fix, sequence, (16 more...)

2103.00073

Country:

North America > United States > New York > New York County > New York City (0.04)
North America > Canada > Ontario > Waterloo Region > Waterloo (0.04)

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

#artificialintelligenceFeb-25-2021, 09:43:02 GMT

Even Small Companies Use AI, Machine Learning

Data, technology, and people are at hand to make artificial intelligence and machine learning available to all commerce companies. To be certain, artificial intelligence and its sub-field, machine learning, have gone through cycles of inflated expectations followed by disappointments. For example, in the 1950s and 1960s, the United States government funded research for the machine translation of languages. The hope was that Russian-language documents could be instantly translated to English. But by 1966, a report from the Automatic Language Processing Advisory Committee, a government team of seven scientists, essentially killed machine translation research in the U.S. for about a decade.

ai-ml, data scientist, deverter, (9 more...)

Country: North America > United States (1.00)

Industry: Government > Regional Government > North America Government > United States Government (0.77)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.73)