AITopics

Country: Asia > China (0.05)

Industry:

Information Technology > Services (1.00)
Health & Medicine (1.00)
Transportation > Ground > Road (0.35)

Technology:

Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.96)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.70)
(2 more...)

Zhang, Jing, Tao, Dacheng

Empowering Things with Intelligence: A Survey of the Progress, Challenges, and Opportunities in Artificial Intelligence of Things

arXiv.org Artificial IntelligenceNov-17-2020

In the Internet of Things (IoT) era, billions of sensors and devices collect and process data from the environment, transmit them to cloud centers, and receive feedback via the internet for connectivity and perception. However, transmitting massive amounts of heterogeneous data, perceiving complex environments from these data, and then making smart decisions in a timely manner are difficult. Artificial intelligence (AI), especially deep learning, is now a proven success in various areas including computer vision, speech recognition, and natural language processing. AI introduced into the IoT heralds the era of artificial intelligence of things (AIoT). This paper presents a comprehensive survey on AIoT to show how AI can empower the IoT to make it faster, smarter, greener, and safer. Specifically, we briefly present the AIoT architecture in the context of cloud computing, fog computing, and edge computing. Then, we present progress in AI research for IoT from four perspectives: perceiving, learning, reasoning, and behaving. Next, we summarize some promising applications of AIoT that are likely to profoundly reshape our world. Finally, we highlight the challenges facing AIoT and some potential research opportunities.

application, learning, proceedings, (14 more...)

2011.08612

Country:

Oceania > Australia (0.04)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
North America > United States > Illinois (0.04)
North America > United States > California > Santa Clara County > Palo Alto (0.04)

Genre:

Overview (1.00)
Research Report > Experimental Study (0.45)

Industry:

Transportation > Ground > Road (1.00)
Information Technology > Security & Privacy (1.00)
Food & Agriculture > Agriculture (1.00)
(8 more...)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Data Science > Data Mining (1.00)
Information Technology > Communications > Networks > Sensor Networks (1.00)
(12 more...)

Pasunuru, Ramakanth, Guo, Han, Bansal, Mohit

DORB: Dynamically Optimizing Multiple Rewards with Bandits

arXiv.org Artificial IntelligenceNov-15-2020

Policy gradients-based reinforcement learning has proven to be a promising approach for directly optimizing non-differentiable evaluation metrics for language generation tasks. However, optimizing for a specific metric reward leads to improvements in mostly that metric only, suggesting that the model is gaming the formulation of that metric in a particular way without often achieving real qualitative improvements. Hence, it is more beneficial to make the model optimize multiple diverse metric rewards jointly. While appealing, this is challenging because one needs to manually decide the importance and scaling weights of these metric rewards. Further, it is important to consider using a dynamic combination and curriculum of metric rewards that flexibly changes over time. Considering the above aspects, in our work, we automate the optimization of multiple metric rewards simultaneously via a multi-armed bandit approach (DORB), where at each round, the bandit chooses which metric reward to optimize next, based on expected arm gains. We use the Exp3 algorithm for bandits and formulate two approaches for bandit rewards: (1) Single Multi-reward Bandit (SM-Bandit); (2) Hierarchical Multi-reward Bandit (HM-Bandit). We empirically show the effectiveness of our approaches via various automatic metrics and human evaluation on two important NLG tasks: question generation and data-to-text generation, including on an unseen-test transfer setup. Finally, we present interpretable analyses of the learned bandit curriculum over the optimized rewards.

bandit, bansal, proceedings, (13 more...)

2011.07635

Country:

North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
North America > United States > New York (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
(2 more...)

Genre: Research Report > Promising Solution (0.66)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.94)
Information Technology > Data Science > Data Mining > Big Data (0.69)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.67)

#artificialintelligenceNov-14-2020, 14:31:09 GMT

Will AI and Machine Learning Be the Future of the Translation Industry?

In the year 2020, it may seem natural to receive a meaningful translation from Google Translator, when some of us can still remember the times when it required correction every time you tried to translate more than three words altogether. This is the example of changes we tend to overlook as unpretentious users, but there is a lot of hard work behind them. While processing data, the neural network doesn't just follow some algorithm but finds ways of solving the problems and, in fact, learns to solve them. And the more tasks it solves, the better it copes with them. This similarity with a principle of human brain functioning is the reason to name neural networks an artificial intelligence (AI).

translation, translation industry, translator, (3 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.50)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.47)

#artificialintelligenceNov-13-2020, 09:10:16 GMT

The Future of Artificial Intelligence: Language, Ethics, Technology

Established at the University of Cambridge in 2001, the Centre for Research in the Arts, Social Sciences and Humanities (CRASSH) works actively with the Schools and Faculties across the University undertaking collaborations that cross faculties and disciplines in order to stimulate fresh thinking and dialogue in and beyond the humanities and social sciences and to reach out to new collaborators and new publics.

artificial intelligence, marcus tomalin, workshop, (12 more...)

Country: Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.27)

Industry:

Information Technology > Security & Privacy (0.31)
Government (0.30)

Technology:

Information Technology > Communications > Social Media (0.50)
Information Technology > Artificial Intelligence > The Future (0.41)
Information Technology > Artificial Intelligence > Issues > Social & Ethical Issues (0.41)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.30)

Farkas, Anna, Németh, Renáta

How to Measure Gender Bias in Machine Translation: Optimal Translators, Multiple Reference Points

arXiv.org Machine LearningNov-12-2020

In this paper--as a case study--we present a systematic study of gender bias in machine translation with Google Translate. We translated sentences containing names of occupations from Hungarian, a language with gender-neutral pronouns, into English. Our aim was to present a fair measure for bias by comparing the translations to an optimal non-biased translator. When assessing bias, we used the following reference points: (1) the distribution of men and women among occupations in both the source and the target language countries, as well as (2) the results of a Hungarian survey that examined if certain jobs are generally perceived as feminine or masculine. We also studied how expanding sentences with adjectives referring to occupations effect the gender of the translated pronouns. As a result, we found bias against both genders, but biased results against women are much more frequent. Translations are closer to our perception of occupations than to objective occupational statistics. Finally, occupations have a greater effect on translation than adjectives.

artificial intelligence, natural language, occupation, (17 more...)

arXiv.org Machine Learning

2011.06445

Country:

Europe > Hungary > Budapest > Budapest (0.04)
North America > United States > Washington > King County > Seattle (0.04)
North America > United States > New York > New York County > New York City (0.04)
(3 more...)

Genre:

Questionnaire & Opinion Survey (1.00)
Research Report > New Finding (0.93)

Industry:

Law (1.00)
Health & Medicine (0.68)
Government > Regional Government > North America Government > United States Government (0.68)

Technology: Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)

AIHubNov-9-2020, 09:50:52 GMT

On learning language-invariant representations for universal machine translation

Despite the recent improvements in neural machine translation (NMT), training a large NMT model with hundreds of millions of parameters usually requires a collection of parallel corpora at a large scale, on the order of millions or even billions of aligned sentences for supervised training (Arivazhagan et al.). While it might be possible to automatically crawl the web to collect parallel sentences for high-resource language pairs, such as German-English and French-English, it is often infeasible or expensive to manually translate large amounts of sentences for low-resource language pairs, such as Nepali-English, Sinhala-English, etc. To this end, the goal of the so-called multilingual universal machine translation, a.k.a., universal machine translation (UMT), is to learn to translate between any pair of languages using a single system, given pairs of translated documents for some of these languages. The hope is that by learning a shared "semantic space" between multiple source and target languages, the model can leverage language-invariant structure from high-resource translation pairs to transfer to the translation between low-resource language pairs, or even enable zero-shot translation. Indeed, training such a single massively multilingual model has gained impressive empirical results, especially in the case of low-resource language pairs (see Figure 1).

language-invariant representation, representation, translation, (16 more...)

AIHub

Technology: Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)

#artificialintelligenceNov-7-2020, 11:10:16 GMT

gordicaleksa/pytorch-original-transformer

This repo contains PyTorch implementation of the original transformer paper ( Vaswani et al.). It's aimed at making it easy to start playing and learning about transformers. Important note: I'll be adding a jupyter notebook soon as well! Transformers were originally proposed by Vaswani et al. in a seminal paper called Attention Is All You Need. You probably heard of transformers one way or another. GPT-3 and BERT to name a few well known ones .

gordicaleksa pytorch-original-transformer, transformer, translation, (13 more...)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.92)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.71)

Han, Lifeng, Jones, Gareth, Smeaton, Alan

AlphaMWE: Construction of Multilingual Parallel Corpora with MWE Annotations

arXiv.org Artificial IntelligenceNov-7-2020

In this work, we present the construction of multilingual parallel corpora with annotation of multiword expressions (MWEs). MWEs include verbal MWEs (vMWEs) defined in the PARSEME shared task that have a verb as the head of the studied terms. The annotated vMWEs are also bilingually and multilingually aligned manually. The languages covered include English, Chinese, Polish, and German. Our original English corpus is taken from the PARSEME shared task in 2018. We performed machine translation of this source corpus followed by human post editing and annotation of target MWEs. Strict quality control was applied for error limitation, i.e., each MT output sentence received first manual post editing and annotation plus second manual quality rechecking. One of our findings during corpora preparation is that accurate translation of MWEs presents challenges to MT systems. To facilitate further MT research, we present a categorisation of the error types encountered by MT systems in performing MWE related translation. To acquire a broader view of MT issues, we selected four popular state-of-the-art MT models for comparisons namely: Microsoft Bing Translator, GoogleMT, Baidu Fanyi and DeepL MT. Because of the noise removal, translation post editing and MWE annotation by human professionals, we believe our AlphaMWE dataset will be an asset for cross-lingual and multilingual research, such as MT and information extraction. Our multilingual corpora are available as open access at github.com/poethan/AlphaMWE.

corpus, mwe, translation, (16 more...)

2011.03783

Country:

Europe > Ireland > Leinster > County Dublin > Dublin (0.14)
Asia > Middle East > Palestine (0.14)
South America > Uruguay > Maldonado > Maldonado (0.04)
(18 more...)

Genre: Research Report (0.70)

Technology: Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)

Mishra, Abhijit, Chowdhury, Md Faisal Mahbub, Manohar, Sagar, Gutfreund, Dan, Sankaranarayanan, Karthik

Template Controllable keywords-to-text Generation

arXiv.org Artificial IntelligenceNov-7-2020

This paper proposes a novel neural model for the understudied task of generating text from keywords. The model takes as input a set of un-ordered keywords, and part-of-speech (POS) based template instructions. This makes it ideal for surface realization in any NLG setup. The framework is based on the encode-attend-decode paradigm, where keywords and templates are encoded first, and the decoder judiciously attends over the contexts derived from the encoded keywords and templates to generate the sentences. Training exploits weak supervision, as the model trains on a large amount of labeled data with keywords and POS based templates prepared through completely automatic means. Qualitative and quantitative performance analyses on publicly available test-data in various domains reveal our system's superiority over baselines, built using state-of-the-art neural machine translation and controllable transfer techniques. Our approach is indifferent to the order of input keywords.

computational linguistic, keyword, template, (15 more...)

2011.03722

Country:

Europe > Denmark > Capital Region > Copenhagen (0.04)
Asia > China > Hong Kong (0.04)
Europe > Italy > Tuscany > Florence (0.04)
(7 more...)

Genre: Research Report (0.50)

Industry: Media > Film (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (0.70)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)