AITopics

2105.14462

Country:

Europe > Belgium (0.04)
Asia > China > Hong Kong (0.04)
North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)
(5 more...)

Genre: Research Report > New Finding (0.34)

Technology: Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)

#artificialintelligenceMay-28-2021

The Future of Computational Linguistics: On Beyond Alchemy

Over the decades, fashions in Computational Linguistics have changed again and again, with major shifts in motivations, methods and applications. When digital computers first appeared, linguistic analysis adopted the new methods of information theory, which accorded well with the ideas that dominated psychology and philosophy. Then came formal language theory and the idea of AI as applied logic, in sync with the development of cognitive science. That was followed by a revival of 1950s-style empiricism—AI as applied statistics—which in turn was followed by the age of deep nets. There are signs that the climate is changing again, and we offer some thoughts about paths forward, especially for younger researchers who will soon be the leaders.

computational linguistic, lecun, younger researcher, (15 more...)

Country:

North America > United States > District of Columbia > Washington (0.14)
North America > United States > California > San Francisco County > San Francisco (0.14)
Asia > China (0.05)
(23 more...)

Genre:

Research Report (0.93)
Personal (0.67)

Industry:

Leisure & Entertainment (1.00)
Information Technology > Security & Privacy (1.00)
Government > Military (1.00)
(4 more...)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Cognitive Science (1.00)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.93)
Information Technology > Artificial Intelligence > Representation & Reasoning > Semantic Networks (0.68)

arXiv.org Artificial IntelligenceMay-28-2021

Data Augmentation for Text Generation Without Any Augmented Data

Bi, Wei, Li, Huayang, Huang, Jiacheng

Data augmentation is an effective way to improve the performance of many neural text generation models. However, current data augmentation methods need to define or choose proper data mapping functions that map the original samples into the augmented samples. In this work, we derive an objective to formulate the problem of data augmentation on text generation tasks without any use of augmented data constructed by specific mapping functions. Our proposed objective can be efficiently optimized and applied to popular loss functions on text generation tasks with a convergence rate guarantee. Experiments on five datasets of two text generation tasks show that our approach can approximate or even surpass popular data augmentation methods.

data augmentation, mapping function, proceedings, (14 more...)

2105.1365

Country:

North America > United States > California > Santa Clara County > Palo Alto (0.04)
Asia > Myanmar > Tanintharyi Region > Dawei (0.04)
Asia > China > Guangdong Province > Shenzhen (0.04)

Genre: Research Report > New Finding (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.95)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

arXiv.org Artificial IntelligenceMay-27-2021

Selective Knowledge Distillation for Neural Machine Translation

Wang, Fusheng, Yan, Jianhao, Meng, Fandong, Zhou, Jie

Neural Machine Translation (NMT) models achieve state-of-the-art performance on many translation benchmarks. As an active research field in NMT, knowledge distillation is widely applied to enhance the model's performance by transferring teacher model's knowledge on each training sample. However, previous work rarely discusses the different impacts and connections among these samples, which serve as the medium for transferring teacher knowledge. In this paper, we design a novel protocol that can effectively analyze the different impacts of samples by comparing various samples' partitions. Based on above protocol, we conduct extensive experiments and find that the teacher's knowledge is not the more, the better. Knowledge over specific samples may even hurt the whole performance of knowledge distillation. Finally, to address these issues, we propose two simple yet effective strategies, i.e., batch-level and global-level selections, to pick suitable samples for distillation. We evaluate our approaches on two large-scale machine translation tasks, WMT'14 English->German and WMT'19 Chinese->English. Experimental results show that our approaches yield up to +1.28 and +0.89 BLEU points improvements over the Transformer baseline, respectively.

distillation, knowledge, knowledge distillation, (15 more...)

2105.12967

Country:

Asia > China (0.04)
Europe > Italy > Tuscany > Florence (0.04)
Europe > Germany > Berlin (0.04)

Genre: Research Report > New Finding (1.00)

Industry: Education (0.71)

Technology: Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)

arXiv.org Artificial IntelligenceMay-24-2021

Project CodeNet: A Large-Scale AI for Code Dataset for Learning a Diversity of Coding Tasks

Puri, Ruchir, Kung, David S., Janssen, Geert, Zhang, Wei, Domeniconi, Giacomo, Zolotov, Vladmir, Dolby, Julian, Chen, Jie, Choudhury, Mihir, Decker, Lindsey, Thost, Veronika, Buratti, Luca, Pujar, Saurabh, Finkler, Ulrich

Advancements in deep learning and machine learning algorithms have enabled breakthrough progress in computer vision, speech recognition, natural language processing and beyond. In addition, over the last several decades, software has been built into the fabric of every aspect of our society. Together, these two trends have generated new interest in the fast-emerging research area of AI for Code. As software development becomes ubiquitous across all industries and code infrastructure of enterprise legacy applications ages, it is more critical than ever to increase software development productivity and modernize legacy applications. Over the last decade, datasets like ImageNet, with its large scale and diversity, have played a pivotal role in algorithmic advancements from computer vision to language and speech understanding. In this paper, we present Project CodeNet, a first-of-its-kind, very large scale, diverse, and high-quality dataset to accelerate the algorithmic advancements in AI for Code. It consists of 14M code samples and about 500M lines of code in 55 different programming languages. Project CodeNet is not only unique in its scale, but also in the diversity of coding tasks it can help benchmark: from code similarity and classification for advances in code recommendation algorithms, and code translation between a large variety programming languages, to advances in code performance (both runtime, and memory) improvement techniques. CodeNet also provides sample input and output test sets for over 7M code samples, which can be critical for determining code equivalence in different languages. As a usability feature, we provide several preprocessing tools in Project CodeNet to transform source codes into representations that can be readily used as inputs into machine learning models.

code sample, dataset, submission, (15 more...)

2105.12655

Country:

North America > United States > New York > New York County > New York City (0.04)
North America > United States > California > Orange County > Irvine (0.04)

Genre: Research Report (0.64)

Industry: Information Technology > Security & Privacy (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

#artificialintelligenceMay-22-2021, 15:25:07 GMT

How AI Is Accelerating Business Growth and Innovation

Despite the many ominous connotations trumpeted in works of fiction, the adoption and growth of AI can is simply another phase of the technological advance that has marked the development of human society. Yet, because we associate intelligence with living creatures, particularly our own species, the idea of machines that possess that faculty excites some trepidation. AI agents may turn out to be as unpredictable and perverse as any intelligent human. No such worry is evident in Silicon Valley. Sundar Pichai, Google's chief, speaking at the World Economic Forum in Davos, Switzerland, enthused about the technology: "AI is probably the most important thing humanity has ever worked on. I think of it as something more profound than electricity or fire," he said. Google is a major participant in an AI market that is clipping along at a five-year compound annual growth rate (CAGR) of 17.5%. Globally, the industry is projected to swell to $554.3 billion by 2024. Other players of note are IBM, Intuit, Microsoft, OpenText, Palantir, SAS, and Slack.

intelligence, learning, occupation, (13 more...)

Country:

North America > United States > California (0.24)
Europe > Switzerland (0.24)
Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)
Europe > Slovenia > Drava > Municipality of Benedikt > Benedikt (0.04)

Genre: Personal (0.68)

Industry:

Information Technology (1.00)
Education > Educational Setting > Online (1.00)
Banking & Finance (1.00)
(2 more...)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.97)
Information Technology > Enterprise Applications > Human Resources > Learning Management (0.70)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.69)

Yin, Kayo, Fernandes, Patrick, Pruthi, Danish, Chaudhary, Aditi, Martins, André F. T., Neubig, Graham

Do Context-Aware Translation Models Pay the Right Attention?

arXiv.org Artificial IntelligenceMay-21-2021

Context-aware machine translation models are designed to leverage contextual information, but often fail to do so. As a result, they inaccurately disambiguate pronouns and polysemous words that require context for resolution. In this paper, we ask several questions: What contexts do human translators use to resolve ambiguous words? Are models paying large amounts of attention to the same context? What if we explicitly train them to do so? To answer these questions, we introduce SCAT (Supporting Context for Ambiguous Translations), a new English-French dataset comprising supporting context words for 14K translations that professional translators found useful for pronoun disambiguation. Using SCAT, we perform an in-depth analysis of the context used to disambiguate, examining positional and lexical characteristics of the supporting words. Furthermore, we measure the degree of alignment between the model's attention scores and the supporting context from SCAT, and apply a guided attention strategy to encourage agreement between the two.

computational linguistic, proceedings, translation, (14 more...)

2105.06977

Country:

Europe > Portugal > Lisbon > Lisbon (0.14)
North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.14)
North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
(17 more...)

Genre:

Research Report > Experimental Study (0.68)
Research Report > New Finding (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

#artificialintelligenceMay-19-2021, 19:10:17 GMT

Neural Machine Translation using a Seq2Seq Architecture and Attention (ENG to POR)

Neural Machine Translation (NMT) is an end-to-end learning approach for automated translation [1]. Its strength comes from the fact that it learns the mapping directly from input text to associated output text. It has been proven to be more effective than traditional phrase-based machine translation, which requires much more effort to design the model. On the other hand, NMT models are costly to train, especially on large-scale translation datasets. They are also significantly slower at inference time due to the large number of parameters used.

cell state, rnn, translation, (12 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.35)

#artificialintelligenceMay-16-2021, 18:15:57 GMT

Can Artificial Intelligence Create its Own Language?

Back in 2017, the media was frenzying over Facebook's decision to scrap one of its artificial intelligence engines, which was said to have created its own language that could not be understood by humans. AI can sometimes be quirky and suspicious. In the current world technology is not a luxury, but a necessity. AI is one of the most popular of them, which has successfully aided several innovations across all industries. There have been many speculations around this disruptive technology and it has often been looked upon with fear.

artificial intelligence create, negotiation, own language, (11 more...)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.31)

The Japan TimesMay-14-2021, 03:40:08 GMT

Fujitsu releases hands-free speech translation service

Fujitsu Ltd. on Thursday released a multilingual speech translation service that does not require users to operate devices by hand. The service is designed for settings in which multilingual communication is needed amid a rise in the domestic population of non-Japanese speakers, such as medical facilities. It automatically translates speech after identifying the voices and locations of users on the basis of sound picked up by directional microphones connected to tablet devices. Fujitsu said that the voice recognition is highly accurate thanks to technology limiting the effects of background noise. In addition to medical settings, the service is expected to be used at tourist sites.

artificial intelligence, machine translation, natural language, (6 more...)

The Japan Times

Country: Asia > Myanmar (0.08)

Industry: Law > Civil Rights & Constitutional Law (0.40)

Technology:

Information Technology > Artificial Intelligence > Speech > Speech Recognition (1.00)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.68)