Goto

Collaborating Authors

 Machine Translation


De-mystifying AI and its potential for further application in a B2B context

#artificialintelligence

AI, or Artificial Intelligence, is often demonised and portrayed as some cyborg entity just about ready to take our jobs and eventually kill us all, but more and more businesses, martech and adtech providers are using different AI subsystems each day to advance their services. The term AI is contentiously used to describe a broad spectrum of systems and software's, the controversy arises from where we can begin to describe a machine as being'intelligent' opposed to simply following complex but nonetheless human-reliant algorithms. Regardless of strict definition, there are helpful systems within the subsets of AI which already exist that B2B marketers need to utilise. Machine learning is a subset of AI that can help marketers to improve productivity by taking over mundane tasks, particularly work involving dissecting datasets (like our Argus platform for example). If you're not already using some forms of machine learning, it might be helpful to understand why some sytstems have been reported to increase the productivity of business by 40% (Source: Accenture) and how you can effectively incorporate machine learning into your marketing strategy.


Machine Learning โ€“ Introduction to Quick and Accurate Machine Translation Vinod Sharma's Blog

#artificialintelligence

This tool helps to translate one language to another with high accuracy. This post will focus high level arguments around machine translation only to you can find out more details on Machine Learning Basics here. Machine translation (MT) is an automated translation process used by a computer application to translate a natural language text into another. In the translation process, the meaning of the source text must be already stored in the destination i.e. target language. Sounds simple, but on the surface floor, it is far more complex.


Top 10 Applications of Machine Learning Daily Life Applications Edureka

#artificialintelligence

Machine Learning is a buzzword in the technology world right now and for good reason, it represents a major step forward in how computers can learn. The need for Machine Learning Engineers are high in demand and this surge is due to evolving technology and generation of huge amounts of data aka Big Data. On an Average, an ML Engineer can expect a salary of โ‚น719,646 (IND) or $111,490 (US). So, let's discuss some of the Applications of Machine Learning. I'll be discussing the following Applications of Machine Learning one by one: Now, Google Maps is probably THE app we use whenever we go out and require assistance in directions and traffic.


A Survey of Cross-lingual Word Embedding Models

Journal of Artificial Intelligence Research

Cross-lingual representations of words enable us to reason about word meaning in multilingual contexts and are a key facilitator of cross-lingual transfer when developing natural language processing models for low-resource languages. In this survey, we provide a comprehensive typology of cross-lingual word embedding models. We compare their data requirements and objective functions. The recurring theme of the survey is that many of the models presented in the literature optimize for the same objectives, and that seemingly different models are often equivalent, modulo optimization strategies, hyper-parameters, and such. We also discuss the different ways cross-lingual word embeddings are evaluated, as well as future challenges and research horizons.


Atlas: A Dataset and Benchmark for E-commerce Clothing Product Categorization

arXiv.org Machine Learning

In E-commerce, it is a common practice to organize the product catalog using product taxonomy. This enables the buyer to easily locate the item they are looking for and also to explore various items available under a category. Product taxonomy is a tree structure with 3 or more levels of depth and several leaf nodes. Product categorization is a large scale classification task that assigns a category path to a particular product. Research in this area is restricted by the unavailability of good real-world datasets and the variations in taxonomy due to the absence of a standard across the different e-commerce stores. In this paper, we introduce a high-quality product taxonomy dataset focusing on clothing products which contain 186,150 images under clothing category with 3 levels and 52 leaf nodes in the taxonomy. We explain the methodology used to collect and label this dataset. Further, we establish the benchmark by comparing image classification and Attention based Sequence models for predicting the category path. Our benchmark model reaches a micro f-score of 0.92 on the test set. The dataset, code and pre-trained models are publicly available at \url{https://github.com/vumaasha/atlas}. We invite the community to improve upon these baselines.


On the Variance of the Adaptive Learning Rate and Beyond

arXiv.org Machine Learning

The learning rate warmup heuristic achieves remarkable success in stabilizing training, accelerating convergence and improving generalization for adaptive stochastic optimization algorithms like RMSprop and Adam. Here, we study its mechanism in details. Pursuing the theory behind warmup, we identify a problem of the adaptive learning rate (i.e., it has problematically large variance in the early stage), suggest warmup works as a variance reduction technique, and provide both empirical and theoretical evidence to verify our hypothesis. We further propose RAdam, a new variant of Adam, by introducing a term to rectify the variance of the adaptive learning rate. Extensive experimental results on image classification, language modeling, and neural machine translation verify our intuition and demonstrate the effectiveness and robustness of our proposed method. All implementations are available at: https://github.com/LiyuanLucasLiu/RAdam.


Tourists to Japan are fueling a boom in personal translation devices

The Japan Times

Takehiko Fujita wouldn't be able to do his job selling eye drops and pain relievers without his pocket translator. Instead of an app, language dictionary or call-in translation service, the clerk in a Japanese drugstore uses Pocketalk, a ยฅ25,000 ($230) device made by Sourcenext Corp. that looks like an oval puck. The gadget translates phrases to and from 74 languages, helping Fujita communicate with customers from Sweden, Vietnam and other countries. Tourists are flooding into Japan, with 31 million people visiting the archipelago in 2018, triple the number six years earlier, according to the Japan National Tourism Organization. Businesses are struggling with visitors looking to shop, eat and move around -- a situation that will probably worsen during next year's Tokyo Olympics.


What Are The Risks And Benefits Of Artificial Intelligence?

#artificialintelligence

What are the risks and benefits of artificial intelligence? It's a complicated topic, but I'll try to unpack a few key points here. Let's start with a quick definition: AI is the simulation of human intelligence by machines. Example of AI systems used regularly in developed countries include Amazon's Alexa, smart replies in Gmail, Chatbots, predictive searches in Google, and recommendations. At a baseline level, AI helps improve our everyday lives by solving pain points, streamlining processes, and advancing human knowledge.


Invariance-based Adversarial Attack on Neural Machine Translation Systems

arXiv.org Machine Learning

Abstract--Recently, NLP models have been shown to be susceptible to adversarial attacks. In this paper, we explore adve rsarial attacks on neural machine translation (NMT) systems. Given a sentence in the source language, the goal of the proposed att ack is to change multiple words while ensuring that the predicte d translation remains unchanged. In order to choose the word from the source vocabulary, we propose a soft-attention bas ed technique. The experiments are conducted on two language pa irs: English-German (en-de) and English-French (en-fr) and two state-of-the-art NMT systems: BLSTM-based encoder-decod er with attention and Transformer . The proposed soft-attenti on based technique outperforms existing methods like HotFlip by a significant margin for all the conducted experiments The res ults demonstrate that state-of-the-art NMT systems are unable t o capture the semantics of the source language.


Self-Knowledge Distillation in Natural Language Processing

arXiv.org Machine Learning

Since deep learning became a key player in natural language processing (NLP), many deep learning models have been showing remarkable performances in a variety of NLP tasks, and in some cases, they are even outperforming humans. Such high performance can be explained by efficient knowledge representation of deep learning models. While many methods have been proposed to learn more efficient representation, knowledge distillation from pretrained deep networks suggest that we can use more information from the soft target probability to train other neural networks. In this paper, we propose a new knowledge distillation method self-knowledge distillation, based on the soft target probabilities of the training model itself, where multimode information is distilled from the word embedding space right below the softmax layer. Due to the time complexity, our method approximates the soft target probabilities. In experiments, we applied the proposed method to two different and fundamental NLP tasks: language model and neural machine translation. The experiment results show that our proposed method improves performance on the tasks.