AITopics

doi: 10.18653/v1/2022.autosimtrans-1.2

2206.05807

Genre: Research Report (0.40)

Technology:

Information Technology > Artificial Intelligence > Speech > Speech Recognition (0.60)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.60)

#artificialintelligence Jun-14-2022, 07:42:45 GMT

Synthetic Data Is About To Transform Artificial Intelligence - AI Summary

So instead, AV companies developed sophisticated simulation engines to synthetically generate the requisite volume of data and efficiently expose their AI systems to the "long tail" of driving scenarios. These simulated worlds make it possible to automatically produce thousands or millions of permutations of any imaginable driving scenario--e.g., changing the locations of other cars, adding or removing pedestrians, increasing or decreasing vehicle speeds, adjusting the weather, and so on. But it didn't take long for AI entrepreneurs to recognize that the synthetic data capabilities that had been developed for the autonomous vehicle industry could be generalized and applied to a host of other computer vision applications. Founded by AI luminary Raquel Urtasun, who previously ran Uber's AV research efforts, Waabi came out of stealth last year with a star-studded team and over $80 million in funding. Dramatic recent advances in natural language processing (NLP) are opening up virtually unbounded opportunities for value creation across the economy, as previously explored in this column.

language processing, natural language processing, synthetic data, (13 more...)

Technology: Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.36)

#artificialintelligence Jun-7-2022, 09:05:46 GMT

A machine-learning method hallucinates its way to better text translation

As babies, we babble and imitate our way to learning languages. We don't start off reading raw text, which requires fundamental knowledge and understanding about the world, as well as the advanced ability to interpret and infer descriptions and relationships. Rather, humans begin our language journey slowly, by pointing and interacting with our environment, basing our words and perceiving their meaning through the context of the physical and social world. Eventually, we can craft full sentences to communicate complex ideas. Similarly, when humans begin learning and translating into another language, the incorporation of other sensory information, like multimedia, paired with the new and unfamiliar words, like flashcards with images, improves language acquisition and retention. Then, with enough practice, humans can accurately translate new, unseen sentences in context without the accompanying media; however, imagining a picture based on the original text helps.

source sentence, transformer, translation, (15 more...)

Country:

North America > United States > California > San Diego County > San Diego (0.05)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.05)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.77)

#artificialintelligence Jun-7-2022, 00:12:19 GMT

Petuum and Inception Institute for AI Partner for Advanced AI

Petuum, the creator of the world's first composable platform for MLOps, and the Inception Institute for Artificial Intelligence (IIAI), have agreed to partner on the development of revolutionary AI applications. Petuum has recently announced a limited release of the composable platform, which includes the AI OS, Universal Pipelines, Deployment Manager, and Experiment Manager, for select private beta partners. Through the partnership with Petuum, IIAI's enterprise AI/ML teams will operationalize and scale their applications into production. Founded in 2018, IIAI's mission is to build full-stack AI solutions and operating systems for enterprise businesses and developers. Besides being the research arm for G42, IIAI is also empowering stakeholders with AI applications and incubating new technology at the cutting edge of ML innovation.

iiai, inception institute, petuum, (13 more...)

Country:

Asia > Middle East > UAE > Abu Dhabi Emirate > Abu Dhabi (0.17)
Europe > Middle East (0.07)
Africa > Middle East (0.07)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (0.37)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.34)

Sahmoud, Thaer, Mikki, Mohammad

Spam Detection Using BERT

arXiv.org Artificial Intelligence Jun-7-2022

Abstract: Emails and SMS messages are the most popular tools in today's communications, and as the number of email and SMS users increases, the amount of spam increases as well. Spam is any kind of unwanted, unsolicited digital communication that gets sent out in bulk; spam emails and SMS messages cause major resource wastage by unnecessarily flooding network links. Although most spam originates with advertisers looking to push their products, some is much more malicious in intent, such as phishing emails that aim to trick victims into giving up sensitive information like website logins or credit card details. To counter spam, much research effort has gone into building spam detectors that can classify messages and emails as spam or ham. In this research we build a spam detector using the pre-trained BERT model, which classifies emails and messages by understanding their context. We trained our spam detector on multiple corpora, including the SMS collection corpus, Enron corpus, SpamAssassin corpus, Ling-Spam corpus and SMS spam collection corpus; our spam detector's performance was 98.62%, 97.83%, 99.13% and 99.28% respectively.

corpus, dataset, email, (12 more...)

2206.02443

Country:

North America > United States > New York > New York County > New York City (0.04)
Asia > Middle East > Palestine > Gaza Strip > Gaza Governorate > Gaza (0.04)

Genre: Research Report (0.50)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Security & Privacy > Spam Filtering (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.72)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.69)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.68)

#artificialintelligence Jun-6-2022, 20:27:34 GMT

Hallucinating to better text translation

source sentence, transformer, translation, (15 more...)

Country:

North America > United States > Massachusetts > Middlesex County > Cambridge (0.40)
North America > United States > California > San Diego County > San Diego (0.05)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.77)

#artificialintelligence Jun-2-2022, 21:45:06 GMT

Mozilla brings free, offline translation to Firefox – TechCrunch

Mozilla has added an official translation tool to Firefox that doesn't rely on cloud processing to do its work, instead performing the machine learning-based process right on your own computer. It's a huge step forward for a popular service tied strongly to giants like Google and Microsoft. The translation tool, called Firefox Translations, can be added to your browser here. It will need to download some resources the first time it translates a language, and presumably it may download improved models if needed, but the actual translation work is done by your computer, not in a datacenter a couple hundred miles away. This is important not because a lot of people need to translate in their browsers while offline -- like a screen door for a submarine, it's not really a use case that makes sense.

mozilla bring free, offline translation, translation, (10 more...)

Industry: Information Technology (0.34)

Technology: Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)

Burchell, Laurie, Birch, Alexandra, Heafield, Kenneth

Exploring Diversity in Back Translation for Low-Resource Machine Translation

arXiv.org Artificial Intelligence Jun-1-2022

Back translation is one of the most widely used methods for improving the performance of neural machine translation systems. Recent research has sought to enhance the effectiveness of this method by increasing the 'diversity' of the generated translations. We argue that the definitions and metrics used to quantify 'diversity' in previous work have been insufficient. This work puts forward a more nuanced framework for understanding diversity in training data, splitting it into lexical diversity and syntactic diversity. We present novel metrics for measuring these different aspects of diversity and carry out empirical analysis into the effect of these types of diversity on final neural machine translation model performance for low-resource English$\leftrightarrow$Turkish and mid-resource English$\leftrightarrow$Icelandic. Our findings show that generating back translation using nucleus sampling results in higher final model performance, and that this method of generation has high levels of both lexical and syntactic diversity. We also find evidence that lexical diversity is more important than syntactic for back translation performance.
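The abstract above names nucleus sampling as the generation method that worked best for back translation. As a rough illustration only, the following sketch shows the general idea of nucleus (top-p) sampling on a toy next-token distribution: keep the smallest set of most probable tokens whose cumulative probability reaches a threshold, renormalise, and sample from that set. The function names, the `top_p` value, and the toy distribution are assumptions made for this example; they are not taken from the paper, and a real system would apply this at every decoding step of a translation model.

```python
import random


def nucleus_filter(probs, top_p=0.9):
    """Keep the smallest set of top-ranked tokens whose cumulative
    probability reaches top_p, then renormalise so they sum to 1."""
    ranked = sorted(probs.items(), key=lambda kv: kv[1], reverse=True)
    kept = []
    cumulative = 0.0
    for token, p in ranked:
        kept.append((token, p))
        cumulative += p
        if cumulative >= top_p:
            break
    total = sum(p for _, p in kept)
    return {token: p / total for token, p in kept}


def nucleus_sample(probs, top_p=0.9, rng=random):
    """Draw one token from the renormalised nucleus."""
    nucleus = nucleus_filter(probs, top_p)
    tokens = list(nucleus)
    weights = [nucleus[t] for t in tokens]
    return rng.choices(tokens, weights=weights, k=1)[0]


# Hypothetical next-token distribution (illustrative values only).
next_token_probs = {"house": 0.5, "home": 0.3, "building": 0.15, "hut": 0.05}

nucleus = nucleus_filter(next_token_probs, top_p=0.9)
sampled = nucleus_sample(next_token_probs, top_p=0.9, rng=random.Random(0))
```

With these toy values the lowest-probability token falls outside the nucleus, so it can never be sampled, while the remaining tokens are still chosen at random in proportion to their probability. That combination of randomness with a cut-off on the unlikely tail is what distinguishes this approach from both greedy decoding and unrestricted sampling.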

artificial intelligence, computational linguistic, natural language, (15 more...)

doi: 10.18653/v1/2022.deeplo-1.8

2206.00564

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
Europe > Belgium > Brussels-Capital Region > Brussels (0.05)
Europe > Italy > Tuscany > Florence (0.04)
(18 more...)

Genre: Research Report > New Finding (0.68)

Technology: Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)

#artificialintelligence May-31-2022, 20:29:23 GMT

The Potential of AI-Based Machine Translation – The Coin Republic: Cryptocurrency , Bitcoin …

Its blockchain-based AI network combines the efficiency of artificial intelligence with the resolution of human experts, to create data sets that …

coin republic, cryptocurrency

Industry:

Banking & Finance > Trading (0.85)
Media > News (0.69)

Technology:

Information Technology > e-Commerce > Financial Technology (1.00)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.40)

Naeem, Sameea, Rahman, Arif ur, Haider, Syed Mujtaba, Mughal, Abdul Basit

Exploiting Transliterated Words for Finding Similarity in Inter-Language News Articles using Machine Learning

arXiv.org Artificial Intelligence May-29-2022

Finding similarities between two inter-language news articles is a challenging problem in Natural Language Processing (NLP). Because it is difficult to find similar news articles in a language other than the user's native language, there is a need for a Machine Learning based automatic system that finds the similarity between two inter-language news articles. In this article, we propose a Machine Learning model combined with English-Urdu word transliteration that indicates whether an English news article is similar to an Urdu news article. Existing approaches to finding similarities have a major drawback when the archives contain articles in low-resourced languages like Urdu along with English news articles. We used a lexicon to link Urdu and English news articles. Because Urdu language processing applications such as machine translation and text to speech cannot handle English text at the same time, this research proposes a technique to find similarities between English and Urdu news articles based on transliteration.

news article, similarity, transliteration, (15 more...)

2206.1186

Country: Asia > Pakistan > Islamabad Capital Territory > Islamabad (0.05)

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.72)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.69)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.48)