AITopics

2003.02556

Country:

North America > Canada (0.28)
Europe > France (0.28)
North America > United States > New York (0.14)
(7 more...)

Genre: Research Report > New Finding (0.34)

Industry:

Information Technology (0.48)
Materials > Chemicals > Industrial Gases > Liquified Gas (0.46)
Materials > Chemicals > Commodity Chemicals > Petrochemicals > LNG (0.46)
Energy > Oil & Gas > Midstream (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.93)

Sequeira, Lucas Nunes, Moreschi, Bruno, Cozman, Fabio Gagliardi, Fontes, Bernardo

An Empirical Accuracy Law for Sequential Machine Translation: the Case of Google Translate

arXiv.org Machine LearningMar-5-2020

We have established, through empirical testing, a law that relates the number of translating hops to translation accuracy in sequential machine translation in Google Translate. Both accuracy and size decrease with the number of hops; the former displays a decrease closely following a power law. Such a law allows one to predict the behavior of translation chains that may be built as society increasingly depends on automated devices.

accuracy, machine translation, translation, (11 more...)

2003.02817

Country:

South America > Brazil > São Paulo (0.04)
South America > Brazil > Santa Catarina (0.04)
North America > Mexico (0.04)
Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)

Genre: Research Report (0.64)

Technology: Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)

Livne, Micha, Swersky, Kevin, Fleet, David J.

SentenceMIM: A Latent Variable Language Model

arXiv.org Machine LearningMar-5-2020

We introduce sentenceMIM, a probabilistic auto-encoder for language modelling, trained with Mutual Information Machine (MIM) learning. Previous attempts to learn variational auto-encoders for language data have had mixed success, with empirical performance well below state-of-the-art auto-regressive models, a key barrier being the occurrence of posterior collapse with VAEs. The recently proposed MIM framework encourages high mutual information between observations and latent variables, and is more robust against posterior collapse. This paper formulates a MIM model for text data, along with a corresponding learning algorithm. We demonstrate excellent perplexity (PPL) results on several datasets, and show that the framework learns a rich latent space, allowing for interpolation between sentences of different lengths with a fixed-dimensional latent representation. We also demonstrate the versatility of sentenceMIM by utilizing a trained model for question-answering, a transfer learning task, without fine-tuning. To the best of our knowledge, this is the first latent variable model (LVM) for text modelling that achieves competitive performance with non-LVM models.

eos, reconstruction, smim, (14 more...)

2003.02645

Country:

North America > Canada > Ontario > Toronto (0.14)
Europe > Spain (0.06)
South America (0.04)
(14 more...)

Genre: Research Report (0.40)

Industry:

Leisure & Entertainment > Sports (0.67)
Consumer Products & Services > Food, Beverage, Tobacco & Cannabis (0.67)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

#artificialintelligenceMar-4-2020, 10:39:36 GMT

Caspar.AI Named to the 2020 CB Insights AI 100 List of Most Innovative Artificial Intelligence Startups

Caspar.AI honored for achievements in AI Technology for Real Estate CB Insights today named Caspar.AI to the fourth annual AI 100 ranking, showcasing the 100 most promising private artificial intelligence companies in the world. Featured in CB Insights real-estate AI category, Caspar is reshaping the real estate industry to allow for smart, sustainable design that both improves the residents' living experience and reduces overall costs for property management. Real estate developers partner with Caspar to build differentiated smart properties, drive additional revenue, save costs, and enhance resident experience. "We are delighted to be awarded as the top AI company for real estate," said Dr. Ashutosh Saxena, Founder & CEO of Caspar.AI & Former Faculty in the Department of Computer Science at Cornell University. "People spend two-thirds of their time at home. There is a massive opportunity for AI to reimagine how people live in their homes. Our Caspar Sense and Caspar Adapt technology, understand the resident activities and automatically adapts home to their preferences. "It's been remarkable to see the success of the companies named to the Artificial Intelligence 100 over the last four years.

ai 100, caspar, innovative artificial intelligence startup, (8 more...)

#artificialintelligence

Country:

South America > Chile (0.06)
North America > United States > New York (0.06)
North America > United States > Nevada > Clark County > Las Vegas (0.06)
(5 more...)

Genre: Press Release (0.34)

Industry: Banking & Finance > Real Estate (1.00)

Technology: Information Technology > Artificial Intelligence (1.00)

Bardozzo, Francesco, De La Osa, Borja, Horanska, Lubomira, Fumanal-Idocin, Javier, Priscoli, Mattia delli, Troiano, Luigi, Tagliaferri, Roberto, Fernandez, Javier, Bustince, Humberto

Adaptive binarization based on fuzzy integrals

Adaptive binarization methodologies threshold the intensity of the pixels with respect to adjacent pixels exploiting the integral images. In turn, the integral images are generally computed optimally using the summed-area-table algorithm (SAT). This document presents a new adaptive binarization technique based on fuzzy integral images through an efficient design of a modified SAT for fuzzy integrals. We define this new methodology as FLAT (Fuzzy Local Adaptive Thresholding). The experimental results show that the proposed methodology have produced an image quality thresholding often better than traditional algorithms and saliency neural networks. We propose a new generalization of the Sugeno and CF 1,2 integrals to improve existing results with an efficient integral image computation. Therefore, these new generalized fuzzy integrals can be used as a tool for grayscale processing in real-time and deep-learning applications. Index Terms: Image Thresholding, Image Processing, Fuzzy Integrals, Aggregation Functions

algorithm, binarization, university, (16 more...)

2003.08755

Country:

Europe > Spain > Navarre > Pamplona (0.04)
Europe > Slovakia > Bratislava > Bratislava (0.04)
Europe > Italy (0.04)
(7 more...)

Genre: Research Report > New Finding (0.34)

Industry:

Health & Medicine > Pharmaceuticals & Biotechnology (0.67)
Health & Medicine > Therapeutic Area (0.46)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Fuzzy Logic (0.68)

Krishnan, Jitin, Purohit, Hemant, Rangwala, Huzefa

Unsupervised and Interpretable Domain Adaptation to Rapidly Filter Social Web Data for Emergency Services

During the onset of a disaster event, filtering relevant information from the social web data is challenging due to its sparse availability and practical limitations in labeling datasets of an ongoing crisis. In this paper, we show that unsupervised domain adaptation through multi-task learning can be a useful framework to leverage data from past crisis events, as well as exploit additional web resources for training efficient information filtering models during an ongoing crisis. We present a novel method to classify relevant tweets during an ongoing crisis without seeing any new examples, using the publicly available dataset of TREC incident streams that provides labeled tweets with 4 relevant classes across 10 different crisis events. Additionally, our method addresses a crucial but missing component from current research in web science for crisis data filtering models: interpretability. Specifically, we first identify a standard single-task attention-based neural network architecture and then construct a customized multi-task architecture for the crisis domain: Multi-Task Domain Adversarial Attention Network. This model consists of dedicated attention layers for each task and a domain classifier for gradient reversal. Evaluation of domain adaptation for crisis events is performed by choosing a target event as the test set and training on the rest. Our results show that the multi-task model outperformed its single-task counterpart and also, training with additional web-resources showed further performance boost. Furthermore, we show that the attention layer can be used as a guide to explain the model predictions by showcasing the words in a tweet that are deemed important in the classification process. Our research aims to pave the way towards a fully unsupervised and interpretable domain adaptation of low-resource crisis web data to aid emergency responders quickly and effectively.

adaptation, classifier, domain adaptation, (14 more...)

2003.04991

Country:

North America > Guatemala (0.05)
South America > Chile (0.05)
Oceania > Australia > Queensland (0.05)
(8 more...)

Genre:

Research Report > New Finding (0.68)
Research Report > Promising Solution (0.48)

Industry: Health & Medicine > Health Care Providers & Services (0.41)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Reduced Dilation-Erosion Perceptron for Binary Classification

Valle, Marcos Eduardo

Dilation and erosion are two elementary operations from mathematical morphology, a non-linear lattice computing methodology widely used for image processing and analysis. The dilation-erosion perceptron (DEP) is a morphological neural network obtained by a convex combination of a dilation and an erosion followed by the application of a hard-limiter function for binary classification tasks. A DEP classifier can be trained using a convex-concave procedure along with the minimization of the hinge loss function. As a lattice computing model, the DEP classifier assumes the feature and class spaces are partially ordered sets. In many practical situations, however, there is no natural ordering for the feature patterns. Using concepts from multi-valued mathematical morphology, this paper introduces the reduced dilation-erosion (r-DEP) classifier. An r-DEP classifier is obtained by endowing the feature space with an appropriate reduced ordering. Such reduced ordering can be determined using two approaches: One based on an ensemble of support vector classifiers (SVCs) with different kernels and the other based on a bagging of similar SVCs trained using different samples of the training set. Using several binary classification datasets from the OpenML repository, the ensemble and bagging r-DEP classifiers yielded in mean higher balanced accuracy scores than the linear, polynomial, and radial basis function (RBF) SVCs as well as their ensemble and a bagging of RBF SVCs.

classifier, morphological perceptron, perceptron, (15 more...)

2003.02306

Country:

North America > United States > Wisconsin (0.04)
North America > United States > New York > New York County > New York City (0.04)
South America > Brazil > São Paulo (0.04)
(6 more...)

Genre: Research Report (0.82)

Industry: Health & Medicine (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Perceptrons (0.75)

Saporito, Yuri F., Zhang, Zhaoyu

PDGM: a Neural Network Approach to Solve Path-Dependent Partial Differential Equations

In this paper we propose a generalization of the Deep Galerking Method (DGM) of Sirignano and Spiliopoulos [2018] to deal with Path-Dependent Partial Differential Equations (PPDEs). These equations firstly appeared in the seminal work of Dupire [2009], where the functional Itô calculus was developed to deal with path-dependent financial derivatives contracts. The method, which we call Path-Dependent DGM (PDGM), consists of using a combination of feed-forward and Long Short-Term Memory architectures to model the solution of the PPDE. We then analyze several numerical examples, many from the Financial Mathematics literature, that show the capabilities of the method under very different situations.

algorithm, neural network, preprint, (13 more...)

2003.02035

Country:

South America > Brazil (0.04)
North America > United States > New York (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Genre: Research Report (0.64)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

arXiv.org Artificial IntelligenceMar-4-2020

Knowledge Graphs

Hogan, Aidan, Blomqvist, Eva, Cochez, Michael, d'Amato, Claudia, de Melo, Gerard, Gutierrez, Claudio, Gayo, José Emilio Labra, Kirrane, Sabrina, Neumaier, Sebastian, Polleres, Axel, Navigli, Roberto, Ngomo, Axel-Cyrille Ngonga, Rashid, Sabbir M., Rula, Anisa, Schmelzeisen, Lukas, Sequeda, Juan, Staab, Steffen, Zimmermann, Antoine

In this paper we provide a comprehensive introduction to knowledge graphs, which have recently garnered significant attention from both industry and academia in scenarios that require exploiting diverse, dynamic, large-scale collections of data. After a general introduction, we motivate and contrast various graph-based data models and query languages that are used for knowledge graphs. We discuss the roles of schema, identity, and context in knowledge graphs. We explain how knowledge can be represented and extracted using a combination of deductive and inductive techniques. We summarise methods for the creation, enrichment, quality assessment, refinement, and publication of knowledge graphs. We provide an overview of prominent open knowledge graphs and enterprise knowledge graphs, their applications, and how they use the aforementioned techniques. We conclude with high-level future research directions for knowledge graphs.

computational natural language learning, domesticairport internationalairport, information and knowledge management, (17 more...)

arXiv.org Artificial Intelligence

2003.0232

Country:

North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.27)
Europe > Austria > Vienna (0.14)
North America > United States > New York > New York County > New York City (0.14)
(128 more...)

Genre:

Research Report (1.00)
Overview (1.00)
Instructional Material > Course Syllabus & Notes (1.00)

Industry:

Transportation > Passenger (1.00)
Transportation > Infrastructure & Services (1.00)
Transportation > Air (1.00)
(12 more...)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Semantic Networks (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.45)

arXiv.org Artificial IntelligenceMar-4-2020

Graph Neural Networks Meet Neural-Symbolic Computing: A Survey and Perspective

Lamb, Luis, Garcez, Artur, Gori, Marco, Prates, Marcelo, Avelar, Pedro, Vardi, Moshe

Neural-symbolic computing has now become the subject of interest of both academic and industry research laboratories. Graph Neural Networks (GNN) have been widely used in relational and symbolic domains, with widespread application of GNNs in combinatorial optimization, constraint satisfaction, relational reasoning and other scientific domains. The need for improved explainability, interpretability and trust of AI systems in general demands principled methodologies, as suggested by neural-symbolic computing. In this paper, we review the state-of-the-art on the use of GNNs as a model of neural-symbolic computing. This includes the application of GNNs in several domains as well as its relationship to current developments in neural-symbolic computing.

graph, neural network, representation, (12 more...)

arXiv.org Artificial Intelligence

2003.0033

Country:

North America > United States > New York (0.04)
South America > Brazil (0.04)
North America > Canada > Quebec > Montreal (0.04)

Genre:

Overview (0.66)
Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Constraint-Based Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)