AITopics | Overview

Collaborating Authors

Overview

A review on outlier/anomaly detection in time series data

Blázquez-García, Ane, Conde, Angel, Mori, Usue, Lozano, Jose A.

arXiv.org Machine LearningFeb-11-2020

The simplified series is obtained by first applying their univariate technique to each of the variables independently; that is, each univariate batch of data is separated into variable-length subsequences, and the obtained subsequences are then clustered as explained in Section 4.1. With this process, a set of representative univariate subsequences is obtained for each variable. Each new multivariate batch of data is then represented by a vector of distances, (d 1,d 2,...,d l), where d j represents the Euclidean distance between the j th variable-length subsequence of the new batch and its corresponding representative subsequence. As with their univariate technique, the reference of normality that is considered by this method is the same time series. The technique proposed by Hu et al. [2019] is also based on reducing the dimensionality of the time series and allows us to detect variable-length discords, while using the same time series as the reference of normality. This is based on the fact that the most unusual subsequences tend to have local regions with significantly different densities (points that are similar) in comparison to the other subsequences in the series. Each point in the new univariate time series describes the density of a local region of the input multivariate time series obtained by a sliding window. This series is also used to obtain the variable-length subsequences. Discords are identified using the Euclidean and Bhattacharyya distances simultaneously.

outlier, subsequence, time sery, (15 more...)

arXiv.org Machine Learning

2002.04236

Country:

North America > United States > California > San Francisco County > San Francisco (0.28)
Oceania > Australia > Victoria > Melbourne (0.14)
North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
(42 more...)

Genre:

Research Report (0.64)
Overview (0.46)

Industry: Information Technology > Security & Privacy (0.67)

Technology:

Information Technology > Data Science > Data Mining > Anomaly Detection (1.00)
Information Technology > Communications (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
(2 more...)

Add feedback

Deep Reinforcement Learning With TensorFlow 2.1 Roman Ring

#artificialintelligenceFeb-10-2020, 07:12:12 GMT

In this tutorial, I will give an overview of the TensorFlow 2.x features through the lens of deep reinforcement learning (DRL) by implementing an advantage actor-critic (A2C) agent, solving the classic CartPole-v0 environment. While the goal is to showcase TensorFlow 2.x, I will do my best to make DRL approachable as well, including a birds-eye overview of the field. In fact, since the main focus of the 2.x release is making life easier for the developers, it's a great time to get into DRL with TensorFlow. For example, the source code for this blog post is under 150 lines, including comments! Code is available on GitHub here and as a notebook on Google Colab here.

algorithm, eager mode, tensorflow 2, (13 more...)

#artificialintelligence

Genre: Overview (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Add feedback

Exploring Chemical Space using Natural Language Processing Methodologies for Drug Discovery

Öztürk, Hakime, Özgür, Arzucan, Schwaller, Philippe, Laino, Teodoro, Ozkirimli, Elif

arXiv.org Machine LearningFeb-10-2020

Biochemical methods that measure affinity and biophysical methods that describe the interaction in atomistic level detail have provided valuable information toward a mechanistic explanation for bimolecular recognition [1]. However, more often than not, compounds with drug potential are discovered serendipitously or by phenotypic drug discovery [2] since this highly specific interaction is still difficult to predict [3]. Protein structure based computational strategies such as docking [4], ultra-large library docking for discovering new chemotypes [5], and molecular dynamics simulations [4] or ligand based strategies such as quantitative structure-activity relationship (QSAR) [6, 7], and molecular similarity [8] have been powerful at narrowing down the list of compounds to be tested experimentally. With the increase in available data, machine learning and deep learning architectures are also starting to play a significant role in cheminformatics and drug discovery [9]. These approaches often require extensive computational resources or they are limited by the availability of 3D information. On the other hand, text based representations of biochemical entities are more readily available as evidenced by the 19,588 biomolecular complexes (3D structures) in PDB-Bind [10] (accessed on Nov 13, 2019) compared with 561,356 (manually annotated and reviewed) protein sequences in Uniprot [11] (accessed on Nov 13, 2019) or 97 million compounds in Pubchem [12] (accessed on Nov 13, 2019). The advances in natural language processing (NLP) methodologies make processing of text based representations of biomolecules an area of intense research interest. The discipline of natural language processing (NLP) comprises a variety of methods that explore a large amount of textual data in order to bring unstructured, latent (or hidden) knowledge to the fore [13]. Advances in this field are beneficial for tasks that use language (textual data) to build insight.

arxiv preprint arxiv, representation, sequence, (13 more...)

arXiv.org Machine Learning

doi: 10.1016/j.drudis.2020.01.020

2002.06053

Country:

Europe > Switzerland > Zürich > Zürich (0.14)
Europe > Middle East > Republic of Türkiye > Istanbul Province > Istanbul (0.04)
Asia > Middle East > Republic of Türkiye > Istanbul Province > Istanbul (0.04)
Europe > Ireland (0.04)

Genre:

Overview (0.93)
Research Report > New Finding (0.46)

Industry: Health & Medicine > Pharmaceuticals & Biotechnology (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Expert Systems (0.92)

Add feedback

Deep Learning for Financial Applications : A Survey

Ozbayoglu, Ahmet Murat, Gudelek, Mehmet Ugur, Sezer, Omer Berat

arXiv.org Machine LearningFeb-9-2020

Computational intelligence in finance has been a very popular topic for both academia and financial industry in the last few decades. Numerous studies have been published resulting in various models. Meanwhile, within the Machine Learning (ML) field, Deep Learning (DL) started getting a lot of attention recently, mostly due to its outperformance over the classical models. Lots of different implementations of DL exist today, and the broad interest is continuing. Finance is one particular area where DL models started getting traction, however, the playfield is wide open, a lot of research opportunities still exist. In this paper, we tried to provide a state-of-the-art snapshot of the developed DL models for financial applications, as of today. We not only categorized the works according to their intended subfield in finance but also analyzed them based on their DL models. In addition, we also aimed at identifying possible future implementations and highlighted the pathway for the ongoing research within the field.

application, implementation, prediction, (14 more...)

arXiv.org Machine Learning

2002.05786

Country:

Asia > Taiwan (0.04)
Europe > Middle East > Republic of Türkiye > Istanbul Province > Istanbul (0.04)
Asia > Singapore (0.04)
(19 more...)

Genre:

Overview (1.00)
Research Report > Promising Solution (0.93)
Research Report > New Finding (0.67)

Industry:

Information Technology > Software (1.00)
Information Technology > Security & Privacy (1.00)
Banking & Finance > Trading (1.00)
(6 more...)

Technology:

Information Technology > e-Commerce > Financial Technology (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.46)

Add feedback

Graph Neural Distance Metric Learning with Graph-Bert

Zhang, Jiawei

arXiv.org Machine LearningFeb-9-2020

Graph distance metric learning serves as the foundation for many graph learning problems, e.g., graph clustering, graph classification and graph matching. Existing research works on graph distance metric (or graph kernels) learning fail to maintain the basic properties of such metrics, e.g., non-negative, identity of indiscernibles, symmetry and triangle inequality, respectively. In this paper, we will introduce a new graph neural network based distance metric learning approaches, namely GB-DISTANCE (GRAPH-BERT based Neural Distance). Solely based on the attention mechanism, GB-DISTANCE can learn graph instance representations effectively based on a pre-trained GRAPH-BERT model. Different from the existing supervised/unsupervised metrics, GB-DISTANCE can be learned effectively in a semi-supervised manner. In addition, GB-DISTANCE can also maintain the distance metric basic properties mentioned above. Extensive experiments have been done on several benchmark graph datasets, and the results demonstrate that GB-DISTANCE can out-perform the existing baseline methods, especially the recent graph neural network model based graph metrics, with a significant gap in computing the graph distance.

gb-distance, neural network, representation, (9 more...)

arXiv.org Machine Learning

2002.03427

Country:

North America > United States > New York > New York County > New York City (0.04)
North America > United States > Michigan (0.04)
North America > United States > Florida > Leon County > Tallahassee (0.04)
(2 more...)

Genre:

Research Report (0.71)
Overview (0.46)

Industry:

Health & Medicine > Therapeutic Area (0.48)
Education (0.34)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Segmented Graph-Bert for Graph Instance Modeling

Zhang, Jiawei

arXiv.org Machine LearningFeb-8-2020

In graph instance representation learning, both the diverse graph instance sizes and the graph node orderless property have been the major obstacles that render existing representation learning models fail to work. In this paper, we will examine the effectiveness of GRAPH-BERT on graph instance representation learning, which was designed for node representation learning tasks originally. To adapt GRAPH-BERT to the new problem settings, we re-design it with a segmented architecture instead, which is also named as SEG-BERT (Segmented GRAPH-BERT) for reference simplicity in this paper. SEG-BERT involves no node-order-variant inputs or functional components anymore, and it can handle the graph node orderless property naturally. What's more, SEG-BERT has a segmented architecture and introduces three different strategies to unify the graph instance sizes, i.e., full-input, padding/pruning and segment shifting, respectively. SEG-BERT is pre-trainable in an unsupervised manner, which can be further transferred to new tasks directly or with necessary fine-tuning. We have tested the effectiveness of SEG-BERT with experiments on seven graph instance benchmark datasets, and SEG-BERT can out-perform the comparison methods on six out of them with significant performance advantages.

node, representation, seg-bert, (16 more...)

arXiv.org Machine Learning

2002.03283

Country:

North America > United States > New York > New York County > New York City (0.04)
North America > United States > Florida > Leon County > Tallahassee (0.04)
Europe > Hungary > Budapest > Budapest (0.04)
Asia > Middle East > Qatar > Ad-Dawhah > Doha (0.04)

Genre:

Research Report (0.51)
Overview (0.34)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language (0.93)

Add feedback

A Constraint Driven Solution Model for Discrete Domains with a Case Study of Exam Timetabling Problems

Sharma, Anuraganand

arXiv.org Artificial IntelligenceFeb-8-2020

Many science and engineering applications require finding solutions to planning and optimization problems by satisfying a set of constraints. These constraint problems (CPs) are typically NP-complete and can be formalized as constraint satisfaction problems (CSPs) or constraint optimization problems (COPs). Evolutionary algorithms (EAs) are good solvers for optimization problems ubiquitous in various problem domains, however traditional operators for EAs are 'blind' to constraints or generally use problem dependent objective functions; as they do not exploit information from the constraints in search for solutions. A variation of EA, Intelligent constraint handling evolutionary algorithm (ICHEA), has been demonstrated to be a versatile constraints-guided EA for continuous constrained problems in our earlier works in (Sharma and Sharma, 2012) where it extracts information from constraints and exploits it in the evolutionary search to make the search more efficient. In this paper ICHEA has been demonstrated to solve benchmark exam timetabling problems, a classic COP. The presented approach demonstrates competitive results with other state-of-the-art approaches in EAs in terms of quality of solutions. ICHEA first uses its inter-marriage crossover operator to satisfy all the given constraints incrementally and then uses combination of traditional and enhanced operators to optimize the solution. Generally CPs solved by EAs are problem dependent penalty based fitness functions. We also proposed a generic preference based solution model that does not require a problem dependent fitness function, however currently it only works for mutually exclusive constraints.

algorithm, constraint, feasible solution, (16 more...)

arXiv.org Artificial Intelligence

2002.03102

Country:

North America > Canada > Ontario > Toronto (0.14)
North America > United States > California > San Francisco County > San Francisco (0.14)
Oceania > Australia > Australian Capital Territory > Canberra (0.04)
(7 more...)

Genre:

Overview (1.00)
Research Report > Promising Solution (0.48)

Industry: Health & Medicine (0.31)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Constraint-Based Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Evolutionary Systems (1.00)

Add feedback

Causally Correct Partial Models for Reinforcement Learning

Rezende, Danilo J., Danihelka, Ivo, Papamakarios, George, Ke, Nan Rosemary, Jiang, Ray, Weber, Theophane, Gregor, Karol, Merzic, Hamza, Viola, Fabio, Wang, Jane, Mitrovic, Jovana, Besse, Frederic, Antonoglou, Ioannis, Buesing, Lars

arXiv.org Artificial IntelligenceFeb-7-2020

In reinforcement learning, we can learn a model of future observations and rewards, and use it to plan the agent's next actions. However, jointly modeling future observations can be computationally expensive or even intractable if the observations are high-dimensional (e.g. images). For this reason, previous works have considered partial models, which model only part of the observation. In this paper, we show that partial models can be causally incorrect: they are confounded by the observations they don't model, and can therefore lead to incorrect planning. To address this, we introduce a general family of partial models that are provably causally correct, yet remain fast because they do not need to fully model future observations.

agent, behavior policy, partial model, (14 more...)

arXiv.org Artificial Intelligence

2002.02836

Country: North America > Canada > Quebec > Montreal (0.04)

Genre:

Research Report (0.64)
Overview (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

Five Emerging Trends In Retail Technology

#artificialintelligenceFeb-6-2020, 04:43:50 GMT

Consumers want shopping experiences that feel personal to them, whether they are shopping online or in-store. Retailers are in a race to use technology to match consumer expectations and win more business. With new advancements in technology, retail will continue its state of constant disruption. Most consumers dread having to call a customer service phone number. Instead, 55% of Americans -- and 65% of millennials -- want chatbots involved in the customer service process, especially when it makes the customer service process more efficient.

consumer, customer, retailer, (16 more...)

#artificialintelligence

Genre: Overview (0.40)

Industry: Retail (1.00)

Technology:

Information Technology > e-Commerce (0.97)
Information Technology > Artificial Intelligence > Natural Language (0.72)

Add feedback

From Data to Actions in Intelligent Transportation Systems: a Prescription of Functional Requirements for Model Actionability

Lana, Ibai, Sanchez-Medina, Javier J., Vlahogianni, Eleni I., Del Ser, Javier

arXiv.org Artificial IntelligenceFeb-6-2020

Advances in Data Science are lately permeating every field of Transportation Science and Engineering, making it straightforward to imagine that developments in the transportation sector will be data-driven. Nowadays, Intelligent Transportation Systems (ITS) could be arguably approached as a "story" intensively producing and consuming large amounts of data. A diversity of sensing devices densely spread over the infrastructure, vehicles or the travelers' personal devices act as sources of data flows that are eventually fed to software running on automatic devices, actuators or control systems producing, in turn, complex information flows between users, traffic managers, data analysts, traffic modeling scientists, etc. These information flows provide enormous opportunities to improve model development and decision-making. The present work aims to describe how data, coming from diverse ITS sources, can be used to learn and adapt data-driven models for efficiently operating ITS assets, systems and processes; in other words, for data-based models to fully become actionable. Grounded on this described data modeling pipeline for ITS, we define the characteristics, engineering requisites and challenges intrinsic to its three compounding stages, namely, data fusion, adaptive learning and model evaluation. We deliberately generalize model learning to be adaptive, since, in the core of our paper is the firm conviction that most learners will have to adapt to the everchanging phenomenon scenario underlying the majority of ITS applications. Finally, we provide a prospect of current research lines within the Data Science realm that can bring notable advances to data-based ITS modeling, which will eventually bridge the gap towards the practicality and actionability of such models.

application, data-based model, requirement, (13 more...)

arXiv.org Artificial Intelligence

2002.0221

Country:

North America > United States > California > Los Angeles County > Los Angeles (0.14)
North America > United States > Ohio (0.04)
North America > United States > New York (0.04)
(10 more...)

Genre:

Overview (1.00)
Research Report (0.81)

Industry:

Transportation > Infrastructure & Services (1.00)
Transportation > Ground > Road (1.00)
Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Information Management (1.00)
Information Technology > Data Science > Data Mining > Big Data (1.00)
Information Technology > Data Science > Data Integration (1.00)
(11 more...)

Add feedback