Goto

Collaborating Authors

 Rule-Based Reasoning


Artificial intelligence: The growth factor for Cloud GPU market

#artificialintelligence

According to a report by IDC, worldwide spending on artificial intelligence systems is forecast to reach $35.8 billion in 2019, an increase of 44.0% over the amount spent in 2018. The report also predicts that the retail sector will lead the spending, followed by the banking sector. Artificial intelligence is well-positioned to impact various sectors like retail, healthcare, banking, finance, discrete manufacturing, transportation, etc. According to a Gartner survey, 37% of organizations have implemented AI in some way. In the early stages, AI was based on rule-based systems, in which, the AI system depended on a knowledge base of rules to deliver business value. These systems were limited by how well the rules were defined by human experts.


Coverage-based Outlier Explanation

arXiv.org Artificial Intelligence

Outlier detection is a core task in data mining with a plethora of algorithms that have enjoyed wide scale usage. Existing algorithms are primarily focused on detection, that is the identification of outliers in a given dataset. In this paper we explore the relatively under-studied problem of the outlier explanation problem. Our goal is, given a dataset that is already divided into outliers and normal instances, explain what characterizes the outliers. We explore the novel direction of a semantic explanation that a domain expert or policy maker is able to understand. We formulate this as an optimization problem to find explanations that are both interpretable and pure. Through experiments on real-world data sets, we quantitatively show that our method can efficiently generate better explanations compared with rule-based learners.


SHACL Constraints with Inference Rules

arXiv.org Artificial Intelligence

The Shapes Constraint Language (SHACL) has been recently introduced as a W3C recommendation to define constraints that can be validated against RDF graphs. Interactions of SHACL with other Semantic Web technologies, such as ontologies or reasoners, is a matter of ongoing research. In this paper we study the interaction of a subset of SHACL with inference rules expressed in datalog. On the one hand, SHACL constraints can be used to define a "schema" for graph datasets. On the other hand, inference rules can lead to the discovery of new facts that do not match the original schema. Given a set of SHACL constraints and a set of datalog rules, we present a method to detect which constraints could be violated by the application of the inference rules on some graph instance of the schema, and update the original schema, i.e, the set of SHACL constraints, in order to capture the new facts that can be inferred. We provide theoretical and experimental results of the various components of our approach.


Generating Justifications for Norm-Related Agent Decisions

arXiv.org Artificial Intelligence

W e present an approach to generating natural language justifications of decisions derived from norm-based reasoning. Assuming an agent which maximally satisfies a set of rules specified in an object-oriented temporal logic, the user can ask factual questions (about the agent's rules, actions, and the extent to which the agent violated the rules) as well as "why" questions that require the agent comparing actual behavior to counterfactual trajectories with respect to these rules. To produce natural-sounding explanations, we focus on the subproblem of producing natural language clauses from statements in a fragment of temporal logic, and then describe how to embed these clauses into explanatory sentences. W e use a human judgment evaluation on a testbed task to compare our approach to variants in terms of intelligibility, mental model and perceived trust.


Fast Dimensional Analysis for Root Cause Investigation in Large-Scale Service Environment

arXiv.org Machine Learning

Root cause analysis in a large-scale production environment is challenging due to the complexity of services running across global data centers. Due to the distributed nature of a large-scale system, the various hardware, software, and tooling logs are often maintained separately, making it difficult to review the logs jointly for detecting issues. Another challenge in reviewing the logs for identifying issues is the scale - there could easily be millions of entities, each with hundreds of features. In this paper we present a fast dimensional analysis framework that automates the root cause analysis on structured logs with improved scalability. We first explore item-sets, i.e. a group of feature values, that could identify groups of samples with sufficient support for the target failures using the Apriori algorithm and a subsequent improvement, FP-Growth. These algorithms were designed for frequent item-set mining and association rule learning over transactional databases. After applying them on structured logs, we select the item-sets that are most unique to the target failures based on lift. With the use of a large-scale real-time database, we propose pre- and post-processing techniques and parallelism to further speed up the analysis. We have successfully rolled out this approach for root cause investigation purposes in a large-scale infrastructure. We also present the setup and results from multiple production use-cases in this paper.


DRUM: End-To-End Differentiable Rule Mining On Knowledge Graphs

arXiv.org Machine Learning

In this paper, we study the problem of learning probabilistic logical rules for inductive and interpretable link prediction. Despite the importance of inductive link prediction, most previous works focused on transductive link prediction and cannot manage previously unseen entities. Moreover, they are black-box models that are not easily explainable for humans. We propose DRUM, a scalable and differentiable approach for mining first-order logical rules from knowledge graphs which resolves these problems. We motivate our method by making a connection between learning confidence scores for each rule and low-rank tensor approximation. DRUM uses bidirectional RNNs to share useful information across the tasks of learning rules for different relations. We also empirically demonstrate the efficiency of DRUM over existing rule mining methods for inductive link prediction on a variety of benchmark datasets.


Sana Benefits gets $6.3 million to disrupt 'Stone Age' healthcare insurance industry

#artificialintelligence

Sana Benefits, an Austin company that wants to disrupt the healthcare insurance industry with more efficient software, said it has raised $6.3 million in seed funding. The company joins a raft of upstarts disrupting a notoriously inefficient healthcare system. These startups include players like Oscar Health, Clover, and Bright Health -- all of which have raised hundreds of millions to billions of dollars over the last few years. Most of them have been focused on individuals and not as much on the employer plans that cover half of Americans. Sana distinguishes itself in that it's one of the few new providers focused on providers, and specifically the market for small and medium sized businesses (a thousand or fewer employees).


An Active Approach for Model Interpretation

arXiv.org Artificial Intelligence

Model interpretation, or explanation of a machine learning classifier, aims to extract generalizable knowledge from a trained classifier into a human-understandable format, for various purposes such as model assessment, debugging and trust. From a computaional viewpoint, it is formulated as approximating the target classifier using a simpler interpretable model, such as rule models like a decision set/list/tree. Often, this approximation is handled as standard supervised learning and the only difference is that the labels are provided by the target classifier instead of ground truth. This paradigm is particularly popular because there exists a variety of well-studied supervised algorithms for learning an interpretable classifier. However, we argue that this paradigm is suboptimal for it does not utilize the unique property of the model interpretation problem, that is, the ability to generate synthetic instances and query the target classifier for their labels. We call this the active-query property, suggesting that we should consider model interpretation from an active learning perspective. Following this insight, we argue that the active-query property should be employed when designing a model interpretation algorithm, and that the generation of synthetic instances should be integrated seamlessly with the algorithm that learns the model interpretation. In this paper, we demonstrate that by doing so, it is possible to achieve more faithful interpretation with simpler model complexity. As a technical contribution, we present an active algorithm Active Decision Set Induction (ADS) to learn a decision set, a set of if-else rules, for model interpretation. ADS performs a local search over the space of all decision sets. In every iteration, ADS computes confidence intervals for the value of the objective function of all local actions and utilizes active-query to determine the best one.


How AI spots fraud quicker than people - Raconteur

#artificialintelligence

Identity fraud, in which a slice of your identity ranging from new credit cards to entire bank accounts is taken over by criminals, rose by 49 per cent in 2015 on the previous year. That totalled almost 170,000 cases, according to data collected by Cifas, the financial industry's non-profit fraud advisory service. The reason for the rise is that more and more we use the internet for financial transactions, but have very few ways to verify our identity without cumbersome systems involving human interaction, which are also vulnerable to fraud. Cifas' 2015 Fraudscape report shows that 86 per cent of identity fraud happened online, with bank accounts and credit or debit cards most targeted, closely followed by loans and communications, typically mobile phone accounts. Businesses looking to tackle fraud are turning to artificial intelligence and deploying neural networks because the systems learn in a manner like the brain's own neurons to try to bust fraud Traditionally, companies dealing with such problems have acted after the fact, trying to unravel complex or opportunistic frauds by working back through audit trails.


Introduction to Natural Language Processing (NLP) - KDnuggets

#artificialintelligence

NLP is an interdisciplinary field concerned with the interactions between computers and human natural languages (e.g: English) -- speech or text. Okay, now we get it, NLP plays a major role in our daily computer interactions, let's see more business-related NLP use-cases: NLP is divided into two fields: Linguistics and Computer Science. The Linguistics side is concerned with language, it's formation, syntax, meaning, different kind of phrases (noun or verb) and whatnot. The Computer Science side is concerned with applying linguistic knowledge, by transforming it into computer programs with the help of sub-fields such as Artificial Intelligence (Machine Learning & Deep Learning). Scientific advancements in NLP can be divided into 3 categories (Rule-based systems, Classical Machine Learning models and Deep Learning models).