Goto

Collaborating Authors

 SPE


Explanation in Human-AI Systems: A Literature Meta-Review, Synopsis of Key Ideas and Publications, and Bibliography for Explainable AI

arXiv.org Artificial Intelligence

This is an integrative review that address the question, "What makes for a good explanation?" with reference to AI systems. Pertinent literatures are vast. Thus, this review is necessarily selective. That said, most of the key concepts and issues are expressed in this Report. The Report encapsulates the history of computer science efforts to create systems that explain and instruct (intelligent tutoring systems and expert systems). The Report expresses the explainability issues and challenges in modern AI, and presents capsule views of the leading psychological theories of explanation. Certain articles stand out by virtue of their particular relevance to XAI, and their methods, results, and key points are highlighted. It is recommended that AI/XAI researchers be encouraged to include in their research reports fuller details on their empirical or experimental methods, in the fashion of experimental psychology research reports: details on Participants, Instructions, Procedures, Tasks, Dependent Variables (operational definitions of the measures and metrics), Independent Variables (conditions), and Control Conditions.


A Netflix for video games? Why a longtime dream is closer than ever to coming true.

Washington Post - Technology News

When Cory Burdette awoke recently to learn that Winter Storm Gia had caused a two-hour school delay in Reston, Va., he seized the chance to do a little family bonding. Plopping down in front of the TV, Burdette and his 5-year-old daughter spent the morning together playing Minecraft, the Lego-like adventure game where players construct buildings out of virtual blocks. "We play all our games together on the Xbox," he said. "In Minecraft, we both get to build a house together, find monsters and explore." The first time he fired up the game, Burdette had to wait for Minecraft to download and install on his Xbox before launching it.


Deep Learning for Anomaly Detection: A Survey

arXiv.org Machine Learning

Anomaly detection is an important problem that has been well-studied within diverse research areas and application domains. The aim of this survey is twofold, firstly we present a structured and comprehensive overviewof research methods in deep learning-based anomaly detection. Furthermore, we review the adoption of these methods for anomaly across various application domains and assess their effectiveness. We have grouped state-of-the-art deep anomaly detection research techniques into different categories based on the underlying assumptions and approach adopted. Within each category, we outline the basic anomaly detection technique, along with its variants and present key assumptions, to differentiate between normal and anomalous behavior. Besides, for each category, we also present the advantages and limitations and discuss the computational complexity of the techniques inreal application domains. Finally, we outline open issues in research and challenges faced while adopting deep anomaly detection techniques for real-world problems.


Combating Fake News: A Survey on Identification and Mitigation Techniques

arXiv.org Machine Learning

The proliferation of fake news on social media has opened up new directions of research for timely identification and containment of fake news, and mitigation of its widespread impact on public opinion. While much of the earlier research was focused on identification of fake news based on its contents or by exploiting users' engagements with the news on social media, there has been a rising interest in proactive intervention strategies to counter the spread of misinformation and its impact on society. In this survey, we describe the modern-day problem of fake news and, in particular, highlight the technical challenges associated with it. We discuss existing methods and techniques applicable to both identification and mitigation, with a focus on the significant advances in each method and their advantages and limitations. In addition, research has often been limited by the quality of existing datasets and their specific application contexts. To alleviate this problem, we comprehensively compile and summarize characteristic features of available datasets. Furthermore, we outline new directions of research to facilitate future development of effective and interdisciplinary solutions.


How Algorithms Are Taking Over Big Oil

#artificialintelligence

With the help of artificial intelligence, BP says it needs 40% fewer workers to keep its natural gas flowing in Wyoming. A visitor to one of BP's natural gas fields in Wyoming a few years ago might have noticed an odd sight: smartphones in plastic bags tied to pumps with zip ties. This was an early test of a multistate initiative by the oil giant to link a network of Wi-Fi sensors to an artificial intelligence system--one that now operates the Wamsutter field in Wyoming with far less human oversight than before. Artificial intelligence has come to the oil patch, accelerating a technical change that is transforming the conditions for the oil and gas industry's 150,000 U.S. workers. Giant energy companies like Shell and BP are investing billions to bring artificial intelligence to new refineries, oilfields and deepwater drilling platforms.


10 ways AI will completely change Product Management - GipsyBot

#artificialintelligence

"In the last meeting you discussed about Alex showcasing the new notifications feature today. You also wanted to see a sample report of MixPanel data. Also Martin has a lot of unanswered questions in Google Drive โ€“ you may want to clarify a few points with him. Ulysee mentioned a blocker in the sprint this week โ€“ he wants you to read this 15 minutes document before the call" Every time a new design is completed for a new feature Gipsy sends reports from automated user testing done through a third party app.


CFOF: A Concentration Free Measure for Anomaly Detection

arXiv.org Machine Learning

We present a novel notion of outlier, called the Concentration Free Outlier Factor, or CFOF. As a main contribution, we formalize the notion of concentration of outlier scores and theoretically prove that CFOF does not concentrate in the Euclidean space for any arbitrary large dimensionality. To the best of our knowledge, there are no other proposals of data analysis measures related to the Euclidean distance for which it has been provided theoretical evidence that they are immune to the concentration effect. We determine the closed form of the distribution of CFOF scores in arbitrarily large dimensionalities and show that the CFOF score of a point depends on its squared norm standard score and on the kurtosis of the data distribution, thus providing a clear and statistically founded characterization of this notion. Moreover, we leverage this closed form to provide evidence that the definition does not suffer of the hubness problem affecting other measures. We prove that the number of CFOF outliers coming from each cluster is proportional to cluster size and kurtosis, a property that we call semi-locality. We determine that semi-locality characterizes existing reverse nearest neighbor-based outlier definitions, thus clarifying the exact nature of their observed local behavior. We also formally prove that classical distance-based and density-based outliers concentrate both for bounded and unbounded sample sizes and for fixed and variable values of the neighborhood parameter. We introduce the fast-CFOF algorithm for detecting outliers in large high-dimensional dataset. The algorithm has linear cost, supports multi-resolution analysis, and is embarrassingly parallel. Experiments highlight that the technique is able to efficiently process huge datasets and to deal even with large values of the neighborhood parameter, to avoid concentration, and to obtain excellent accuracy.


Automated Rationale Generation: A Technique for Explainable AI and its Effects on Human Perceptions

arXiv.org Artificial Intelligence

Automated rationale generation is an approach for real-time explanation generation whereby a computational model learns to translate an autonomous agent's internal state and action data representations into natural language. Training on human explanation data can enable agents to learn to generate human-like explanations for their behavior. In this paper, using the context of an agent that plays Frogger, we describe (a) how to collect a corpus of explanations, (b) how to train a neural rationale generator to produce different styles of rationales, and (c) how people perceive these rationales. We conducted two user studies. The first study establishes the plausibility of each type of generated rationale and situates their user perceptions along the dimensions of confidence, humanlike-ness, adequate justification, and understandability. The second study further explores user preferences between the generated rationales with regard to confidence in the autonomous agent, communicating failure and unexpected behavior. Overall, we find alignment between the intended differences in features of the generated rationales and the perceived differences by users. Moreover, context permitting, participants preferred detailed rationales to form a stable mental model of the agent's behavior.


In 2019, We'll Have Taxis Without Drivers--or Steering Wheels

IEEE Spectrum

A coming milestone in the automobile world is the widespread rollout of Level 4 autonomy, where the car drives itself without supervision. Waymo, the company spun out of Google's self-driving car research, said it would start a commercial Level 4 taxi service by late 2018, although that hadn't happened as of press time. And GM Cruise, in San Francisco, is committed to do the same in 2019, using a Chevrolet Bolt that has neither a steering wheel nor pedals. These cars wouldn't work in all conditions and regions--that's why they're on rung 4 and not rung 5 of the autonomy ladder. But within some limited operational domain, they'll have the look and feel of a fully robotized car.


Off-Policy Evaluation of Probabilistic Identity Data in Lookalike Modeling

arXiv.org Machine Learning

We evaluate the impact of probabilistically-constructed digital identity data collected from Sep. to Dec. 2017 (approx.), in the context of Lookalike-targeted campaigns. The backbone of this study is a large set of probabilistically-constructed "identities", represented as small bags of cookies and mobile ad identifiers with associated metadata, that are likely all owned by the same underlying user. The identity data allows to generate "identity-based", rather than "identifier-based", user models, giving a fuller picture of the interests of the users underlying the identifiers. We employ off-policy techniques to evaluate the potential of identity-powered lookalike models without incurring the risk of allowing untested models to direct large amounts of ad spend or the large cost of performing A/B tests. We add to historical work on off-policy evaluation by noting a significant type of "finite-sample bias" that occurs for studies combining modestly-sized datasets and evaluation metrics involving rare events (e.g., conversions). We illustrate this bias using a simulation study that later informs the handling of inverse propensity weights in our analyses on real data. We demonstrate significant lift in identity-powered lookalikes versus an identity-ignorant baseline: on average ~70% lift in conversion rate. This rises to factors of ~(4-32)x for identifiers having little data themselves, but that can be inferred to belong to users with substantial data to aggregate across identifiers. This implies that identity-powered user modeling is especially important in the context of identifiers having very short lifespans (i.e., frequently churned cookies). Our work motivates and informs the use of probabilistically-constructed identities in marketing. It also deepens the canon of examples in which off-policy learning has been employed to evaluate the complex systems of the internet economy.