AITopics | SPE

Collaborating Authors

SPE

Bayesian Optimization with Gradients

Wu, Jian, Poloczek, Matthias, Wilson, Andrew Gordon, Frazier, Peter I.

arXiv.org Artificial IntelligenceFeb-5-2018, 18:00:00 GMT

Bayesian optimization has been successful at global optimization of expensive-to-evaluate multimodal objective functions. However, unlike most optimization methods, Bayesian optimization typically does not use derivative information. In this paper we show how Bayesian optimization can exploit derivative information to decrease the number of objective function evaluations required for good performance. In particular, we develop a novel Bayesian optimization algorithm, the derivative-enabled knowledge-gradient (dKG), for which we show one-step Bayes-optimality, asymptotic consistency, and greater one-step value of information than is possible in the derivative-free setting. Our procedure accommodates noisy and incomplete derivative information, comes in both sequential and batch forms, and can optionally reduce the computational cost of inference through automatically selected retention of a single directional derivative. We also compute the d-KG acquisition function and its gradient using a novel fast discretization-free technique. We show d-KG provides state-of-the-art performance compared to a wide range of optimization procedures with and without gradients, on benchmarks including logistic regression, deep learning, kernel learning, and k-nearest neighbors.

artificial intelligence, machine learning, optimization, (16 more...)

arXiv.org Artificial Intelligence

1703.04389

Country: North America > United States (0.67)

Genre:

Research Report > New Finding (0.34)
Research Report > Experimental Study (0.34)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Nearest Neighbor Methods (0.55)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.34)

Add feedback

Towards Shockingly Easy Structured Classification: A Search-based Probabilistic Online Learning Framework

Sun, Xu

arXiv.org Artificial IntelligenceFeb-4-2018, 18:00:00 GMT

There are two major approaches for structured classification. One is the probabilistic gradient-based methods such as conditional random fields (CRF), which has high accuracy but with drawbacks: slow training, and no support of search-based optimization (which is important in many cases). The other one is the search-based learning methods such as perceptrons and margin infused relaxed algorithm (MIRA), which have fast training but also with drawbacks: low accuracy, no probabilistic information, and non-convergence in real-world tasks. We propose a novel and "shockingly easy" solution, a search-based probabilistic online learning method, to address most of those issues. This method searches the output candidates, derives probabilities, and conduct efficient online learning. We show that this method is with fast training, support search-based optimization, very easy to implement, with top accuracy, with probabilities, and with theoretical guarantees of convergence. Experiments on well-known tasks show that our method has better accuracy than CRF and almost as fast training speed as perceptron and MIRA. Results also show that SAPO can easily beat the state-of-the-art systems on those highly-competitive tasks, achieving record-breaking accuracies. The codes can be found at https://github.com/lancopku

artificial intelligence, learning method, machine learning, (15 more...)

arXiv.org Artificial Intelligence

1503.08381

Country: North America > United States (0.28)

Genre: Research Report (1.00)

Industry: Education > Educational Setting > Online (1.00)

Add feedback

Employers Are Setting Workers Up for Failure

#artificialintelligenceFeb-4-2018, 12:50:15 GMT

Machines are learning how to perform routine tasks and some more complicated ones, and their progress is piquing employers' interests. The retail and health industries in particular stand the most to gain from incorporating artificial intelligence into work. Both could see about a 50 percent revenue increase, according to a new Accenture report. And if all companies invest in artificial intelligence at a rate similar to that of top-performing businesses, such as those in the S&P 500, companies could boost their revenues by 38 percent. Despite all of the talk of machines taking away jobs, the study also found that 100 percent of C-level executives who plan to use artificial intelligence intend to use that AI to enhance, not diminish, their workers' capabilities; three in four of the C-level executives surveyed said they plan to automate tasks "to a large or very large extent" in the next three years.

artificial intelligence, c-level executive, employer

#artificialintelligence

Genre: Research Report > New Finding (0.39)

Technology: Information Technology > Artificial Intelligence (1.00)

Add feedback

Skopus: Mining top-k sequential patterns under leverage

Petitjean, Francois, Li, Tao, Tatti, Nikolaj, Webb, Geoffrey I.

arXiv.org Artificial IntelligenceFeb-3-2018, 18:00:00 GMT

This paper presents a framework for exact discovery of the top-k sequential patterns under Leverage. It combines (1) a novel definition of the expected support for a sequential pattern - a concept on which most interestingness measures directly rely - with (2) SkOPUS: a new branch-and-bound algorithm for the exact discovery of top-k sequential patterns under a given measure of interest. Our interestingness measure employs the partition approach. A pattern is interesting to the extent that it is more frequent than can be explained by assuming independence between any of the pairs of patterns from which it can be composed. The larger the support compared to the expectation under independence, the more interesting is the pattern. We build on these two elements to exactly extract the k sequential patterns with highest leverage, consistent with our definition of expected support. We conduct experiments on both synthetic data with known patterns and real-world datasets; both experiments confirm the consistency and relevance of our approach with regard to the state of the art. This article was published in Data Mining and Knowledge Discovery and is accessible at http://dx.doi.org/10.1007/s10618-016-0467-9.

data mining, machine learning, sequential pattern, (18 more...)

arXiv.org Artificial Intelligence

doi: 10.1007/s10618-016-0467-9

1506.08009

Country:

Asia (1.00)
Europe (0.67)
North America > United States > California (0.28)

Genre: Research Report (1.00)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Learning Structured Text Representations

Liu, Yang, Lapata, Mirella

arXiv.org Artificial IntelligenceFeb-2-2018, 18:00:00 GMT

In this paper, we focus on learning structure-aware document representations from data without recourse to a discourse parser or additional annotations. Drawing inspiration from recent efforts to empower neural networks with a structural bias (Cheng et al., 2016; Kim et al., 2017), we propose a model that can encode a document while automatically inducing rich structural dependencies. Specifically, we embed a differentiable non-projective parsing algorithm into a neural model and use attention mechanisms to incorporate the structural biases. Experimental evaluations across different tasks and datasets show that the proposed model achieves state-of-the-art results on document modeling tasks while inducing intermediate structures which are both interpretable and meaningful.

artificial intelligence, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

1705.09207

Genre: Research Report (0.82)

Industry: Government (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.99)

Add feedback

What Machine Learning Can do For Retailers Today

#artificialintelligenceFeb-2-2018, 13:30:35 GMT

At its simplest, machine learning (ML) refers to the capacity for a program to automatically improve, or "learn," as it ingests data to accomplish a specific task or set of tasks. While ML is a subset of artificial intelligence, it's often used interchangeably with AI and conflated with predictive analytics or algorithms. The many uses and applications of machine learning create a lot of confusion about what the term really means, especially at a time when complex algorithms have been able to generate seemingly "intelligent" outcomes for quite some time. Machine learning goes a step beyond algorithms or predictive analytics, and its applications are growing exponentially in tandem with the number of new and existing companies investing in its development. Despite machine learning's advanced nature, it may come as a surprise to smaller and midmarket retailers that ML is by no means out of reach.

artificial intelligence, machine learning, retailer, (5 more...)

#artificialintelligence

Industry: Retail (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Finding Better Active Learners for Faster Literature Reviews

Yu, Zhe, Kraft, Nicholas A., Menzies, Tim

arXiv.org Artificial IntelligenceFeb-2-2018

Literature reviews can be time-consuming and tedious to complete. By cataloging and refactoring three state-of-the-art active learning techniques from evidence-based medicine and legal electronic discovery, this paper finds and implements FASTREAD, a faster technique for studying a large corpus of documents. This paper assesses FASTREAD using datasets generated from existing SE literature reviews (Hall, Wahono, Radjenovi\'c, Kitchenham et al.). Compared to manual methods, FASTREAD lets researchers find 95% relevant studies after reviewing an order of magnitude fewer papers. Compared to other state-of-the-art automatic methods, FASTREAD reviews 20-50% fewer studies while finding same number of relevant primary studies in a systematic literature review.

data mining, machine learning, natural language, (16 more...)

arXiv.org Artificial Intelligence

1612.03224

Country: North America > United States (0.94)

Genre:

Research Report > New Finding (1.00)
Overview (1.00)
Research Report > Promising Solution (0.68)

Industry: Law > Litigation (0.66)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Data Science > Data Mining (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Support Vector Machines (0.46)

Add feedback

An overview of embedding models of entities and relationships for knowledge base completion

Nguyen, Dat Quoc

arXiv.org Artificial IntelligenceFeb-1-2018, 18:00:00 GMT

Knowledge bases (KBs) of real-world facts about entities and their relationships are useful resources for a variety of natural language processing tasks. However, because knowledge bases are typically incomplete, it is useful to be able to perform knowledge base completion or link prediction, i.e., predict whether a relationship not in the knowledge base is likely to be true. This article serves as a brief overview of embedding models of entities and relationships for knowledge base completion, summarizing up-to-date experimental results on standard benchmark datasets FB15k, WN18, FB15k-237, WN18RR, FB13 and WN11.

artificial intelligence, knowledge management, natural language, (13 more...)

arXiv.org Artificial Intelligence

1703.08098

Country:

Europe (1.00)
Asia (0.93)

Genre: Research Report (0.64)

Technology:

Information Technology > Knowledge Management > Knowledge Engineering (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Expert Systems (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)

Add feedback

A Note on Topology Preservation in Classification, and the Construction of a Universal Neuron Grid

Volz, Dietmar

arXiv.org Artificial IntelligenceFeb-1-2018

It will be shown that according to theorems of K. Menger, every neuron grid if identified with a curve is able to preserve the adopted qualitative structure of a data space. Furthermore, if this identification is made, the neuron grid structure can always be mapped to a subset of a universal neuron grid which is constructable in three space dimensions. Conclusions will be drawn for established neuron grid types as well as neural fields. Topology is one of the basic branches of mathematics. It is sometimes also referred to as qualitative geometry, in a way that it deals with the qualitative properties and structure of geometrical objects.

artificial intelligence, machine learning, neuron grid, (18 more...)

arXiv.org Artificial Intelligence

1308.1603

Genre: Research Report (0.40)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Reliable Decision Support using Counterfactual Models

Schulam, Peter, Saria, Suchi

arXiv.org Artificial IntelligenceFeb-1-2018

Decision-makers are faced with the challenge of estimating what is likely to happen when they take an action. For instance, if I choose not to treat this patient, are they likely to die? Practitioners commonly use supervised learning algorithms to fit predictive models that help decision-makers reason about likely future outcomes, but we show that this approach is unreliable, and sometimes even dangerous. The key issue is that supervised learning algorithms are highly sensitive to the policy used to choose actions in the training data, which causes the model to capture relationships that do not generalize. We propose using a different learning objective that predicts counterfactuals instead of predicting outcomes under an existing action policy as in supervised learning. To support decision-making in temporal settings, we introduce the Counterfactual Gaussian Process (CGP) to predict the counterfactual future progression of continuous-time trajectories under sequences of future actions. We demonstrate the benefits of the CGP on two important decision-support tasks: risk prediction and "what if?" reasoning for individualized treatment planning.

artificial intelligence, machine learning, reinforcement learning, (18 more...)

arXiv.org Artificial Intelligence

1703.10651

Country: North America > United States (0.46)

Genre: Research Report > New Finding (0.46)

Industry: Health & Medicine > Therapeutic Area > Nephrology (0.68)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.67)

Add feedback