AITopics | Accuracy

Collaborating Authors

Accuracy

News Overviews Instructional Materials AI-Alerts Classics

WWE Extreme Rules 2017: Predictions, Match Card For 'Monday Night Raw' PPV

International Business TimesMay-30-2017, 14:47:34 GMT

WWE's next pay-per-view is set for Sunday night in Baltimore with Extreme Rules 2017. It will exclusively feature wrestlers from the "Monday Night Raw" roster. Six matches are on the Extreme Rules card, and the main event will determine the No.1 contender for Brock Lesnar's WWE Universal Championship. Below are predictions for every match at the event. Reigns is really the only superstar that has no chance in this match. The winner is expected to face Lesnar at the Great Balls of Fire PPV on July 9, and Reigns vs. Lesnar would only happen at SummerSlam or WrestleMania.

artificial intelligence, machine learning, prediction, (15 more...)

International Business Times

Country:

Oceania > Samoa (0.06)
North America > United States > New York (0.06)

Industry: Leisure & Entertainment > Sports > Martial Arts (0.93)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.62)

Add feedback

Optimization of Tree Ensembles

Mišić, Velibor V.

arXiv.org Machine LearningMay-30-2017

Tree ensemble models such as random forests and boosted trees are among the most widely used and practically successful predictive models in applied machine learning and business analytics. Although such models have been used to make predictions based on exogenous, uncontrollable independent variables, they are increasingly being used to make predictions where the independent variables are controllable and are also decision variables. In this paper, we study the problem of tree ensemble optimization: given a tree ensemble that predicts some dependent variable using controllable independent variables, how should we set these variables so as to maximize the predicted value? We formulate the problem as a mixed-integer optimization problem. We theoretically examine the strength of our formulation, provide a hierarchy of approximate formulations with bounds on approximation quality and exploit the structure of the problem to develop two large-scale solution methods, one based on Benders decomposition and one based on iteratively generating tree split constraints. We test our methodology on real data sets, including two case studies in drug design and customized pricing, and show that our methodology can efficiently solve large-scale instances to near or full optimality, and outperforms solutions obtained by heuristic approaches. In our drug design case, we show how our approach can identify compounds that efficiently trade-off predicted performance and novelty with respect to existing, known compounds. In our customized pricing case, we show how our approach can efficiently determine optimal store-level prices under a random forest model that delivers excellent predictive accuracy.

constraint, data mining, machine learning, (16 more...)

arXiv.org Machine Learning

1705.10883

Country: North America > United States > California > Los Angeles County > Los Angeles (0.28)

Genre: Research Report > New Finding (0.67)

Industry:

Health & Medicine > Pharmaceuticals & Biotechnology (1.00)
Consumer Products & Services (0.92)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
(3 more...)

Add feedback

Forward-Backward Selection with Early Dropping

Borboudakis, Giorgos, Tsamardinos, Ioannis

arXiv.org Machine LearningMay-30-2017

Forward-backward selection is one of the most basic and commonly-used feature selection algorithms available. It is also general and conceptually applicable to many different types of data. In this paper, we propose a heuristic that significantly improves its running time, while preserving predictive accuracy. The idea is to temporarily discard the variables that are conditionally independent with the outcome given the selected variable set. Depending on how those variables are reconsidered and reintroduced, this heuristic gives rise to a family of algorithms with increasingly stronger theoretical guarantees. In distributions that can be faithfully represented by Bayesian networks or maximal ancestral graphs, members of this algorithmic family are able to correctly identify the Markov blanket in the sample limit. In experiments we show that the proposed heuristic increases computational efficiency by about two orders of magnitude in high-dimensional problems, while selecting fewer variables and retaining predictive performance. Furthermore, we show that the proposed algorithm and feature selection with LASSO perform similarly when restricted to select the same number of variables, making the proposed algorithm an attractive alternative for problems where no (efficient) algorithm for LASSO exists.

algorithm, artificial intelligence, machine learning, (15 more...)

arXiv.org Machine Learning

1705.1077

Country:

Europe (0.46)
North America > United States (0.28)

Genre: Research Report > Experimental Study (0.94)

Industry: Health & Medicine (0.67)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.88)

Add feedback

Fairness in Criminal Justice Risk Assessments: The State of the Art

Berk, Richard, Heidari, Hoda, Jabbari, Shahin, Kearns, Michael, Roth, Aaron

arXiv.org Machine LearningMay-27-2017

Objectives: Discussions of fairness in criminal justice risk assessments typically lack conceptual precision. Rhetoric too often substitutes for careful analysis. In this paper, we seek to clarify the tradeoffs between different kinds of fairness and between fairness and accuracy. Methods: We draw on the existing literatures in criminology, computer science and statistics to provide an integrated examination of fairness and accuracy in criminal justice risk assessments. We also provide an empirical illustration using data from arraignments. Results: We show that there are at least six kinds of fairness, some of which are incompatible with one another and with accuracy. Conclusions: Except in trivial cases, it is impossible to maximize accuracy and fairness at the same time, and impossible simultaneously to satisfy all kinds of fairness. In practice, a major complication is different base rates across different legally protected groups. There is a need to consider challenging tradeoffs.

accuracy equality, artificial intelligence, machine learning, (16 more...)

arXiv.org Machine Learning

1703.09207

Country: North America > United States > Illinois (0.28)

Genre: Research Report (1.00)

Industry:

Law > Criminal Law (1.00)
Information Technology > Security & Privacy (1.00)
Government > Regional Government > North America Government > United States Government (0.46)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)

Add feedback

Predicting House Sales

@machinelearnbotMay-25-2017, 19:45:14 GMT

Missingness was an important issue with this dataset. I ignored certain missing items, such as a house not being available on Zillow, under the assumption that they item got dropped for technical reasons in the web scraping process. But others I was less confident about.

artificial intelligence, logistic regression, machine learning, (17 more...)

@machinelearnbot

Industry: Banking & Finance > Real Estate (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.50)

Add feedback

Day88: Ridge Regression

#artificialintelligenceMay-24-2017, 07:14:00 GMT

I ended up thinking about ridge regression today, and how foggy my recollection is. Therefore, in today's post I'll look into ridge regression. The Jupyter Notebook for this little project is found here. The data used is the Digit Recognizer MNIST data set available on Kaggle. The data (from the train file) was split 80% for training and 20% for testing.

artificial intelligence, machine learning, ridge regression, (2 more...)

#artificialintelligence

Technology: Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)

Add feedback

An experimental study of graph-based semi-supervised classification with additional node information

Lebichot, Bertrand, Saerens, Marco

arXiv.org Machine LearningMay-24-2017

The volume of data generated by internet and social networks is increasing every day, and there is a clear need for efficient ways of extracting useful information from them. As those data can take different forms, it is important to use all the available data representations for prediction. In this paper, we focus our attention on supervised classification using both regular plain, tabular, data and structural information coming from a network structure. 14 techniques are investigated and compared in this study and can be divided in three classes: the first one uses only the plain data to build a classification model, the second uses only the graph structure and the last uses both information sources. The relative performances in these three cases are investigated. Furthermore, the effect of using a graph embedding and well-known indicators in spatial statistics is also studied. Possible applications are automatic classification of web pages or other linked documents, of people in a social network or of proteins in a biological complex system, to name a few. Based on our comparison, we draw some general conclusions and advices to tackle this particular classification task: some datasets can be better explained by their graph structure (graph-driven), or by their feature set (features-driven). The most efficient methods are discussed in both cases.

artificial intelligence, dataset, machine learning, (16 more...)

arXiv.org Machine Learning

1705.08716

Country: Europe > United Kingdom > England (0.14)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)

Industry:

Health & Medicine (0.92)
Information Technology > Services (0.54)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.93)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (0.92)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Support Vector Machines (0.69)
(2 more...)

Add feedback

Inclusive Flavour Tagging Algorithm

Likhomanenko, Tatiana, Derkach, Denis, Rogozhnikov, Alex

arXiv.org Machine LearningMay-24-2017

Identifying the flavour of neutral $B$ mesons production is one of the most important components needed in the study of time-dependent $CP$ violation. The harsh environment of the Large Hadron Collider makes it particularly hard to succeed in this task. We present an inclusive flavour-tagging algorithm as an upgrade of the algorithms currently used by the LHCb experiment. Specifically, a probabilistic model which efficiently combines information from reconstructed vertices and tracks using machine learning is proposed. The algorithm does not use information about underlying physics process. It reduces the dependence on the performance of lower level identification capacities and thus increases the overall performance. The proposed inclusive flavour-tagging algorithm is applicable to tag the flavour of $B$ mesons in any proton-proton experiment.

artificial intelligence, machine learning, vertex, (18 more...)

arXiv.org Machine Learning

doi: 10.1088/1742-6596/762/1/012045

1705.08707

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (0.37)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.32)

Add feedback

Machine Learning Basics with Naive Bayes

#artificialintelligenceMay-23-2017, 07:02:37 GMT

She shares her mistakes and what she learned. She also give us her favorite hack for making a soundproof iPad booth for recording.

artificial intelligence, machine learning basic, naive baye, (1 more...)

#artificialintelligence

Technology:

Information Technology > Communications > Mobile (0.76)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.40)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.40)

Add feedback

Exponential error rates of SDP for block models: Beyond Grothendieck's inequality

Fei, Yingjie, Chen, Yudong

arXiv.org Machine LearningMay-23-2017

In this paper we consider the cluster estimation problem under the Stochastic Block Model. We show that the semidefinite programming (SDP) formulation for this problem achieves an error rate that decays exponentially in the signal-to-noise ratio. The error bound implies weak recovery in the sparse graph regime with bounded expected degrees, as well as exact recovery in the dense regime. An immediate corollary of our results yields error bounds under the Censored Block Model. Moreover, these error bounds are robust, continuing to hold under heterogeneous edge probabilities and a form of the so-called monotone attack. Significantly, this error rate is achieved by the SDP solution itself without any further pre- or post-processing, and improves upon existing polynomially-decaying error bounds proved using the Grothendieck\textquoteright s inequality. Our analysis has two key ingredients: (i) showing that the graph has a well-behaved spectrum, even in the sparse regime, after discounting an exponentially small number of edges, and (ii) an order-statistics argument that governs the final error rate. Both arguments highlight the implicit regularization effect of the SDP formulation.

artificial intelligence, inequality, machine learning, (18 more...)

arXiv.org Machine Learning

1705.08391

Country:

North America > United States (0.46)
Europe (0.45)

Genre: Research Report (0.84)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.93)

Add feedback