AITopics | Accuracy

Accuracy

Understanding Naïve Bayes Classifier Using R – R-posts.com

#artificialintelligence | Jan-23-2018, 02:21:09 GMT

Chaitanya Sagar is the Founder and CEO of Perceptive Analytics. Perceptive Analytics has been chosen as one of the top 10 analytics companies to watch out for by Analytics India Magazine.

machine learning, perceptive analytic, naïve bayes classifier, (2 more...)

#artificialintelligence

Country: Asia > India (0.44)

Industry: Information Technology > Services (0.54)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.85)

The best metric to measure accuracy of classification models – CleverTap

#artificialintelligence | Jan-22-2018, 22:11:06 GMT

As an analyst, if you are looking for a metric to measure and maximize the overall accuracy of a classification model, the Matthews correlation coefficient (MCC) seems to be the best bet, since it is not only easily interpretable but also robust to changes in the prediction goal.
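To make the summary concrete, here is a minimal sketch of MCC computed from the four confusion-matrix counts, next to plain accuracy. The function names are illustrative and not taken from the linked article. Returning 0.0 when a marginal total is zero is a common convention, assumed here.

```python
import math


def accuracy(tp: int, fp: int, fn: int, tn: int) -> float:
    """Fraction of all predictions that are correct."""
    return (tp + tn) / (tp + fp + fn + tn)


def matthews_corrcoef(tp: int, fp: int, fn: int, tn: int) -> float:
    """Matthews correlation coefficient from binary confusion-matrix counts."""
    numerator = tp * tn - fp * fn
    denominator = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    # Assumed convention: report 0.0 when any row or column total is zero.
    return numerator / denominator if denominator else 0.0


# A classifier that labels every case positive on a 95/5 imbalanced set:
# accuracy looks high, while MCC reports no correlation at all.
always_positive = dict(tp=95, fp=5, fn=0, tn=0)
print(accuracy(**always_positive))           # 0.95
print(matthews_corrcoef(**always_positive))  # 0.0
```

Swapping which class counts as "positive" (exchanging tp with tn and fp with fn) leaves the MCC value unchanged, which is one way to read "robust to changes in the prediction goal".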

accuracy, classification model, confusion matrix, (14 more...)

#artificialintelligence

Genre: Research Report (0.31)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)

Offline A/B testing for Recommender Systems

Gilotte, Alexandre, Calauzènes, Clément, Nedelec, Thomas, Abraham, Alexandre, Dollé, Simon

arXiv.org Machine Learning | Jan-22-2018

Online A/B testing evaluates the impact of a new technology by running it in a real production environment and testing its performance on a subset of the users of the platform. It is a well-known practice to run a preliminary offline evaluation on historical data to iterate faster on new ideas, and to detect poor policies in order to avoid losing money or breaking the system. For such offline evaluations, we are interested in methods that can compute offline an estimate of the potential uplift of performance generated by a new technology. Offline performance can be measured using estimators known as counterfactual or off-policy estimators. Traditional counterfactual estimators, such as capped importance sampling or normalised importance sampling, exhibit unsatisfying bias-variance compromises when experimenting on personalized product recommendation systems. To overcome this issue, we model the bias incurred by these estimators rather than bound it in the worst case, which leads us to propose a new counterfactual estimator. We provide a benchmark of the different estimators showing their correlation with business metrics observed by running online A/B tests on a large-scale commercial recommender system.
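The two traditional estimators named in the abstract can be sketched as follows. This is a minimal illustration of their textbook forms, not the new estimator proposed in the paper. The tuple layout `(reward, p_new, p_logged)` and the function names are assumptions made for the example.

```python
from typing import List, Tuple

# Hypothetical log format: observed reward, probability of the logged action
# under the new policy, and its probability under the logging policy.
LoggedEvent = Tuple[float, float, float]


def importance_weights(events: List[LoggedEvent]) -> List[float]:
    """Ratio of new-policy to logging-policy probability for each event."""
    return [p_new / p_logged for _, p_new, p_logged in events]


def capped_is(events: List[LoggedEvent], cap: float) -> float:
    """Capped importance sampling: clip each weight at `cap` to limit variance."""
    weights = importance_weights(events)
    total = sum(min(w, cap) * r for w, (r, _, _) in zip(weights, events))
    return total / len(events)


def normalised_is(events: List[LoggedEvent]) -> float:
    """Normalised importance sampling: divide by the sum of the weights."""
    weights = importance_weights(events)
    total = sum(w * r for w, (r, _, _) in zip(weights, events))
    return total / sum(weights)


events = [(1.0, 0.5, 0.25), (0.0, 0.2, 0.4), (1.0, 0.9, 0.1)]
print(capped_is(events, cap=5.0))
print(normalised_is(events))
```

Capping lowers variance at the cost of bias, because large weights are truncated. Normalising keeps the estimate within the range of observed rewards. Both are instances of the bias-variance compromise the abstract describes.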

artificial intelligence, estimator, machine learning, (17 more...)

arXiv.org Machine Learning

doi: 10.1145/3159652.3159687

1801.0703

Genre: Research Report (0.64)

Industry: Information Technology (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.48)

Optimizing Prediction Intervals by Tuning Random Forest via Meta-Validation

Bayley, Sean, Falessi, Davide

arXiv.org Machine Learning | Jan-22-2018

Recent studies have shown that tuning prediction models increases prediction accuracy and that Random Forest can be used to construct prediction intervals. However, to the best of our knowledge, no study has investigated the need to, and the manner in which one can, tune Random Forest for optimizing prediction intervals; this paper aims to fill this gap. We explore a tuning approach that combines an effectively exhaustive search with a validation technique on a single Random Forest parameter. This paper investigates which, out of eight validation techniques, are beneficial for tuning, i.e., which automatically choose a Random Forest configuration constructing prediction intervals that are reliable and with a smaller width than the default configuration. Additionally, we present and validate three meta-validation techniques to determine which are beneficial, i.e., those which automatically choose a beneficial validation technique. This study uses data from our industrial partner (Keymind Inc.) and the Tukutuku Research Project, related to post-release defect prediction and Web application effort estimation, respectively. Results from our study indicate that: i) the default configuration is frequently unreliable, ii) most of the validation techniques, including previously successfully adopted ones such as 50/50 holdout and bootstrap, are counterproductive in most of the cases, and iii) the 75/25 holdout meta-validation technique is always beneficial; i.e., it avoids the likely counterproductive effects of validation techniques.
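The two properties the abstract compares, reliability and width, can be measured with a few lines of code. The sketch below assumes that "reliable" means empirical coverage at or above a nominal level, which is a common reading but is not spelled out in the abstract. The `is_beneficial` helper and its `nominal` parameter are hypothetical.

```python
from typing import List, Sequence, Tuple

Interval = Tuple[float, float]  # (lower bound, upper bound)


def interval_coverage(intervals: Sequence[Interval], actuals: Sequence[float]) -> float:
    """Share of actual values that fall inside their prediction interval."""
    hits = sum(lo <= y <= hi for (lo, hi), y in zip(intervals, actuals))
    return hits / len(actuals)


def mean_width(intervals: Sequence[Interval]) -> float:
    """Average interval width; narrower is more informative."""
    return sum(hi - lo for lo, hi in intervals) / len(intervals)


def is_beneficial(tuned: Sequence[Interval], default: Sequence[Interval],
                  actuals: Sequence[float], nominal: float = 0.90) -> bool:
    """Assumed criterion: tuned intervals stay reliable and are narrower."""
    reliable = interval_coverage(tuned, actuals) >= nominal
    return reliable and mean_width(tuned) < mean_width(default)


actuals: List[float] = [10, 12, 9, 15]
default = [(0, 30)] * 4
tuned = [(8, 13), (9, 14), (7, 11), (12, 17)]
print(interval_coverage(tuned, actuals), mean_width(tuned))
print(is_beneficial(tuned, default, actuals))
```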

artificial intelligence, configuration, machine learning, (15 more...)

arXiv.org Machine Learning

1801.07194

Country: North America > United States (1.00)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (0.68)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.67)
(2 more...)

Fair Inference On Outcomes

Nabi, Razieh, Shpitser, Ilya

arXiv.org Machine Learning | Jan-21-2018

In this paper, we consider the problem of fair statistical inference involving outcome variables. Examples include classification and regression problems, and estimating treatment effects in randomized trials or observational data. The issue of fairness arises in such problems where some covariates or treatments are "sensitive," in the sense of having the potential to create discrimination. In this paper, we argue that the presence of discrimination can be formalized in a sensible way as the presence of an effect of a sensitive covariate on the outcome along certain causal pathways, a view which generalizes (Pearl, 2009). A fair outcome model can then be learned by solving a constrained optimization problem. We discuss a number of complications that arise in classical statistical inference due to this view and provide workarounds based on recent work in causal and semi-parametric inference.

artificial intelligence, data mining, machine learning, (18 more...)

arXiv.org Machine Learning

1705.10378

Country: North America > United States (1.00)

Genre: Research Report > Experimental Study (1.00)

Industry:

Law (1.00)
Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (0.67)
Health & Medicine > Therapeutic Area > Immunology (0.67)

Technology:

Information Technology > Data Science > Data Mining (0.93)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.68)
(2 more...)

Contingency Tables in R

@machinelearnbot | Jan-20-2018, 16:21:46 GMT

You can use the prop.table()
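In R, prop.table() converts a table of counts into proportions, either of the grand total or within a row or column margin. The sketch below reproduces that idea in plain Python, so that examples on this page share one language. The helper names and the `"row"`/`"column"` margin labels are assumptions for illustration and are not part of R's interface.

```python
from collections import Counter
from typing import Dict, Hashable, Optional, Sequence, Tuple

Cell = Tuple[Hashable, Hashable]  # (row label, column label)


def contingency_table(rows: Sequence[Hashable], cols: Sequence[Hashable]) -> Counter:
    """Count how often each (row label, column label) pair occurs."""
    return Counter(zip(rows, cols))


def proportions(table: Dict[Cell, int], margin: Optional[str] = None) -> Dict[Cell, float]:
    """Cell shares of the grand total, or within each row or column."""
    if margin is None:
        total = sum(table.values())
        return {cell: n / total for cell, n in table.items()}
    if margin not in ("row", "column"):
        raise ValueError("margin must be None, 'row', or 'column'")
    axis = 0 if margin == "row" else 1
    totals: Counter = Counter()
    for cell, n in table.items():
        totals[cell[axis]] += n
    return {cell: n / totals[cell[axis]] for cell, n in table.items()}


gender = ["F", "F", "M", "M", "M"]
smoker = ["yes", "no", "no", "no", "yes"]
table = contingency_table(gender, smoker)
print(proportions(table))                # shares of all five observations
print(proportions(table, margin="row"))  # shares within each gender
```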

artificial intelligence, contingency table, machine learning, (16 more...)

@machinelearnbot

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.53)
Information Technology > Communications > Social Media (0.40)

Optimizing Marketing with AI : An application to Cross Selling

#artificialintelligence | Jan-19-2018, 23:47:13 GMT

We mainly used Apache Spark to preprocess data and scikit-learn to train our model. A common working framework for data scientists is scikit-learn. It is a Python package containing all the machine learning models a data scientist could desire. It allows us to build powerful AI models on single machines with state-of-the-art performance. We used scikit-learn to build one model for each of our 60 stores in order to select those where we could extract the most value from cross-selling.

artificial intelligence, machine learning, optimizing marketing, (4 more...)

#artificialintelligence

Technology: Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.66)

Crime Prediction Algorithms Aren't Very Good At Predicting Crimes

International Business Times | Jan-18-2018, 04:12:11 GMT

Some courts in the U.S., particularly in states from California to New Jersey, use crime-predicting algorithms to determine if a defendant is likely to commit another crime in the future. While the software helps judges decide who gets bail, who goes to jail and who can walk away free, it appears the technology isn't very reliable and opens doors to a more unfair justice system. Dartmouth College researchers Julia Dressel and Hany Farid tackled the issue with the so-called risk assessment algorithms in a paper published in Science Advances. The study examined one popular risk-assessment algorithm, called Compas, and pointed out how the software's recidivism predictions are no different from the answers random people give to online surveys. Farid, who teaches computer science at Dartmouth, and Dressel, who majored in computer science and gender studies at the same school, used Amazon Mechanical Turk in the study.

artificial intelligence, defendant, machine learning, (18 more...)

International Business Times

Country:

North America > United States > New Jersey (0.25)
North America > United States > California (0.25)

Genre: Research Report > Experimental Study (0.36)

Industry:

Law > Criminal Law (0.71)
Education > Curriculum > Subject-Specific Education (0.56)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.31)

Robust Kronecker Component Analysis

Bahri, Mehdi, Panagakis, Yannis, Zafeiriou, Stefanos

arXiv.org Machine Learning | Jan-18-2018

Dictionary learning and component analysis models are fundamental in learning compact representations that are relevant to a given task (feature extraction, dimensionality reduction, denoising, etc.). The model complexity is encoded by means of specific structure, such as sparsity, low-rankness, or nonnegativity. Unfortunately, approaches like K-SVD, which learn dictionaries for sparse coding via Singular Value Decomposition (SVD), are hard to scale to high-volume and high-dimensional visual data, and fragile in the presence of outliers. Conversely, robust component analysis methods such as Robust Principal Component Analysis (RPCA) are able to recover low-complexity (e.g., low-rank) representations from data corrupted with noise of unknown magnitude and support, but do not provide a dictionary that respects the structure of the data (e.g., images), and also involve expensive computations. In this paper, we propose a novel Kronecker-decomposable component analysis model, coined as Robust Kronecker Component Analysis (RKCA), that combines ideas from sparse dictionary learning and robust component analysis. RKCA has several appealing properties, including robustness to gross corruption; it can be used for low-rank modeling, and leverages separability to solve significantly smaller problems. We design an efficient learning algorithm by drawing links with a restricted form of tensor factorization, and analyze its optimality and low-rankness properties. The effectiveness of the proposed approach is demonstrated on real-world applications, namely background subtraction and image denoising and completion, by performing a thorough comparison with the current state of the art.

artificial intelligence, data mining, machine learning, (18 more...)

arXiv.org Machine Learning

1801.06432

Country:

North America > United States (0.28)
Europe > United Kingdom (0.28)
Europe > Greece (0.28)

Genre: Research Report (1.00)

Industry: Education (0.68)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.93)

Safe Policy Improvement with Baseline Bootstrapping

Laroche, Romain, Trichelair, Paul

arXiv.org Machine Learning | Jan-18-2018

A common goal in Reinforcement Learning is to derive a good strategy given a limited batch of data. In this paper, we adopt the safe policy improvement (SPI) approach: we compute a target policy guaranteed to perform at least as well as a given baseline policy. Our SPI strategy, inspired by the knows-what-it-knows paradigms, consists in bootstrapping the target policy with the baseline policy when it does not know. We develop two computationally efficient bootstrapping algorithms, one value-based and one policy-based, both accompanied by theoretical SPI bounds. Three algorithm variants are proposed. We empirically show the limits of the algorithms from the literature on a small stochastic gridworld problem, and then demonstrate that our five algorithms improve not only the worst-case scenarios but also the mean performance.
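The abstract does not give the algorithms themselves, so the following is only one possible reading of "bootstrapping the target policy with the baseline policy when it does not know", shown for a single state. Treating an action as unknown when its observation count is below a threshold, and moving the remaining probability mass to the best well-observed action, are assumptions. The names `bootstrapped_policy` and `min_count` are hypothetical.

```python
from typing import Dict


def bootstrapped_policy(q_values: Dict[str, float],
                        baseline: Dict[str, float],
                        counts: Dict[str, int],
                        min_count: int) -> Dict[str, float]:
    """Target policy for one state: copy the baseline where data is scarce."""
    uncertain = {a for a in baseline if counts.get(a, 0) < min_count}
    trusted = [a for a in baseline if a not in uncertain]
    # Rarely observed actions keep exactly their baseline probability.
    policy = {a: (baseline[a] if a in uncertain else 0.0) for a in baseline}
    if trusted:
        # The mass the baseline gave to well-observed actions goes to the
        # one with the highest estimated value.
        best = max(trusted, key=lambda a: q_values[a])
        policy[best] = sum(baseline[a] for a in trusted)
    return policy


baseline = {"left": 0.5, "right": 0.3, "stay": 0.2}
counts = {"left": 50, "right": 2, "stay": 40}
q_values = {"left": 1.0, "right": 5.0, "stay": 2.0}
print(bootstrapped_policy(q_values, baseline, counts, min_count=10))
```

In this example "right" has the highest estimated value but was seen only twice, so its probability stays at the baseline level instead of being increased on thin evidence.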

artificial intelligence, machine learning, reinforcement learning, (15 more...)

arXiv.org Machine Learning

1712.06924

Country: North America > United States > California (0.28)

Genre: Research Report (0.82)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.81)
