AITopics | Accuracy


Accuracy


Equitability Analysis of the Maximal Information Coefficient, with Comparisons

Reshef, David, Reshef, Yakir, Mitzenmacher, Michael, Sabeti, Pardis

arXiv.org Machine Learning, Aug-14-2013

A measure of dependence is said to be equitable if it gives similar scores to equally noisy relationships of different types. Equitability is important in data exploration when the goal is to identify a relatively small set of strongest associations within a dataset as opposed to finding as many non-zero associations as possible, which often are too many to sift through. Thus an equitable statistic, such as the maximal information coefficient (MIC), can be useful for analyzing high-dimensional data sets. Here, we explore both equitability and the properties of MIC, and discuss several aspects of the theory and practice of MIC. We begin by presenting an intuition behind the equitability of MIC through the exploration of the maximization and normalization steps in its definition. We then examine the speed and optimality of the approximation algorithm used to compute MIC, and suggest some directions for improving both. Finally, we demonstrate in a range of noise models and sample sizes that MIC is more equitable than natural alternatives, such as mutual information estimation and distance correlation.

artificial intelligence, equitability, machine learning, (18 more...)

arXiv.org Machine Learning

1301.6314

Country: North America > United States (0.28)

Genre: Research Report (0.82)

Industry:

Health & Medicine > Therapeutic Area (0.46)
Health & Medicine > Pharmaceuticals & Biotechnology (0.46)

Technology:

Information Technology > Data Science (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.47)


A better Beta for the H measure of classification performance

Hand, David J., Anagnostopoulos, Christoforos

arXiv.org Machine Learning, Aug-1-2013

Department of Mathematics, South Kensington Campus, Imperial College London, London SW7 2AZ

Abstract: The area under the ROC curve is widely used as a measure of performance of classification rules. However, it has recently been shown that the measure is fundamentally incoherent, in the sense that it treats the relative severities of misclassifications differently when different classifiers are used. To overcome this, [5, 6] proposed the H measure, which allows a given researcher to fix the distribution of relative severities to a classifier-independent setting on a given problem.

Keywords: supervised classification, classifier performance, AUC, ROC curve, H measure

1. Introduction

The aim of supervised classification is to construct a rule which will allow one to assign objects to one of M classes, on the basis of vectors of descriptive features of those objects. The rule will be constructed using a 'training' set (machine learning and pattern recognition terminology) or 'design' set (statistics terminology) of data which includes both descriptive vectors and true classes for a sample of objects.
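For readers unfamiliar with the quantity this abstract criticizes, the following is a minimal sketch of the area under the ROC curve computed as a rank statistic: the probability that a randomly drawn class 1 object scores higher than a randomly drawn class 0 object, with ties counted as one half. The function name `auc` and the convention that class 1 tends to receive higher scores are assumptions made for illustration; the sketch does not implement the H measure.

```python
def auc(scores_class0, scores_class1):
    """Empirical AUC: fraction of (class 0, class 1) pairs ranked correctly.

    Assumes higher scores indicate class 1. Ties contribute 1/2.
    """
    if not scores_class0 or not scores_class1:
        raise ValueError("both classes need at least one score")
    wins = 0.0
    for s1 in scores_class1:
        for s0 in scores_class0:
            if s1 > s0:
                wins += 1.0      # pair ranked correctly
            elif s1 == s0:
                wins += 0.5      # tie counts as half
    return wins / (len(scores_class0) * len(scores_class1))


if __name__ == "__main__":
    # Hypothetical scores, chosen only to exercise the function.
    print(auc([0.1, 0.4, 0.35], [0.8, 0.3, 0.9]))
```

The pairwise loop is quadratic in the sample size; it is written this way for readability rather than speed.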

class 0, class 1, classifier, (16 more...)

arXiv.org Machine Learning

1202.2564

Country:

North America > United States > North Carolina > Wake County > Cary (0.04)
North America > United States > New York (0.04)
Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)

Genre: Research Report (0.64)

Industry: Health & Medicine (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)


Accelerated Time-of-Flight Mass Spectrometry

Ibrahimi, Morteza, Montanari, Andrea, Moore, George S

arXiv.org Machine Learning, Jul-28-2013

We study a simple modification to conventional time-of-flight mass spectrometry (TOFMS) in which a variable and (pseudo-)random pulsing rate is used, allowing traces from different pulses to overlap. This modification requires little alteration to the currently employed hardware. However, it requires a reconstruction method to recover the spectrum from highly aliased traces. We propose and demonstrate an efficient algorithm that can process massive TOFMS data using computational resources that can be considered modest by today's standards. This approach can be used to improve duty cycle, speed, and mass resolving power of TOFMS at the same time. We expect this to extend the applicability of TOFMS to new domains.

artificial intelligence, machine learning, spectrum, (16 more...)

arXiv.org Machine Learning

1212.4269

Country: North America > United States (0.68)

Genre: Research Report (0.50)

Industry: Health & Medicine > Pharmaceuticals & Biotechnology (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.73)


Does generalization performance of $l^q$ regularization learning depend on $q$? A negative example

Lin, Shaobo, Xu, Chen, Zeng, Jingshan, Fang, Jian

arXiv.org Machine Learning, Jul-24-2013

$l^q$-regularization has been demonstrated to be an attractive technique in machine learning and statistical modeling. It attempts to improve the generalization (prediction) capability of a machine (model) by appropriately shrinking its coefficients. The shape of an $l^q$ estimator varies with the choice of the regularization order $q$. In particular, $l^1$ leads to the LASSO estimate, while $l^{2}$ corresponds to the smooth ridge regression. This makes the order $q$ a potential tuning parameter in applications. To facilitate the use of $l^{q}$-regularization, we seek a modeling strategy in which an elaborate selection of $q$ is avoidable. In this spirit, we place our investigation within a general framework of $l^{q}$-regularized kernel learning under a sample dependent hypothesis space (SDHS). For a designated class of kernel functions, we show that all $l^{q}$ estimators for $0< q < \infty$ attain similar generalization error bounds. These estimated bounds are almost optimal in the sense that, up to a logarithmic factor, the upper and lower bounds are asymptotically identical. This finding tentatively reveals that, in some modeling contexts, the choice of $q$ might not have a strong impact on the generalization capability. From this perspective, $q$ can be arbitrarily specified, or specified merely by other non-generalization criteria such as smoothness, computational complexity, sparsity, etc.
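The abstract does not state the optimization problem, so the following is only one standard way to write $l^{q}$-regularized kernel learning over a sample dependent hypothesis space. The symbols are assumptions introduced here for illustration: $m$ samples $(x_j, y_j)$, kernel $K$, coefficients $a_i$, and regularization parameter $\lambda > 0$.

```latex
\[
  f_{\lambda,q} \;=\; \sum_{i=1}^{m} a_i^{*}\, K(x_i,\cdot),
  \qquad
  (a_1^{*},\dots,a_m^{*}) \;\in\; \arg\min_{a \in \mathbb{R}^{m}}
  \left\{
    \frac{1}{m}\sum_{j=1}^{m}\Bigl(\sum_{i=1}^{m} a_i\, K(x_i,x_j) - y_j\Bigr)^{2}
    \;+\; \lambda \sum_{i=1}^{m} \lvert a_i \rvert^{q}
  \right\}
\]
```

Under this formulation, $q = 1$ gives a LASSO-type penalty on the coefficients and $q = 2$ a ridge-type penalty, matching the two special cases named in the abstract.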

artificial intelligence, generalization capability, machine learning, (17 more...)

arXiv.org Machine Learning

1307.6616

Country: North America > United States (0.46)

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.64)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.34)


When is the majority-vote classifier beneficial?

Zhu, Mu

arXiv.org Machine Learning, Jul-24-2013

In his seminal work, Schapire (1990) proved that weak classifiers could be improved to achieve arbitrarily high accuracy, but he never implied that a simple majority-vote mechanism could always do the trick. By comparing the asymptotic misclassification error of the majority-vote classifier with the average individual error, we discover an interesting phase-transition phenomenon. For binary classification with equal prior probabilities, our result implies that, for the majority-vote mechanism to work, the collection of weak classifiers must meet the minimum requirement of having an average true positive rate of at least 50% and an average false positive rate of at most 50%.
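The 50% threshold in this abstract can be illustrated with a much simpler textbook calculation than the paper's asymptotic analysis. The sketch below assumes, purely for illustration, that the weak classifiers are independent and share a common accuracy `p`; neither assumption comes from the abstract, and the function name `majority_vote_accuracy` is hypothetical.

```python
from math import comb


def majority_vote_accuracy(p, n):
    """Accuracy of a majority vote over n independent classifiers.

    Each classifier is assumed to be correct with the same probability p,
    independently of the others. n must be odd so that ties cannot occur.
    """
    if not 0.0 <= p <= 1.0:
        raise ValueError("p must be a probability")
    if n < 1 or n % 2 == 0:
        raise ValueError("n must be a positive odd integer")
    k_min = n // 2 + 1  # smallest number of correct votes that wins
    return sum(comb(n, k) * p**k * (1 - p) ** (n - k)
               for k in range(k_min, n + 1))


if __name__ == "__main__":
    for p in (0.4, 0.5, 0.6):
        print(p, majority_vote_accuracy(p, 101))
```

Under these assumptions the vote amplifies accuracy when `p` is above one half and degrades it when `p` is below one half, which is the kind of phase transition the abstract describes in terms of average true positive and false positive rates.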

artificial intelligence, classifier, machine learning, (15 more...)

arXiv.org Machine Learning

1307.6522

Country: North America > United States (0.47)

Genre: Research Report > New Finding (0.34)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)


A Massively Parallel Associative Memory Based on Sparse Neural Networks

Yao, Zhe, Gripon, Vincent, Rabbat, Michael G.

arXiv.org Artificial Intelligence, Jul-21-2013

Associative memories store content in such a way that the content can be later retrieved by presenting the memory with a small portion of the content, rather than presenting the memory with an address as in more traditional memories. Associative memories are used as building blocks for algorithms within database engines, anomaly detection systems, compression algorithms, and face recognition systems. A classical example of an associative memory is the Hopfield neural network. Recently, Gripon and Berrou have introduced an alternative construction which builds on ideas from the theory of error correcting codes and which greatly outperforms the Hopfield network in capacity, diversity, and efficiency. In this paper we implement a variation of the Gripon-Berrou associative memory on a general purpose graphics processing unit (GPU). The work of Gripon and Berrou proposes two retrieval rules, sum-of-sum and sum-of-max. The sum-of-sum rule uses only matrix-vector multiplication and is easily implemented on the GPU. The sum-of-max rule is much less straightforward to implement because it involves non-linear operations. However, the sum-of-max rule gives significantly better retrieval error rates. We propose a hybrid rule tailored for implementation on a GPU which achieves an 880-fold speedup without sacrificing any accuracy.

artificial intelligence, machine learning, neuron, (17 more...)

arXiv.org Artificial Intelligence

1303.7032

Country:

North America > United States (0.46)
Europe > France (0.28)
North America > Canada (0.28)

Genre: Research Report (0.40)

Technology:

Information Technology > Artificial Intelligence > Systems & Languages > Programming Languages (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.66)


The Cluster Graphical Lasso for improved estimation of Gaussian graphical models

Tan, Kean Ming, Witten, Daniela, Shojaie, Ali

arXiv.org Machine Learning, Jul-19-2013

We consider the task of estimating a Gaussian graphical model in the high-dimensional setting. The graphical lasso, which involves maximizing the Gaussian log likelihood subject to an l1 penalty, is a well-studied approach for this task. We begin by introducing a surprising connection between the graphical lasso and hierarchical clustering: the graphical lasso in effect performs a two-step procedure, in which (1) single linkage hierarchical clustering is performed on the variables in order to identify connected components, and then (2) an l1-penalized log likelihood is maximized on the subset of variables within each connected component. In other words, the graphical lasso determines the connected components of the estimated network via single linkage clustering. Unfortunately, single linkage clustering is known to perform poorly in certain settings. Therefore, we propose the cluster graphical lasso, which involves clustering the features using an alternative to single linkage clustering, and then performing the graphical lasso on the subset of variables within each cluster. We establish model selection consistency for this technique, and demonstrate its improved performance relative to the graphical lasso in a simulation study, as well as in applications to an equities data set, a university webpage data set, and a gene expression data set.
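Step (1) of the two-step view described in this abstract can be sketched in a few lines. The sketch assumes, as an illustration not spelled out in the abstract, that the similarity between two variables is the absolute value of their sample covariance and that the single linkage dendrogram is cut at the penalty level `lam`; two variables then land in the same component whenever a chain of pairs with absolute covariance above `lam` links them. The function name `connected_components` and the example matrix are hypothetical.

```python
def connected_components(S, lam):
    """Group variables linked by chains of entries with |S[i][j]| > lam.

    S is a symmetric p x p matrix given as a list of lists. The result is
    a sorted list of components, each a sorted list of variable indices.
    """
    p = len(S)
    parent = list(range(p))  # union-find forest, one node per variable

    def find(i):
        # Follow parent links to the root, compressing the path as we go.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    for i in range(p):
        for j in range(i + 1, p):
            if abs(S[i][j]) > lam:
                ri, rj = find(i), find(j)
                if ri != rj:
                    parent[rj] = ri  # merge the two components

    groups = {}
    for i in range(p):
        groups.setdefault(find(i), []).append(i)
    return sorted(groups.values())


if __name__ == "__main__":
    # Hypothetical 4-variable covariance matrix.
    S = [[1.0, 0.8, 0.0, 0.0],
         [0.8, 1.0, 0.3, 0.0],
         [0.0, 0.3, 1.0, 0.05],
         [0.0, 0.0, 0.05, 1.0]]
    print(connected_components(S, 0.2))
```

Step (2), the l1-penalized likelihood fit within each component, is not shown. The cluster graphical lasso proposed in the abstract would replace this grouping step with a different clustering method.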

artificial intelligence, graphical lasso, machine learning, (17 more...)

arXiv.org Machine Learning

1307.5339

Country: North America > United States (0.68)

Genre: Research Report (0.82)

Industry:

Banking & Finance > Trading (0.68)
Health & Medicine > Pharmaceuticals & Biotechnology (0.66)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (0.87)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.70)


Error Rate Bounds in Crowdsourcing Models

Li, Hongwei, Yu, Bin, Zhou, Dengyong

arXiv.org Machine Learning, Jul-10-2013

Crowdsourcing is an effective tool for human-powered computation on many tasks that are challenging for computers. In this paper, we provide finite-sample exponential bounds on the error rate (in probability and in expectation) of hyperplane binary labeling rules under the Dawid-Skene crowdsourcing model. The bounds can be applied to analyze many common prediction methods, including majority voting and weighted majority voting. These bounds could be useful for controlling the error rate and designing better algorithms. We show that the oracle Maximum A Posteriori (MAP) rule approximately optimizes our upper bound on the mean error rate for any hyperplane binary labeling rule, and propose a simple data-driven weighted majority voting (WMV) rule (called one-step WMV) that attempts to approximate the oracle MAP and has a provable theoretical guarantee on the error rate. Moreover, we use simulated and real data to demonstrate that the data-driven EM-MAP rule is a good approximation to the oracle MAP rule, and that the mean error rate of the data-driven EM-MAP rule is also bounded by the mean error rate bound of the oracle MAP rule with estimated parameters plugged into the bound.
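A weighted majority vote of the kind analyzed in this abstract can be sketched as the sign of a weighted sum of worker labels. The sketch below makes several simplifying assumptions that are not taken from the abstract: labels are coded as +1 and -1 (0 for a missing label), each worker has a single known accuracy (a one-coin simplification of the Dawid-Skene model), the two classes are equally likely, and ties are broken toward +1. The function names are hypothetical, and this is not the paper's one-step WMV rule, which estimates the weights from data.

```python
from math import log


def log_odds_weights(accuracies):
    """Weight log(p / (1 - p)) for each worker accuracy p in (0, 1)."""
    for p in accuracies:
        if not 0.0 < p < 1.0:
            raise ValueError("accuracies must lie strictly between 0 and 1")
    return [log(p / (1.0 - p)) for p in accuracies]


def weighted_majority_vote(labels, weights):
    """Return +1 or -1 according to the sign of the weighted label sum.

    labels: +1 / -1 per worker, 0 for a missing label.
    weights: one real weight per worker. Ties are broken toward +1.
    """
    if len(labels) != len(weights):
        raise ValueError("labels and weights must have the same length")
    score = sum(w * y for w, y in zip(weights, labels))
    return 1 if score >= 0 else -1


if __name__ == "__main__":
    labels = [1, -1, -1]             # hypothetical worker answers
    accuracies = [0.95, 0.6, 0.6]    # hypothetical known accuracies
    print(weighted_majority_vote(labels, [1.0, 1.0, 1.0]))          # plain majority
    print(weighted_majority_vote(labels, log_odds_weights(accuracies)))
```

With equal weights the rule reduces to plain majority voting. With log-odds weights, the single highly accurate worker in the example outweighs the two weaker workers, so the two rules disagree.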

artificial intelligence, error rate, machine learning, (16 more...)

arXiv.org Machine Learning

1307.2674

Country: North America > United States > California (0.28)

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.46)


A Generalized Student-t Based Approach to Mixed-Type Anomaly Detection

Lu, Yen-Cheng (Virginia Tech) | Chen, Feng (Carnegie Mellon University) | Chen, Yang (Virginia Tech) | Lu, Chang-Tien (Virginia Tech)

AAAI Conferences, Jul-9-2013

Anomaly detection for mixed-type data is an important problem that has not been well addressed in the machine learning field. There are two challenging issues for mixed-type datasets, namely modeling mutual correlations between mixed-type attributes and capturing large variations due to anomalies. This paper presents BuffDetect, a robust error buffering approach for anomaly detection in mixed-type datasets. A new variant of the generalized linear model is proposed to model the dependency between mixed-type attributes. The model incorporates an error buffering component based on the Student-t distribution to absorb the variations caused by anomalies. However, because of the non-Gaussian design, the problem becomes analytically intractable. We propose a novel Bayesian inference approach, which integrates Laplace approximation and several computational optimizations, and is able to efficiently approximate the posterior of high dimensional latent variables by iteratively updating the latent variables in groups. Extensive experimental evaluations based on 13 benchmark datasets demonstrate the effectiveness and efficiency of BuffDetect.

anomaly, dataset, detection, (16 more...)

AAAI Conferences

Twenty-Seventh AAAI Conference on Artificial Intelligence

Country:

South America > Paraguay > Asunción > Asunción (0.04)
North America > United States > New York (0.04)
North America > United States > Virginia (0.04)
(3 more...)

Technology:

Information Technology > Data Science > Data Mining > Anomaly Detection (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.99)


Re-Ranking Recommendations Based on Predicted Short-Term Interests - A Protocol and First Experiment

Jannach, Dietmar (TU Dortmund University) | Lerche, Lukas (TU Dortmund University) | Gdaniec, Matthäus (TU Dortmund University)

AAAI Conferences, Jul-9-2013

The recommendation of additional shopping items that are potentially interesting for the customer has become a standard feature of modern online stores. In academia, research on recommender systems (RS) is mostly centered around approaches that rely on explicit item ratings and long-term user profiles. In practical environments, however, such rating information is often very sparse, and for a large fraction of the users very little is known about their preferences. Furthermore, particularly when the shop offers products from a variety of categories, the decision of what should be recommended can strongly depend on the user's current short-term interests and the navigational context. In this paper, we report the results of an initial experimental analysis evaluating the predictive accuracy of different contextualized and non-contextualized recommendation strategies and discuss the question of appropriate experimental designs for such types of evaluations. For that purpose, we introduce a parameterizable protocol that supports session-specific accuracy measurements. Our analysis, which was based on log data obtained from a large online retailer for clothing and lifestyle products, shows that even a comparatively simple contextual post-processing approach based on product features can leverage short-term user interests to increase the accuracy of the recommendations.

artificial intelligence, machine learning, predicted short-term interest, (2 more...)

AAAI Conferences

Workshops at the Twenty-Seventh AAAI Conference on Artificial Intelligence

Industry: Retail > Online (0.53)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.53)
