AITopics | Accuracy

Collaborating Authors

Accuracy

News Overviews Instructional Materials AI-Alerts Classics

Kernel Stein Tests for Multiple Model Comparison

Lim, Jen Ning, Yamada, Makoto, Schölkopf, Bernhard, Jitkrittum, Wittawat

arXiv.org Machine LearningOct-27-2019

We address the problem of non-parametric multiple model comparison: given $l$ candidate models, decide whether each candidate is as good as the best one(s) or worse than it. We propose two statistical tests, each controlling a different notion of decision errors. The first test, building on the post selection inference framework, provably controls the number of best models that are wrongly declared worse (false positive rate). The second test is based on multiple correction, and controls the proportion of the models declared worse but are in fact as good as the best (false discovery rate). We prove that under appropriate conditions the first test can yield a higher true positive rate than the second. Experimental results on toy and real (CelebA, Chicago Crime data) problems show that the two tests have high true positive rates with well-controlled error rates. By contrast, the naive approach of choosing the model with the lowest score without correction leads to more false positives.

candidate model, experiment, relmulti, (17 more...)

arXiv.org Machine Learning

1910.12252

Country:

North America > United States > Illinois > Cook County > Chicago (0.24)
Europe > Germany > Baden-Württemberg > Tübingen Region > Tübingen (0.04)
Europe > Slovenia > Drava > Municipality of Benedikt > Benedikt (0.04)
(3 more...)

Genre: Research Report > New Finding (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)

Add feedback

An Active Approach for Model Interpretation

Lu, Jialin, Ester, Martin

arXiv.org Artificial IntelligenceOct-27-2019

Model interpretation, or explanation of a machine learning classifier, aims to extract generalizable knowledge from a trained classifier into a human-understandable format, for various purposes such as model assessment, debugging and trust. From a computaional viewpoint, it is formulated as approximating the target classifier using a simpler interpretable model, such as rule models like a decision set/list/tree. Often, this approximation is handled as standard supervised learning and the only difference is that the labels are provided by the target classifier instead of ground truth. This paradigm is particularly popular because there exists a variety of well-studied supervised algorithms for learning an interpretable classifier. However, we argue that this paradigm is suboptimal for it does not utilize the unique property of the model interpretation problem, that is, the ability to generate synthetic instances and query the target classifier for their labels. We call this the active-query property, suggesting that we should consider model interpretation from an active learning perspective. Following this insight, we argue that the active-query property should be employed when designing a model interpretation algorithm, and that the generation of synthetic instances should be integrated seamlessly with the algorithm that learns the model interpretation. In this paper, we demonstrate that by doing so, it is possible to achieve more faithful interpretation with simpler model complexity. As a technical contribution, we present an active algorithm Active Decision Set Induction (ADS) to learn a decision set, a set of if-else rules, for model interpretation. ADS performs a local search over the space of all decision sets. In every iteration, ADS computes confidence intervals for the value of the objective function of all local actions and utilizes active-query to determine the best one.

algorithm, interpretation, model interpretation, (14 more...)

arXiv.org Artificial Intelligence

1910.12207

Country:

North America > United States > Texas (0.04)
North America > United States > California (0.04)
North America > Canada (0.04)

Genre: Research Report (0.51)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Rule-Based Reasoning (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.46)

Add feedback

Making Fairness an Intrinsic Part of Machine Learning Open Data Science Conference

#artificialintelligenceOct-26-2019, 18:28:29 GMT

Editor's Note: At ODSC Europe 2019, Sray Agarwal will conduct a workshop on fairness and accountability demonstrating how to detect bias and remove bias from ML models. The suitability of Machine Learning models is traditionally measured on its accuracy. A highly accurate model based on metrics like RMSE, MAPE, AUC, ROC, Gini, etc are considered to be high performing models. While such accuracy metrics important, are there other metrics that the data science community has been ignoring so far? The answer is yes--in the pursuit of accuracy, most models sacrifice "fairness" and "interpretability."

fairness, fairness metric, threshold, (12 more...)

#artificialintelligence

Country: Europe (0.26)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.32)

Add feedback

Convolutional Neural Network for Breast Cancer Classification - KDnuggets

#artificialintelligenceOct-26-2019, 06:16:02 GMT

Breast cancer is the second most common cancer in women and men worldwide. In 2012, it represented about 12 percent of all new cancer cases and 25 percent of all cancers in women. Breast cancer starts when cells in the breast begin to grow out of control. These cells usually form a tumor that can often be seen on an x-ray or felt as a lump. The tumor is malignant (cancer) if the cells can grow into (invade) surrounding tissues or spread (metastasize) to distant areas of the body.

batch size, classification, convolutional neural network, (12 more...)

#artificialintelligence

Industry: Health & Medicine > Therapeutic Area > Oncology > Breast Cancer (0.85)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.57)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.54)

Add feedback

Machine Learning Approaches to Predict 6-Month Mortality Among Patients With Cancer

#artificialintelligenceOct-25-2019, 21:14:22 GMT

Question Can machine learning algorithms identify oncology patients at risk of short-term mortality to inform timely conversations between patients and physicians regrading serious illness? Findings In this cohort study of 26 525 patients seen in oncology practices within a large academic health system, machine learning algorithms accurately identified patients at high risk of 6-month mortality with good discrimination and positive predictive value. When the gradient boosting algorithm was applied in real time, most patients who were classified as having high risk were deemed appropriate by oncology clinicians for a conversation regarding serious illness. Meaning In this study, machine learning algorithms accurately identified patients with cancer who were at risk of 6-month mortality, suggesting that these models could facilitate more timely conversations between patients and physicians regarding goals and values. Importance Machine learning algorithms could identify patients with cancer who are at risk of short-term mortality. However, it is unclear how different machine learning algorithms compare and whether they could prompt clinicians to have timely conversations about treatment and end-of-life preferences. Objectives To develop, validate, and compare machine learning algorithms that use structured electronic health record data before a clinic visit to predict mortality among patients with cancer. Design, Setting, and Participants Cohort study of 26 525 adult patients who had outpatient oncology or hematology/oncology encounters at a large academic cancer center and 10 affiliated community practices between February 1, 2016, and July 1, 2016.

algorithm, gradient, mortality, (14 more...)

#artificialintelligence

Country:

North America > United States > California > San Francisco County > San Francisco (0.14)
North America > United States > South Carolina (0.04)
North America > United States > Pennsylvania > Philadelphia County > Philadelphia (0.04)
(2 more...)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)

Industry: Health & Medicine > Therapeutic Area > Oncology (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (0.58)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.33)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.30)

Add feedback

Top 7 Checkpoints To Consider During Machine Learning Production

#artificialintelligenceOct-25-2019, 19:26:11 GMT

A major challenge for any company that is starting out in the realm of data-driven markets is the deployment of machine learning pipelines at full scale for their products. To tap the most out of AI, it is necessary to build service-specific tools and frameworks in addition to the existing models. The best strategy varies from product to product; but the rubrics of machine learning stay the same. To democratise the use of machine learning, Google has condensed their years of research into a paper titled "A Rubric for ML Production Readiness", where they listed out their findings in the form of 28 specific tests that have shown promising results. The offline/online metric relationship can be measured in one or more small scale A/B experiments using an intentionally degraded model.

linkedin, machine learning production, prediction, (13 more...)

#artificialintelligence

Industry: Education (0.31)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.31)

Add feedback

Toward a better trade-off between performance and fairness with kernel-based distribution matching

Prost, Flavien, Qian, Hai, Chen, Qiuwen, Chi, Ed H., Chen, Jilin, Beutel, Alex

arXiv.org Machine LearningOct-25-2019

As recent literature has demonstrated how classifiers often carry unintended biases toward some subgroups, deploying machine learned models to users demands careful consideration of the social consequences. How should we address this problem in a real-world system? How should we balance core performance and fairness metrics? In this paper, we introduce a MinDiff framework for regularizing classifiers toward different fairness metrics and analyze a technique with kernel-based statistical dependency tests. We run a thorough study on an academic dataset to compare the Pareto frontier achieved by different regularization approaches, and apply our kernel-based method to two large-scale industrial systems demonstrating real-world improvements.

classification, fairness, performance and fairness, (15 more...)

arXiv.org Machine Learning

1910.11779

Country:

North America > United States (0.14)
Asia > Middle East > Jordan (0.04)

Genre: Research Report (0.53)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.71)

Add feedback

Incode raises $10 million to verify identities with AI

#artificialintelligenceOct-24-2019, 13:12:40 GMT

Incode, a San Francisco startup developing what it describes as an omnichannel biometric identity platform, today announced that it's raised $10 million in seed funding from undisclosed investors. Founder and CEO Ricardo Amper said that the newfound capital will enable Incode to accelerate the development and rollout of its tools globally, some of which are already being used by major banks, financial institutions, governments, and retailers. "The modern consumer is all about experiences and convenience," said Amper. "What they want is a seamless, consistent and secure way to perform daily tasks like access their ATM, make payments, and access online accounts. Yet, what they get today is quite the opposite. The ecosystem is fragmented by multiple vendors and devices, making processes clunky and inefficient. That's precisely why we've built Incode Omni: to help companies provide a frictionless, secure and convenient experience for the next generation of consumers."

incode, incode raise, verify identity, (5 more...)

#artificialintelligence

Country:

North America > United States > California > San Francisco County > San Francisco (0.30)
South America (0.06)
North America > Central America (0.06)
Europe (0.06)

Industry:

Banking & Finance (0.93)
Information Technology > Security & Privacy (0.37)
Government > Regional Government > North America Government > United States Government (0.35)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.33)

Add feedback

Securing machine learning models against adversarial attacks

#artificialintelligenceOct-24-2019, 06:14:08 GMT

Beware: many defence methods can lead to gradient masking, whether intentional or not. Gradient masking does not guarantee adversarial robustness, and has been shown to be circumventable (Tramèr et al., 2017; Athalye et al, 2018). We hope this article provides helpful insights on how to defend against adversarial examples. Please feel free to provide suggestions in the comment section if we're missing something.

adversarial example, defence method, gradient, (15 more...)

#artificialintelligence

Industry:

Information Technology > Security & Privacy (0.41)
Government > Military (0.41)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.47)

Add feedback

Torus Graphs for Multivariate Phase Coupling Analysis

Klein, Natalie, Orellana, Josue, Brincat, Scott, Miller, Earl K., Kass, Robert E.

arXiv.org Machine LearningOct-24-2019

Angular measurements are often modeled as circular random variables, where there are natural circular analogues of moments, including correlation. Because a product of circles is a torus, a d-dimensional vector of circular random variables lies on a d-dimensional torus. For such vectors we present here a class of graphical models, which we call torus graphs, based on the full exponential family with pairwise interactions. The topological distinction between a torus and Euclidean space has several important consequences. Our development was motivated by the problem of identifying phase coupling among oscillatory signals recorded from multiple electrodes in the brain: oscillatory phases across electrodes might tend to advance or recede together, indicating coordination across brain areas. The data analyzed here consisted of 24 phase angles measured repeatedly across 840 experimental trials (replications) during a memory task, where the electrodes were in 4 distinct brain regions, all known to be active while memories are being stored or retrieved. In realistic numerical simulations, we found that a standard pairwise assessment, known as phase locking value, is unable to describe multivariate phase interactions, but that torus graphs can accurately identify conditional associations. Torus graphs generalize several more restrictive approaches that have appeared in various scientific literatures, and produced intuitive results in the data we analyzed. Torus graphs thus unify multivariate analysis of circular data and present fertile territory for future research.

graph, phase difference, torus graph, (12 more...)

arXiv.org Machine Learning

1910.11044

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
Asia > Middle East > Jordan (0.04)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)

Industry: Health & Medicine > Therapeutic Area > Neurology (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.66)

Add feedback