Goto

Collaborating Authors

 Accuracy


(Visually) Interpreting the confusion-matrix:

#artificialintelligence

But first, what is a confusion matrix? In machine learning, a confusion matrix is a kind-of confusing table used to understand how well our model predictions perform(especially confusing when we have multiple classes and not the classic binary 0/1 problems). However, gradually I figured out that the confusion-matrix is not so confusing and helps me a ton in understanding the model behaviour and interpreting the results. So I'm going to try to do the same here.. make it less confusing, more interesting and easier to interpret! The columns represent predictions made by our model and the rows represent the actual classes(this is the format of the very popular Python library for ML: sklearn.


New Artificial Intelligence Tool Improves Breast Cancer Detection on Mammography

#artificialintelligence

The upper panels show the craniocaudal and the mediolateral oblique views. The lower panels show a close-up of the left breast area containing the lesion. The case is one of the false-negative cases included in the dataset. Accordingly, the initial screening assessment was a BI-RADS 2, meaning visible findings were judged as benign. After 1 year, the patient presented for another screening examination.


Efficient Permutation Discovery in Causal DAGs

arXiv.org Machine Learning

The problem of learning a directed acyclic graph (DAG) up to Markov equivalence is equivalent to the problem of finding a permutation of the variables that induces the sparsest graph. Without additional assumptions, this task is known to be NP-hard. Building on the minimum degree algorithm for sparse Cholesky decomposition, but utilizing DAG-specific problem structure, we introduce an efficient algorithm for finding such sparse permutations. We show that on jointly Gaussian distributions, our method with depth $w$ runs in $O(p^{w+3})$ time. We compare our method with $w = 1$ to algorithms for finding sparse elimination orderings of undirected graphs, and show that taking advantage of DAG-specific problem structure leads to a significant improvement in the discovered permutation. We also compare our algorithm to provably consistent causal structure learning algorithms, such as the PC algorithm, GES, and GSP, and show that our method achieves comparable performance with a shorter runtime. Thus, our method can be used on its own for causal structure discovery. Finally, we show that there exist dense graphs on which our method achieves almost perfect performance, so that unlike most existing causal structure learning algorithms, the situations in which our algorithm achieves both good performance and good runtime are not limited to sparse graphs.


Ridge Regression with Frequent Directions: Statistical and Optimization Perspectives

arXiv.org Machine Learning

Despite its impressive theory \& practical performance, Frequent Directions (\acrshort{fd}) has not been widely adopted for large-scale regression tasks. Prior work has shown randomized sketches (i) perform worse in estimating the covariance matrix of the data than \acrshort{fd}; (ii) incur high error when estimating the bias and/or variance on sketched ridge regression. We give the first constant factor relative error bounds on the bias \& variance for sketched ridge regression using \acrshort{fd}. We complement these statistical results by showing that \acrshort{fd} can be used in the optimization setting through an iterative scheme which yields high-accuracy solutions. This improves on randomized approaches which need to compromise the need for a new sketch every iteration with speed of convergence. In both settings, we also show using \emph{Robust Frequent Directions} further enhances performance.


Selective Classification via One-Sided Prediction

arXiv.org Machine Learning

We propose a novel method for selective classification (SC), a problem which allows a classifier to abstain from predicting some instances, thus trading off accuracy against coverage (the fraction of instances predicted). In contrast to prior gating or confidence-set based work, our proposed method optimises a collection of class-wise decoupled one-sided empirical risks, and is in essence a method for explicitly finding the largest decision sets for each class that have few false positives. This one-sided prediction (OSP) based relaxation yields an SC scheme that attains near-optimal coverage in the practically relevant high target accuracy regime, and further admits efficient implementation, leading to a flexible and principled method for SC. We theoretically derive generalization bounds for SC and OSP, and empirically we show that our scheme strongly outperforms state of the art methods in coverage at small error levels.


Acoustics Based Intent Recognition Using Discovered Phonetic Units for Low Resource Languages

arXiv.org Artificial Intelligence

With recent advancements in language technologies, humansare now interacting with technology through speech. To in-crease the reach of these technologies, we need to build suchsystems in local languages. A major bottleneck here are theunderlying data-intensive parts that make up such systems,including automatic speech recognition (ASR) systems thatrequire large amounts of labelled data. With the aim of aidingdevelopment of dialog systems in low resourced languages,we propose a novel acoustics based intent recognition systemthat uses discovered phonetic units for intent classification.The system is made up of two blocks - the first block gen-erates a transcript of discovered phonetic units for the inputaudio, and the second block which performs intent classifi-cation from the generated phonemic transcripts. Our workpresents results for such a system for two languages families- Indic languages and Romance languages, for two differentintent recognition tasks. We also perform multilingual train-ing of our intent classifier and show improved cross-lingualtransfer and performance on an unknown language with zeroresources in the same language family.


Instant Payments

#artificialintelligence

PelicanPayments is a complete payments solution delivering exceptional levels of efficiency, control and flexibility. PelicanSecure is a comprehensive suite of real-time financial crime compliance and anti-fraud solutions. Pelican Open Banking Hub connects 6000 banks in Europe with an ability to connect to over 500 million customers. This Open Banking webinar focuses on opportunities for banks in opting for the an aggressive and proactive approach to winning new customers. To read a full report, click "Learn More:


FDA: Antigen tests for COVID-19 are rapid but can lead to false positives

Boston Herald

The U.S. Food and Drug Administration is alerting clinical laboratory staff and health care providers that false positive results can occur with antigen tests for the virus that causes COVID-19. In a letter to stakeholders, the FDA said Tuesday that while antigen tests can be used for the rapid detection of SARS-CoV-2, false positive results can occur, especially if users don't follow the instructions. "The FDA is aware of reports of false positive results associated with antigen tests used in nursing homes and other settings and continues to monitor and evaluate these reports and other available information about device safety and performance," the letter said. A Boston-area infectious disease expert said the antigen tests are good for large scale screening, when used properly, but must be followed up with more accurate testing. "If you are testing a population at low risk, it's fine to do these tests for screening," said Dr. Daniel Kuritzkes, chief of the Division of Infectious Diseases at Brigham and Women's Hospital.


Re-Assessing the "Classify and Count" Quantification Method

arXiv.org Artificial Intelligence

Learning to quantify (a.k.a.\ quantification) is a task concerned with training unbiased estimators of class prevalence via supervised learning. This task originated with the observation that "Classify and Count" (CC), the trivial method of obtaining class prevalence estimates, is often a biased estimator, and thus delivers suboptimal quantification accuracy; following this observation, several methods for learning to quantify have been proposed that have been shown to outperform CC. In this work we contend that previous works have failed to use properly optimised versions of CC. We thus reassess the real merits of CC (and its variants), and argue that, while still inferior to some cutting-edge methods, they deliver near-state-of-the-art accuracy once (a) hyperparameter optimisation is performed, and (b) this optimisation is performed by using a true quantification loss instead of a standard classification-based loss. Experiments on three publicly available binary sentiment classification datasets support these conclusions.


Detecting Backdoors in Neural Networks Using Novel Feature-Based Anomaly Detection

arXiv.org Artificial Intelligence

This paper proposes a new defense against neural network backdooring attacks that are maliciously trained to mispredict in the presence of attacker-chosen triggers. Our defense is based on the intuition that the feature extraction layers of a backdoored network embed new features to detect the presence of a trigger and the subsequent classification layers learn to mispredict when triggers are detected. Therefore, to detect backdoors, the proposed defense uses two synergistic anomaly detectors trained on clean validation data: the first is a novelty detector that checks for anomalous features, while the second detects anomalous mappings from features to outputs by comparing with a separate classifier trained on validation data. The approach is evaluated on a wide range of backdoored networks (with multiple variations of triggers) that successfully evade state-of-the-art defenses. Additionally, we evaluate the robustness of our approach on imperceptible perturbations, scalability on large-scale datasets, and effectiveness under domain shift. This paper also shows that the defense can be further improved using data augmentation.