Goto

Collaborating Authors

 Performance Analysis


How Confusion Matrix is useful in solving Cyber Crimes

#artificialintelligence

In Machine Learning we feed the features into our model and get the output in the form of probabilities. But how can we measure the accuracy and effectiveness of that model? This is where confusion matrix comes into the play. Confusion matrix is used to describe the performance of the classification model. First let us understand about the confusion matrix.


MemStream: Memory-Based Anomaly Detection in Multi-Aspect Streams with Concept Drift

arXiv.org Artificial Intelligence

Given a stream of entries over time in a multi-aspect data setting where concept drift is present, how can we detect anomalous activities? Most of the existing unsupervised anomaly detection approaches seek to detect anomalous events in an offline fashion and require a large amount of data for training. This is not practical in real-life scenarios where we receive the data in a streaming manner and do not know the size of the stream beforehand. Thus, we need a data-efficient method that can detect and adapt to changing data trends, or concept drift, in an online manner. In this work, we propose MemStream, a streaming multi-aspect anomaly detection framework, allowing us to detect unusual events as they occur while being resilient to concept drift. We leverage the power of a denoising autoencoder to learn representations and a memory module to learn the dynamically changing trend in data without the need for labels. We prove the optimum memory size required for effective drift handling. Furthermore, MemStream makes use of two architecture design choices to be robust to memory poisoning. Experimental results show the effectiveness of our approach compared to state-of-the-art streaming baselines using 2 synthetic datasets and 11 real-world datasets.


Accurate and robust Shapley Values for explaining predictions and focusing on local important variables

arXiv.org Machine Learning

Although Shapley Values (SV) are widely used in explainable AI, they can be poorly understood and estimated, which implies that their analysis may lead to spurious inferences and explanations. As a starting point, we remind an invariance principle for SV and derive the correct approach for computing the SV of categorical variables that are particularly sensitive to the encoding used. In the case of tree-based models, we introduce two estimators of Shapley Values that exploit efficiently the tree structure and are more accurate than state-of-the-art methods. For interpreting additive explanations, we recommend to filter the non-influential variables and to compute the Shapley Values only for groups of influential variables. For this purpose, we use the concept of "Same Decision Probability" (SDP) that evaluates the robustness of a prediction when some variables are missing. This prior selection procedure produces sparse additive explanations easier to visualize and analyse. Simulations and comparisons are performed with state-of-the-art algorithm, and show the practical gain of our approach.


Bias Mitigation of Face Recognition Models Through Calibration

arXiv.org Machine Learning

Face recognition models suffer from bias: for example, the probability of a false positive (incorrect face match) strongly depends on sensitive attributes like ethnicity. As a result, these models may disproportionately and negatively impact minority groups when used in law enforcement. In this work, we introduce the Bias Mitigation Calibration (BMC) method, which (i) increases model accuracy (improving the state-of-the-art), (ii) produces fairly-calibrated probabilities, (iii) significantly reduces the gap in the false positive rates, and (iv) does not require knowledge of the sensitive attribute.


Ethical AI, Monetizing False Negatives and Growing Total Addressable Market

#artificialintelligence

What if I told you that companies that don't embrace Ethical AI are leaving significant amounts of "Money on the Table"; that they are not only missing out on potentially profitable customers, but that over time they are eroding their Total Addressable Market (TAM)? Do I have your attention now? After I published the blog "The Ethical AI Application Pyramid", a question from Karrie Sullivan coupled with a mentoring session with the startup unfog.ai "If your AI model doesn't take into consideration the ultimate outcomes of the AI model's False Negatives, then confirmation bias in the AI model could set in and eventually the company's Total Addressable Market (TAM) could shrink to a point where the business might no longer be viable." Yea, not only is Ethical AI the right thing to do from a cultural and society perspective, but there are direct bottom-line financial ramifications if your AI models are not learning and adapting from the AI model's False Negatives.


Confusion Matrix In Cyber Security

#artificialintelligence

In today's article I'm going to explain all about Intrusion detection system in cyber security, confusion matrix, how it is used in IDS, how it is impacting in cyber security with example .So let's get started to this amazing topic. In today's technological world where everything is going to digitalized everything is online now. Along with this the most important thing is data and data security. All activities we do on internet, what we searched,what we post, what we buy, which site we visited all this data is stored in datacenters servers. This all data must be secured from hackers and any kind of data loss.


Cybersecurity: When we talk about the confusion matrix

#artificialintelligence

Confusion Matrix The Confusion Matrix is a table that summarizes the number of true and false predictions made by a classifier. It is used to measure the performance of a classification model. It can be used to assess the performance of a classification model by calculating performance indicators such as accuracy, precision, recall, and F1 score. If you are working with an unbalanced dataset, you had better use the confusion matrix as the endpoint for your machine learning model. Here are the basic terms that will help us identify the metrics we are looking for.


Virtual Screening of Pharmaceutical Compounds with hERG Inhibitory Activity (Cardiotoxicity) using Ensemble Learning

arXiv.org Artificial Intelligence

In silico prediction of cardiotoxicity with high sensitivity and specificity for potential drug molecules can be of immense value. Hence, building machine learning classification models, based on some features extracted from the molecular structure of drugs, which are capable of efficiently predicting cardiotoxicity is critical. In this paper, we consider the application of various machine learning approaches, and then propose an ensemble classifier for the prediction of molecular activity on a Drug Discovery Hackathon (DDH) (1st reference) dataset. We have used only 2-D descriptors of SMILE notations for our prediction. Our ensemble classification uses 5 classifiers (2 Random Forest Classifiers, 2 Support Vector Machines and a Dense Neural Network) and uses Max-Voting technique and Weighted-Average technique for final decision. Introduction It is well known that drug discovery is complex, long drawn, and requires interdisciplinary expertise to discover new molecules. Drug safety is an important issue in the process of drug discovery. Failure in clinical trials in the 2000s was majorly due to efficacy and safety (approx 30%) (Kola, I. and Landis, J., 2004). One important aspect of drug safety is drug toxicity.


Subgroup Fairness in Two-Sided Markets

arXiv.org Artificial Intelligence

It is well known that two-sided markets are unfair in a number of ways. For instance, female workers at Uber earn less than their male colleagues per mile driven. Similar observations have been made for other minority subgroups in other two-sided markets. Here, we suggest a novel market-clearing mechanism for two-sided markets, which promotes equalisation of the pay per hour worked across multiple subgroups, as well as within each subgroup. In the process, we introduce a novel notion of subgroup fairness (which we call Inter-fairness), which can be combined with other notions of fairness within each subgroup (called Intra-fairness), and the utility for the customers (Customer-Care) in the objective of the market-clearing problem. While the novel non-linear terms in the objective complicate market clearing by making the problem non-convex, we show that a certain non-convex augmented Lagrangian relaxation can be approximated to any precision in time polynomial in the number of market participants using semi-definite programming. This makes it possible to implement the market-clearing mechanism efficiently. On the example of driver-ride assignment in an Uber-like system, we demonstrate the efficacy and scalability of the approach, and trade-offs between Inter- and Intra-fairness.


Minimum Word Error Rate Training with Language Model Fusion for End-to-End Speech Recognition

arXiv.org Artificial Intelligence

Integrating external language models (LMs) into end-to-end (E2E) models remains a challenging task for domain-adaptive speech recognition. Recently, internal language model estimation (ILME)-based LM fusion has shown significant word error rate (WER) reduction from Shallow Fusion by subtracting a weighted internal LM score from an interpolation of E2E model and external LM scores during beam search. However, on different test sets, the optimal LM interpolation weights vary over a wide range and have to be tuned extensively on well-matched validation sets. In this work, we perform LM fusion in the minimum WER (MWER) training of an E2E model to obviate the need for LM weights tuning during inference. Besides MWER training with Shallow Fusion (MWER-SF), we propose a novel MWER training with ILME (MWER-ILME) where the ILME-based fusion is conducted to generate N-best hypotheses and their posteriors. Additional gradient is induced when internal LM is engaged in MWER-ILME loss computation. During inference, LM weights pre-determined in MWER training enable robust LM integrations on test sets from different domains. Experimented with 30K-hour trained transformer transducers, MWER-ILME achieves on average 8.8% and 5.8% relative WER reductions from MWER and MWER-SF training, respectively, on 6 different test sets