Goto

Collaborating Authors

 confusionmatrix


Pinaki Laskar on LinkedIn: #ConfusionMatrix #machinelearning #datascience

#artificialintelligence

AI Researcher, Cognitive Technologist Inventor - AI Thinking, Think Chain Innovator - AIOT, XAI, Autonomous Cars, IIOT Founder Fisheyebox Spatial Computing Savant, Transformative Leader, Industry X.0 Practitioner Why do we need #ConfusionMatrix? Generally, It's a tool that helps to understand if the model is working really well. Moreover, from it, you can derive many evaluation measures, such as accuracy, precision, recall, etc. It's based on the fact that we need to compare the class predicted by the classifier with the actual class for each observation. So, the efficiency of a model can be seen using this amazing matrix. Each column corresponds to the predicted class, while each row indicates the actual class.


AP-Perf: Incorporating Generic Performance Metrics in Differentiable Learning

Fathony, Rizal, Kolter, J. Zico

arXiv.org Machine Learning

We propose a method that enables practitioners to conveniently incorporate custom non-decomposable performance metrics into differentiable learning pipelines, notably those based upon deep learning architectures. Our approach is based on the recently-developed adversarial prediction framework, a distributionally robust approach that optimizes a metric in the worst case given the statistical summary of the empirical distribution. We formulate a marginal distribution technique to reduce the complexity of optimizing the adversarial prediction formulation over a vast range of non-decomposable metrics. We demonstrate how easy it is to write and incorporate complex custom metrics using our provided tool. Finally, we show the effectiveness of our approach for image classification tasks using MNIST and Fashion-MNIST datasets as well as classification task on tabular data using UCI repository and benchmark datasets.


Rebalancing Learning on Evolving Data Streams

Bernardo, Alessio, Della Valle, Emanuele, Bifet, Albert

arXiv.org Machine Learning

Albert Bifet University of W aikato, New Zealand LTCI, T el ecom ParisT ech, France abifet@waikato.ac.nz Abstract --Nowadays, every device connected to the Internet generates an ever-growing stream of data (formally, unbounded). Machine Learning on unbounded data streams is a grand challenge due to its resource constraints. In fact, standard machine learning techniques are not able to deal with data whose statistics is subject to gradual or sudden changes without any warning. Massive Online Analysis (MOA) is the collective name, as well as a software library, for new learners that are able to manage data streams. In this paper, we present a research study on streaming rebalancing. Indeed, data streams can be imbalanced as static data, but there is not a method to rebalance them incrementally, one element at a time. For this reason we propose a new streaming approach able to rebalance data streams online. Our new methodology is evaluated against some synthetically generated datasets using prequential evaluation in order to demonstrate that it outperforms the existing approaches.