Large Deviations for Classification Performance Analysis of Machine Learning Systems

Braca, Paolo, Millefiori, Leonardo M., Aubry, Augusto, De Maio, Antonio, Willett, Peter

Jan-16-2023–arXiv.org Artificial Intelligence

We study the performance of machine learning binary classification techniques in terms of error probabilities. The statistical test is based on the Data-Driven Decision Function (D3F), learned in the training phase, i.e., what is thresholded before the final binary decision is made. Based on large deviations theory, we show that under appropriate conditions the classification error probabilities vanish exponentially, as $\sim \exp\left(-n\,I + o(n) \right)$, where $I$ is the error rate and $n$ is the number of observations available for testing. We also propose two different approximations for the error probability curves, one based on a refined asymptotic formula (often referred to as exact asymptotics), and another one based on the central limit theorem. The theoretical findings are finally tested using the popular MNIST dataset.

artificial intelligence, error probability, machine learning, (18 more...)

arXiv.org Artificial Intelligence

Jan-16-2023

arXiv.org PDF

Add feedback

Country:
- Europe > Italy (0.05)
- North America > United States
  - New York (0.04)
  - Connecticut (0.04)

Genre:
- Research Report (0.50)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning
  - Performance Analysis > Accuracy (1.00)
  - Neural Networks > Deep Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found