Structure of Classifier Boundaries: Case Study for a Naive Bayes Classifier

Karr, Alan F., Bowen, Zac, Porter, Adam A.

Dec-8-2022–arXiv.org Artificial Intelligence

Whether based on models, training data or a combination, classifiers place (possibly complex) input data into one of a relatively small number of output categories. In this paper, we study the structure of the boundary--those points for which a neighbor is classified differently--in the context of an input space that is a graph, so that there is a concept of neighboring inputs, The scientific setting is a model-based naive Bayes classifier for DNA reads produced by Next Generation Sequencers. We show that the boundary is both large and complicated in structure. We create a new measure of uncertainty, called Neighbor Similarity, that compares the result for a point to the distribution of results for its neighbors. This measure not only tracks two inherent uncertainty measures for the Bayes classifier, but also can be implemented, at a computational cost, for classifiers without inherent measures of uncertainty.

artificial intelligence, boundary, machine learning, (18 more...)

arXiv.org Artificial Intelligence

Dec-8-2022

arXiv.org PDF

Add feedback

Country:
- North America > United States
  - New York (0.04)
  - Maryland > Prince George's County
    - College Park (0.04)
  - Florida > Palm Beach County
    - Boca Raton (0.04)
- Europe > Austria
  - Vienna (0.14)

Genre:
- Research Report (0.82)

Industry:
- Health & Medicine
  - Pharmaceuticals & Biotechnology (1.00)
  - Therapeutic Area > Infections and Infectious Diseases (0.73)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning
  - Performance Analysis > Accuracy (1.00)
  - Learning Graphical Models > Directed Networks
    - Bayesian Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found