Minimax Lower Bounds for Cost Sensitive Classification

Kamalaruban, Parameswaran, Williamson, Robert C.

May-20-2018–arXiv.org Machine Learning

The central problem of this paper is cost-sensitive binary classification, where different costs are associated with different types of mistakes. Several important machine learning applications, such as medical decision making, targeted marketing, and intrusion detection, can be naturally formalized as cost-sensitive classification setups ([1]). In these domains, the cost of missing a target is much higher than that of a false positive, and classifiers that do not take misclassification costs into account do not perform well.

The cost-sensitive classification problem has been extensively studied, and researchers have developed efficient algorithms with provable guarantees on the (generalization) error [6, 9, 26, 27, 11, 4]. These methods primarily take existing classification methods based on empirical risk minimization and adapt them in various ways to be sensitive to the misclassification costs. Despite these efforts, an understanding of the fundamental limits of this problem is still missing. In this paper, we study the hardness of this problem by obtaining minimax lower bounds. In particular, we are interested in understanding how the cost parameter influences the hardness, or complexity, of cost-sensitive classification.

Minimax Lower Bounds

Understanding the hardness or fundamental limits of a learning problem is important for practice for the following reasons:
- They give an estimate on the number of samples required for a good performance of a learning algorithm.
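To make the role of the cost parameter concrete, here is a minimal Python sketch of a cost-sensitive loss. It assumes a parametrization that is common in the literature but is not stated in the excerpt above: labels in {0, 1}, a single cost parameter `c` in (0, 1), a false positive costing `c`, and a false negative costing `1 - c`. The function names are hypothetical and chosen for illustration only.

```python
def cost_sensitive_loss(y_true, y_pred, c):
    """Loss of predicting y_pred when the label is y_true.

    Assumed parametrization (not taken from the excerpt):
    a false positive costs c, a false negative costs 1 - c,
    and a correct prediction costs 0.
    """
    if y_pred == y_true:
        return 0.0
    return c if y_pred == 1 else 1.0 - c


def expected_loss(eta, y_pred, c):
    """Expected loss of y_pred at a point where P(Y = 1 | x) = eta."""
    return (eta * cost_sensitive_loss(1, y_pred, c)
            + (1.0 - eta) * cost_sensitive_loss(0, y_pred, c))


def bayes_prediction(eta, c):
    """Prediction minimizing the expected loss: 1 exactly when eta > c.

    Predicting 1 costs c * (1 - eta) in expectation and predicting 0
    costs (1 - c) * eta, so predicting 1 is better when eta > c.
    """
    return 1 if eta > c else 0
```

Under these assumptions, a small `c` makes missed targets expensive relative to false alarms, so the optimal rule predicts the positive class even when the conditional probability `eta` is low. With `c = 0.5` both mistakes cost the same and the rule reduces to the usual threshold at one half.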



