High Dimensional Classification via Empirical Risk Minimization: Improvements and Optimality

May-31-2019–arXiv.org Machine Learning

In this article, we investigate a family of classification algorithms defined by the principle of empirical risk minimization, in the high dimensional regime where the feature dimension $p$ and data number $n$ are both large and comparable. Based on recent advances in high dimensional statistics and random matrix theory, we provide under mixture data model a unified stochastic characterization of classifiers learned with different loss functions. Our results are instrumental to an in-depth understanding as well as practical improvements on this fundamental classification approach. As the main outcome, we demonstrate the existence of a universally optimal loss function which yields the best high dimensional performance at any given $n/p$ ratio.

artificial intelligence, classifier, machine learning, (17 more...)

arXiv.org Machine Learning

May-31-2019

arXiv.org PDF

Add feedback

Country:
- Europe (0.28)

Genre:
- Research Report > New Finding (0.49)

Technology:
- Information Technology
  - Information Management (1.00)
  - Artificial Intelligence > Machine Learning
    - Performance Analysis > Accuracy (0.47)
    - Statistical Learning > Regression (0.47)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found