Diagonal Discriminant Analysis with Feature Selection for High Dimensional Data

Romanes, Sarah Elizabeth, Ormerod, John Thomas, Yang, Jean YH

Jul-3-2018–arXiv.org Machine Learning

Classification problems involving high dimensional data are extensive in many fields such as finance, marketing, and bioinformatics. Unique challenges with high dimensional datasets are numerous and well known, with many classifiers built under traditional low dimensional frameworks simply unable to be applied to such high dimensional data. Discriminant Analysis (DA) is one such example (Fisher, 1936). DA classifiers work by assuming the distribution of the features is strictly Gaussian at the class level, and assign a particular point to the class label which minimises the Mahalanobis (for linear discriminant analysis (LDA)) distance between that point and the mean of the multivariate normal corresponding to such class. Although extraordinary simple and easy to use in low dimensional settings, DA is well known to be unusable in high dimensions due to the maximum likelihood estimate of the corresponding covariance matrix being singular when the number of features is greater than that of the observations.

artificial intelligence, bayesian inference, machine learning, (19 more...)

arXiv.org Machine Learning

Jul-3-2018

arXiv.org PDF

Add feedback

Country:
- Oceania > Australia (0.04)
- North America > United States
  - New York (0.04)
- Europe
  - Austria > Vienna (0.14)
  - United Kingdom > England
    - Cambridgeshire > Cambridge (0.04)

Genre:
- Research Report (1.00)

Industry:
- Health & Medicine
  - Therapeutic Area > Oncology (1.00)
  - Pharmaceuticals & Biotechnology (1.00)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning > Uncertainty
    - Bayesian Inference (0.48)
  - Machine Learning
    - Statistical Learning (1.00)
    - Learning Graphical Models > Directed Networks
      - Bayesian Learning (0.66)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found