RaSE: A Variable Screening Framework via Random Subspace Ensembles

Feb-7-2021–arXiv.org Machine Learning

With the rapid advancement of computing power and technology, high-dimensional data become ubiquitous in many disciplines such as genomics, image analysis, and tomography. With high-dimensional data, the number of variables p could be much larger than the sample size n. What makes statistical inference possible is the sparsity assumption, which assumes only a few variables have contributions to the response. Under this sparsity assumption, there has been a rich literature on the topic of variable selection, including LASSO (Tibshirani, 1996), SCAD (Fan and Li, 2001), elastic net (Zou and Hastie, 2005), and MCP (Zhang, 2010). Despite of the success of these methods in many applications, for the ultra-high dimensional scenario where the dimension p grows exponentially with n, they may not work well due to the "curse of dimensionality" in terms of simultaneous challenges to computational expediency, statistical accuracy, and algorithmic stability (Fan and Lv, 2008).

screening, screening method, subspace, (15 more...)

arXiv.org Machine Learning

Feb-7-2021

arXiv.org PDF

Add feedback

Country:
- North America > United States
  - New York (0.04)
- Europe > United Kingdom
  - England > Cambridgeshire > Cambridge (0.04)

Genre:
- Research Report (1.00)

Industry:
- Health & Medicine > Pharmaceuticals & Biotechnology (0.34)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning (0.92)
  - Machine Learning > Statistical Learning
    - Regression (0.46)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found