Randomer Forests

Tomita, Tyler M., Browne, James, Shen, Cencheng, Priebe, Carey E., Burns, Randal, Maggioni, Mauro, Vogelstein, Joshua T.

Mar-19-2018–arXiv.org Machine Learning

Ensemble methods -- particularly those based on decision trees -- have recently demonstrated superior performance in a variety of machine learning settings. Specifically, Random Forest (RF) was found to outperform >100 other methods in several manuscripts, and gradient boosting trees have been a crucial component of several recent Kaggle competition victories. Building off these successes and recent advances in sparse learning and random matrix theory, we propose a novel ensemble tree method called "Randomer Forest" (RerF). The key intuition behind RerF is that we can use sparse linear combinations at each decision node rather than just one feature (as in RF) or all of them (as in Rotation Forests). RerF significantly outperforms other methods on a standard benchmark suite containing 105 problems with varying dimension, sample size, and number of classes. Moreover, we provide an implementation that scales as or more efficiently than other available packages. Via a combination of basic principles, theory, and extensive numerical experiments, we demonstrate why, when, and how RerF achieves its performance properties.

artificial intelligence, machine learning, projection, (20 more...)

arXiv.org Machine Learning

Mar-19-2018

arXiv.org PDF

Add feedback

Country:
- North America > United States
  - California (0.46)
  - Maryland (0.28)

Genre:
- Research Report
  - New Finding (1.00)
  - Experimental Study (0.92)

Industry:
- Health & Medicine (0.68)
- Education (0.67)
- Government
  - Military (0.67)
  - Regional Government > North America Government
    - United States Government (1.00)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning
  - Statistical Learning (1.00)
  - Ensemble Learning (0.87)
  - Decision Tree Learning (0.69)
  - Performance Analysis > Accuracy (0.30)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found