Cost-complexity pruning of random forests

Jul-19-2017–arXiv.org Machine Learning

Random forests perform bootstrap aggregation by sampling the training set with replacement. This enables evaluation of the out-of-bag error, which serves as an internal cross-validation mechanism. Our motivation lies in using the unsampled training samples to improve each decision tree in the ensemble. We study the effect of using the out-of-bag samples to improve the generalization error, first of the individual decision trees and second of the random forest, by post-pruning. A preliminary empirical study on four UCI repository datasets shows a consistent decrease in the size of the forests without considerable loss in accuracy.
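The idea in the abstract can be illustrated for a single tree with the minimal sketch below. It is not the authors' implementation. It assumes a toy two-feature synthetic dataset, a small Gini-based tree grower, and weakest-link (cost-complexity) pruning that collapses one node at a time, and it keeps the pruned subtree with the lowest out-of-bag error. All function names (`bootstrap_split`, `grow`, `prune_with_oob`, and so on) are hypothetical and introduced only for illustration.

```python
import copy
import random
from collections import Counter

def bootstrap_split(n, rng):
    """Draw n indices with replacement; return (in-bag indices, out-of-bag indices)."""
    in_bag = [rng.randrange(n) for _ in range(n)]
    chosen = set(in_bag)
    return in_bag, [i for i in range(n) if i not in chosen]

def gini(labels):
    n = len(labels)
    return 1.0 - sum((c / n) ** 2 for c in Counter(labels).values())

def grow(X, y, idx, depth):
    """Grow a binary tree on the samples in idx; 'err' is the training miscount at the node."""
    labels = [y[i] for i in idx]
    pred, top = Counter(labels).most_common(1)[0]
    node = {"pred": pred, "err": len(idx) - top, "leaf": True}
    if node["err"] == 0 or depth == 0:
        return node
    best = None
    for f in range(len(X[0])):
        # Dropping the largest value keeps the right child non-empty.
        for thr in sorted({X[i][f] for i in idx})[:-1]:
            left = [i for i in idx if X[i][f] <= thr]
            right = [i for i in idx if X[i][f] > thr]
            score = (len(left) * gini([y[i] for i in left])
                     + len(right) * gini([y[i] for i in right]))
            if best is None or score < best[0]:
                best = (score, f, thr, left, right)
    if best is None:
        return node
    _, f, thr, left, right = best
    node.update(leaf=False, feat=f, thr=thr,
                left=grow(X, y, left, depth - 1),
                right=grow(X, y, right, depth - 1))
    return node

def predict(node, x):
    while not node["leaf"]:
        node = node["left"] if x[node["feat"]] <= node["thr"] else node["right"]
    return node["pred"]

def leaves_and_err(node):
    """Return (number of leaves, summed training miscount over the leaves)."""
    if node["leaf"]:
        return 1, node["err"]
    nl, el = leaves_and_err(node["left"])
    nr, er = leaves_and_err(node["right"])
    return nl + nr, el + er

def internal_nodes(node):
    if node["leaf"]:
        return []
    return [node] + internal_nodes(node["left"]) + internal_nodes(node["right"])

def weakest_link(root):
    """Internal node with the smallest error increase per leaf removed."""
    def g(t):
        n_leaves, sub_err = leaves_and_err(t)
        return (t["err"] - sub_err) / (n_leaves - 1)
    return min(internal_nodes(root), key=g)

def oob_error(root, X, y, oob):
    return sum(predict(root, X[i]) != y[i] for i in oob) / len(oob)

def prune_with_oob(root, X, y, oob):
    """Walk the weakest-link pruning sequence; keep the subtree with the lowest OOB error."""
    best, best_err = copy.deepcopy(root), oob_error(root, X, y, oob)
    tree = copy.deepcopy(root)
    while not tree["leaf"]:
        weakest_link(tree)["leaf"] = True   # collapse the subtree into a leaf
        err = oob_error(tree, X, y, oob)
        if err <= best_err:                 # ties go to the smaller subtree
            best, best_err = copy.deepcopy(tree), err
    return best, best_err

# Toy data (an assumption for illustration): label is x0 > 0.5 with 15% label noise.
rng = random.Random(0)
n = 200
X = [[rng.random(), rng.random()] for _ in range(n)]
y = [int((x[0] > 0.5) != (rng.random() < 0.15)) for x in X]

in_bag, oob = bootstrap_split(n, rng)
full = grow(X, y, in_bag, depth=8)
pruned, pruned_err = prune_with_oob(full, X, y, oob)
print("full tree:  ", leaves_and_err(full)[0], "leaves, OOB error", oob_error(full, X, y, oob))
print("pruned tree:", leaves_and_err(pruned)[0], "leaves, OOB error", pruned_err)
```

In a forest, the same selection would be repeated for every tree, each with its own out-of-bag set. Note that textbook cost-complexity pruning collapses all nodes tied at the minimum in one step, whereas this sketch collapses a single node per step, so it visits a finer sequence of subtrees.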

machine learning, pruning, subtree

Country:
- North America > United States
  - New York (0.04)
  - California > Alameda County
    - Berkeley (0.04)

Genre:
- Research Report (0.64)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (1.00)
