Stability of Random Forests and Coverage of Random-Forest Prediction Intervals
Neural Information Processing Systems
We establish stability of random forests under the mild condition that the squared response ($Y^2$) does not have a heavy tail. In particular, our analysis holds for the practical version of random forests implemented in popular packages such as \texttt{randomForest} in \texttt{R}. Empirical results show that stability may persist even beyond our assumption and hold for heavy-tailed $Y^2$. Using the stability property, we prove a non-asymptotic lower bound for the coverage probability of prediction intervals constructed from the out-of-bag error of random forests. Under another mild condition that is typically satisfied when $Y$ is continuous, we also establish a complementary upper bound, which can be similarly established for the jackknife prediction interval constructed from an arbitrary stable algorithm.
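The out-of-bag construction referenced in the abstract can be sketched as follows: given the OOB residuals of a fitted forest, a level-$(1-\alpha)$ prediction interval at a new point shifts the point prediction by the empirical $\alpha/2$ and $1-\alpha/2$ quantiles of those residuals. This is a minimal stdlib-only sketch, not the paper's exact procedure; the residuals here are simulated placeholders, and `oob_prediction_interval` is a hypothetical helper name.

```python
import random

# Hypothetical OOB residuals (y_i minus the OOB prediction at x_i)
# from an already-fitted random forest; simulated here for illustration.
random.seed(0)
oob_residuals = [random.gauss(0.0, 1.0) for _ in range(1000)]

def oob_prediction_interval(y_hat, residuals, alpha=0.1):
    """Shift the point prediction y_hat by the empirical alpha/2 and
    1 - alpha/2 quantiles of the OOB residuals."""
    r = sorted(residuals)
    n = len(r)
    lo = r[int(n * (alpha / 2))]
    hi = r[min(n - 1, int(n * (1 - alpha / 2)))]
    return y_hat + lo, y_hat + hi

# Interval around a hypothetical point prediction of 2.5 at level 90%.
lo, hi = oob_prediction_interval(2.5, oob_residuals, alpha=0.1)
```

The paper's lower bound says that, for a stable forest, the probability that a fresh response lands in such an interval is not much below the nominal $1-\alpha$; the complementary upper bound rules out severe over-coverage when $Y$ is continuous.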