An information criterion for automatic gradient tree boosting
Lunde, Berent Ånund Strømnes, Kleppe, Tore Selland, Skaug, Hans Julius
This article is motivated by the problem of selecting the functional form of trees and ensemble size in gradient tree boosting (Friedman, 2001; Mason et al., 2000). Gradient tree boosting (GTB) has become extremely popular in recent years, both in academia and industry: At present, an increase in the size of datasets, both in the number of observations and the richness of the data, or number of features, is seen. This, coupled with an exponential increase in computational power and a growing revelation and acceptance for data-driven decisions in the industry makes for an increasing interest in statistical learning (Hastie et al., 2001). For these new datasets, standard statistical methods such as generalized linear models (McCullagh and Nelder, 1989) that have a fixed learning rate due to their constrained functional form with bounded complexity, struggle in terms of predictive power, as they stop learning at certain information thresholds. The interest is therefore geared towards more flexible approaches such as ensembles of learners.
Aug-13-2020
- Country:
- North America > United States
- California (0.04)
- New York > New York County
- New York City (0.04)
- Europe
- United Kingdom > England
- Cambridgeshire > Cambridge (0.04)
- Norway > Western Norway
- United Kingdom > England
- North America > United States
- Genre:
- Research Report (0.82)
- Technology: