agtboost: Adaptive and Automatic Gradient Tree Boosting Computations

Lunde, Berent Ånund Strømnes, Kleppe, Tore Selland

Aug-28-2020–arXiv.org Machine Learning

Gradient tree boosting (GTB) (Friedman 2001; Mason, Baxter, Bartlett, and Frean 1999) has risen to prominence for regression problems after the introduction of xgboost (Chen and Guestrin 2016). The GTB model is an ensemble-type model, that consist of classification and regression trees (CART) (Breiman, Friedman, Stone, and Olshen 1984) that are learned in an iterative manner. GTB models are very flexible in that they automatically learn nonlinear relationships and interaction effects. However, with the increased flexibility of GTB models comes substantial worries of overfitting. The top performing gradient tree boosting libraries, such as xgboost, LightGBM (Ke, Meng, Finley, Wang, Chen, Ma, Ye, and Liu 2017) and catboost (Dorogush, Ershov, and Gulin 2018), all come with a large number of hyperparameters available for manual tuning to constrain the complexity of the GTB models. Training of gradient tree boosting models, in general, thus require some familiarity with both the chosen package, and the data for efficient tuning and application to the problem at hand. The main focus of the hyperparameters and tuning are to solve the following problems: - The complexity of trees: What are the topology of all the different trees?

artificial intelligence, iteration, machine learning, (18 more...)

arXiv.org Machine Learning

Aug-28-2020

arXiv.org PDF

Add feedback

Country:
- North America > United States
  - California (0.04)
  - New York > New York County
    - New York City (0.04)
- Europe
  - Austria > Vienna (0.14)
  - Norway > Western Norway
    - Rogaland > Stavanger (0.05)
  - Netherlands > South Holland
    - Leiden (0.04)

Genre:
- Research Report (0.64)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning
  - Statistical Learning (1.00)
  - Ensemble Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found