Quantized Training of Gradient Boosting Decision Trees

Jan-13-2025, 14:11:58 GMT–Neural Information Processing Systems

Recent years have witnessed significant success of Gradient Boosting Decision Trees (GBDT) in a wide range of machine learning applications. It is generally assumed that GBDT training algorithms must compute gradients and statistics in high-precision floating point. In this paper, we investigate an important question that has been largely ignored in the previous literature: how many bits are needed to represent gradients when training GBDT? To answer this question, we propose to quantize all the high-precision gradients in a very simple yet effective way within the GBDT training algorithm. Surprisingly, both our theoretical analysis and empirical studies show that the gradient precision needed to avoid hurting performance can be quite low, e.g., 2 or 3 bits.
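To make the idea of low-bit gradients concrete, here is a minimal sketch of one way to quantize them. The abstract does not describe the exact scheme, so everything below is an assumption for illustration: a symmetric uniform grid scaled by the largest absolute gradient, stochastic rounding (so each quantized value is unbiased in expectation), and the hypothetical helper names `quantize_gradients` and `dequantized_sum`.

```python
import math
import random


def quantize_gradients(grads, bits=3, seed=0):
    """Map float gradients to small signed integers (illustrative sketch).

    Assumptions, not taken from the paper's text: a symmetric uniform grid
    scaled by max |gradient|, with stochastic rounding.
    """
    rng = random.Random(seed)
    levels = 2 ** (bits - 1) - 1            # e.g. 3 bits -> integers in [-3, 3]
    max_abs = max(abs(g) for g in grads)
    if max_abs == 0.0:
        return [0] * len(grads), 1.0
    scale = max_abs / levels                # float value of one integer step
    quantized = []
    for g in grads:
        x = g / scale
        low = math.floor(x)
        # Round up with probability equal to the fractional part of x.
        q = low + (1 if rng.random() < x - low else 0)
        # Clamp to guard against floating-point overshoot at the extremes.
        quantized.append(max(-levels, min(levels, q)))
    return quantized, scale


def dequantized_sum(quantized, scale):
    """Accumulate in integers, then rescale once, as a histogram bin could."""
    return sum(quantized) * scale


if __name__ == "__main__":
    rng = random.Random(1)
    grads = [rng.uniform(-1.0, 1.0) for _ in range(10_000)]
    q, scale = quantize_gradients(grads, bits=3)
    print("exact sum:    ", sum(grads))
    print("quantized sum:", dequantized_sum(q, scale))
```

Because the rounding is unbiased, the errors of individual samples tend to cancel when many gradients are summed, which is the kind of aggregate a tree learner relies on when evaluating splits. Whether this matches the paper's actual procedure should be checked against the full text.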

decision tree, gbdt, quantized training

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning > Diagnosis (0.64)
  - Machine Learning
    - Ensemble Learning (0.64)
    - Decision Tree Learning (0.64)