Reviews: Minimal Variance Sampling in Stochastic Gradient Boosting
–Neural Information Processing Systems
The authors propose a non-uniform sampling strategy for stochastic gradient boosted decision trees. In particular, sampling probability of the training data is optimized towards maximizing the estimation accuracy of the splitting score of decision trees. The optimization problem allows an approximate closed-form solution. Experiment results demonstrate superior performance of the proposed strategy. The reviewers agree that the paper can not only help understand sampling within GBDT from a more rigorous perspective but also improve GBDT implementations in practice.
Neural Information Processing Systems
Jan-24-2025, 00:31:05 GMT