Reviews: Minimal Variance Sampling in Stochastic Gradient Boosting
–Neural Information Processing Systems
Update: I read the authors' response. Re: "sampling rate does not tell the whole story": I was suggesting to add information about how many instances, on average, were used for each split (because it is not equal to sampling rate * total dataset size). I am keeping my accept rating, hoping that the authors do make the changes to improve the derivations/clarity in the final submission.

Summary: This paper is concerned with a common trick that many GBDT implementations apply: subsampling instances in order to speed up the calculations for finding the best split. The authors formulate the problem of choosing which instances to sample as an optimization problem and derive a modified sampling scheme aimed at mimicking the gain that would be assigned to a split on all of the data, using a gain calculated only on the subsampled instances. The experiments demonstrate good results. The paper is well written and easy to follow, apart from a couple of places in the derivations (see my questions).
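To make the scheme concrete for readers of this review: the idea is to keep each instance with a probability proportional to its (regularized) gradient magnitude, capped at 1, and to reweight kept instances by the inverse of that probability so the subsampled split gain estimates the full-data gain. The following is a minimal illustrative sketch of that kind of sampler, assuming squared gradients and hessians as the importance measure; the function name, the constant `lam`, and the bisection threshold search are my own choices, not the authors' exact algorithm:

```python
import numpy as np

def mvs_like_sample(grad, hess, sample_rate, lam=0.1, seed=0):
    """Illustrative sketch: sample instance i with probability
    p_i = min(1, r_i / mu), where r_i is a regularized gradient
    magnitude and mu is chosen so the expected sample size matches
    the budget; kept instances get inverse-probability weights."""
    rng = np.random.default_rng(seed)
    r = np.sqrt(grad ** 2 + lam * hess ** 2)  # regularized |gradient|
    n = len(r)
    budget = sample_rate * n
    # bisection on mu so that sum(min(1, r / mu)) ~= budget
    lo, hi = 0.0, float(r.max()) * n
    for _ in range(100):
        mu = 0.5 * (lo + hi)
        if np.minimum(1.0, r / mu).sum() > budget:
            lo = mu
        else:
            hi = mu
    p = np.minimum(1.0, r / mu)
    keep = rng.random(n) < p
    weights = 1.0 / p[keep]  # unbiases gains computed on the subsample
    return keep, weights
```

Note that the realized number of kept instances fluctuates around `sample_rate * n`, which is exactly why reporting the average number of instances actually used per split (as suggested above) would be informative.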
Jan-24-2025, 00:51:32 GMT