Online Gradient Boosting Decision Tree: In-Place Updates for Efficient Adding/Deleting Data
Huawei Lin, Jun Woo Chung, Yingjie Lao, Weijie Zhao
Gradient Boosting Decision Tree (GBDT) is one of the most popular machine learning models across a wide range of applications. However, in the traditional setting, all data must be available at training time: data instances cannot be added or deleted after training. In this paper, we propose an efficient online learning framework for GBDT that supports both incremental and decremental learning. To the best of our knowledge, this is the first work to consider in-place, unified incremental and decremental learning on GBDT. To reduce the learning cost, we present a collection of optimizations for our framework so that it can add or delete a small fraction of data on the fly. We theoretically show the relationship between the hyper-parameters of the proposed optimizations, which enables trading off accuracy against cost in incremental and decremental learning. Backdoor attack results show that our framework can successfully inject a backdoor into, and remove it from, a well-trained model using incremental and decremental learning, and empirical results on public datasets confirm the effectiveness and efficiency of the proposed online learning framework and optimizations.
Feb-3-2025
- Country:
- South America > Brazil
- Rio de Janeiro > Rio de Janeiro (0.04)
- North America
- United States
- Maryland > Baltimore (0.04)
- Hawaii (0.04)
- District of Columbia > Washington (0.04)
- Idaho > Ada County
- Boise (0.04)
- Colorado > Denver County
- Denver (0.04)
- Florida > Broward County
- Fort Lauderdale (0.04)
- Massachusetts > Suffolk County
- Boston (0.04)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- Oregon > Multnomah County
- Portland (0.04)
- Washington > King County
- Seattle (0.04)
- California
- San Francisco County > San Francisco (0.14)
- Santa Clara County > San Jose (0.04)
- Los Angeles County > Long Beach (0.04)
- Canada
- Quebec > Montreal (0.04)
- British Columbia > Metro Vancouver Regional District
- Vancouver (0.28)
- Europe
- United Kingdom
- Scotland > City of Edinburgh
- Edinburgh (0.04)
- England > Oxfordshire
- Oxford (0.04)
- Spain > Valencian Community
- Valencia Province > Valencia (0.04)
- Italy > Liguria
- Genoa (0.04)
- France > Hauts-de-France
- Asia
- Africa > Middle East
- Egypt > Cairo Governorate > Cairo (0.04)
- Genre:
- Research Report > New Finding (1.00)
- Industry:
- Information Technology > Security & Privacy (1.00)
- Education (0.92)
- Technology:
- Information Technology > Artificial Intelligence > Machine Learning
- Statistical Learning (1.00)
- Ensemble Learning (1.00)
- Decision Tree Learning (1.00)
- Neural Networks > Deep Learning (0.46)