Glitches in Decision Tree Ensemble Models
Chandra, Satyankar, Gupta, Ashutosh, Mallik, Kaushik, Shankaranarayanan, Krishna, Varshney, Namrita
Many critical decision-making tasks are now delegated to machine-learned models, and it is imperative that their decisions are trustworthy and reliable, and their outputs are consistent across similar inputs. We identify a new source of unreliable behaviors-called glitches-which may significantly impair the reliability of AI models having steep decision boundaries. Roughly speaking, glitches are small neighborhoods in the input space where the model's output abruptly oscillates with respect to small changes in the input. We provide a formal definition of glitches, and use well-known models and datasets from the literature to demonstrate that they have widespread existence and argue they usually indicate potential model inconsistencies in the neighborhood of where they are found. We proceed to the algorithmic search of glitches for widely used gradient-boosted decision tree (GBDT) models. We prove that the problem of detecting glitches is NP-complete for tree ensembles, already for trees of depth 4. Our glitch-search algorithm for GBDT models uses an MILP encoding of the problem, and its effectiveness and computational feasibility are demonstrated on a set of widely used GBDT benchmarks taken from the literature.
Jul-22-2025
- Country:
- North America
- United States
- New Jersey > Hudson County
- Hoboken (0.04)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- Hawaii > Honolulu County
- Honolulu (0.04)
- California
- San Francisco County > San Francisco (0.14)
- Los Angeles County > Long Beach (0.04)
- New Jersey > Hudson County
- Canada > British Columbia
- Vancouver (0.04)
- United States
- Europe
- Spain > Galicia
- Madrid (0.04)
- Belgium > Wallonia
- Walloon Brabant > Louvain-la-Neuve (0.04)
- Spain > Galicia
- Asia > India
- Maharashtra > Mumbai (0.04)
- North America
- Genre:
- Research Report > New Finding (0.46)
- Industry:
- Health & Medicine > Therapeutic Area (0.94)
- Technology: