RS-ORT: A Reduced-Space Branch-and-Bound Algorithm for Optimal Regression Trees
Heredia, Cristobal, Chumpitaz-Flores, Pedro, Hua, Kaixun
–arXiv.org Artificial Intelligence
Mixed-integer programming (MIP) has emerged as a powerful framework for learning optimal decision trees. Yet, existing MIP approaches for regression tasks are either limited to purely binary features or become computationally intractable when continuous, large-scale data are involved. Naively binarizing continuous features sacrifices global optimality and often yields needlessly deep trees. We recast the optimal regression-tree training as a two-stage optimization problem and propose Reduced-Space Optimal Regression Trees (RS-ORT) - a specialized branch-and-bound (BB) algorithm that branches exclusively on tree-structural variables. This design guarantees the algorithm's convergence and its independence from the number of training samples. Leveraging the model's structure, we introduce several bound tightening techniques - closed-form leaf prediction, empirical threshold discretization, and exact depth-1 subtree parsing - that combine with decomposable upper and lower bounding strategies to accelerate the training. The BB node-wise decomposition enables trivial parallel execution, further alleviating the computational intractability even for million-size datasets. Based on the empirical studies on several regression benchmarks containing both binary and continuous features, RS-ORT also delivers superior training and testing performance than state-of-the-art methods. Notably, on datasets with up to 2,000,000 samples with continuous features, RS-ORT can obtain guaranteed training performance with a simpler tree structure and a better generalization ability in four hours.
arXiv.org Artificial Intelligence
Oct-29-2025
- Country:
- Asia
- Japan (0.04)
- South Korea > Seoul
- Seoul (0.05)
- Europe
- North America
- Costa Rica > Heredia Province
- Heredia (0.04)
- United States
- California > Santa Clara County
- San Jose (0.04)
- Florida > Hillsborough County
- Tampa (0.14)
- Massachusetts > Middlesex County
- Belmont (0.04)
- California > Santa Clara County
- Costa Rica > Heredia Province
- Asia
- Genre:
- Research Report > Promising Solution (0.34)
- Industry:
- Energy (0.68)
- Information Technology (0.46)
- Technology: