Enhancing Bagging Ensemble Regression with Data Integration for Time Series-Based Diabetes Prediction
Ngo, Vuong M., Vinh, Tran Quang, Kearney, Patricia, Roantree, Mark
–arXiv.org Artificial Intelligence
Diabetes is a chronic metabolic disease characterized by elevated blood glucose levels, leading to complications like heart disease, kidney failure, and nerve damage. Accurate state-level predictions are vital for effective healthcare planning and targeted interventions, but in many cases, data for necessary analyses are incomplete. This study begins with a data engineering process to integrate diabetes-related datasets from 2011 to 2021 to create a comprehensive feature set. We then introduce an enhanced bagging ensemble regression model (EBMBag+) for time series forecasting to predict diabetes prevalence across U.S. cities. Several baseline models, including SVMReg, BDTree, LSBoost, NN, LSTM, and ERMBag, were evaluated for comparison with our EBMBag+ algorithm. The experimental results demonstrate that EBMBag+ achieved the best performance, with an MAE of 0.41, RMSE of 0.53, MAPE of 4.01, and an R2 of 0.9.
arXiv.org Artificial Intelligence
Jun-18-2025
- Country:
- Africa > Middle East
- Egypt (0.04)
- Asia
- Bangladesh (0.04)
- Middle East > Saudi Arabia (0.04)
- Vietnam > Hồ Chí Minh City
- Hồ Chí Minh City (0.05)
- Europe > Ireland
- Leinster > County Dublin
- Dublin (0.04)
- Munster > County Cork
- Cork (0.04)
- Leinster > County Dublin
- North America > United States
- Alaska (0.04)
- California > Alameda County
- Berkeley (0.04)
- Africa > Middle East
- Genre:
- Research Report > New Finding (0.48)
- Industry:
- Health & Medicine > Therapeutic Area > Endocrinology > Diabetes (1.00)
- Technology:
- Information Technology
- Artificial Intelligence > Machine Learning
- Ensemble Learning (1.00)
- Neural Networks > Deep Learning (0.71)
- Statistical Learning > Regression (0.66)
- Data Science (1.00)
- Artificial Intelligence > Machine Learning
- Information Technology