TY - CONF
T1 - Enhancing Bagging Ensemble Regression with Data Integration for Time Series-Based Diabetes Prediction
T2 - Lecture Notes in Computer Science
AU - Ngo, V.M.
AU - Tran, Q.V.
AU - Kearney, P.
AU - Roantree, M.
N1 - Export Date: 08 December 2025; Cited By: 0
PY - 2026
Y1 - 2026
N2 - Diabetes is a chronic metabolic disease characterized by elevated blood glucose levels, leading to complications like heart disease, kidney failure, and nerve damage. Accurate state-level predictions are vital for effective healthcare planning and targeted interventions, but in many cases, data for necessary analyses are incomplete. This study begins with a data engineering process to integrate diabetes-related datasets from 2011 to 2021 to create a comprehensive feature set. We then introduce an enhanced bagging ensemble regression model (EBMBag+) for time series forecasting to predict diabetes prevalence across U.S. cities. Several baseline models, including SVMReg, BDTree, LSBoost, NN, LSTM, and ERMBag, were evaluated for comparison with our EBMBag+ algorithm. The experimental results demonstrate that EBMBag+ achieved the best performance, with an MAE of 0.41, RMSE of 0.53, MAPE of 4.01, and an R2 of 0.91. © The Author(s), under exclusive license to Springer Nature Switzerland AG 2026.
AB - Diabetes is a chronic metabolic disease characterized by elevated blood glucose levels, leading to complications like heart disease, kidney failure, and nerve damage. Accurate state-level predictions are vital for effective healthcare planning and targeted interventions, but in many cases, data for necessary analyses are incomplete. This study begins with a data engineering process to integrate diabetes-related datasets from 2011 to 2021 to create a comprehensive feature set. We then introduce an enhanced bagging ensemble regression model (EBMBag+) for time series forecasting to predict diabetes prevalence across U.S. cities. Several baseline models, including SVMReg, BDTree, LSBoost, NN, LSTM, and ERMBag, were evaluated for comparison with our EBMBag+ algorithm. The experimental results demonstrate that EBMBag+ achieved the best performance, with an MAE of 0.41, RMSE of 0.53, MAPE of 4.01, and an R2 of 0.91. © The Author(s), under exclusive license to Springer Nature Switzerland AG 2026.
U2 - 10.1007/978-3-032-09318-9_31
DO - 10.1007/978-3-032-09318-9_31
M3 - Paper
SP - 449
EP - 463
ER -