Enhancing survival prediction for COVID-19 in diabetic patients in Mexico: integrating RMST, propensity score matching, and ensemble machine learning Academic Article in Scopus uri icon

abstract

  • Background: This study evaluates the survival impact of diabetes on hospitalized COVID-19 patients in Mexico by combining traditional survival methods (Restricted Mean Survival Time, RMST) with machine learning (ML) prediction. The goal is to understand how diabetes and associated comorbidities affect short-term survival and to develop accurate, interpretable models that support data-driven decision-making. Methods: A national dataset of over one million COVID-19 cases was analyzed. Diabetic and non-diabetic cohorts were matched using propensity scores based on key covariates (e.g., age, gender, and comorbidities). RMST differences were estimated using survival curves and statistical testing. Separately, machine learning models (Random Forest (RF) and Variational Deep Neural Network (VDNN)) were trained to predict individual RMST values, and SHapley Additive exPlanations (SHAP) were used for model interpretability. Results: The RMST for diabetic patients was lower than that for non-diabetic patients, with a difference of 2.32 days (p = 0.0583) after matching. Predictive models achieved strong internal validity (R2 > 0.60). SHAP analysis revealed obesity, smoking, and hypertension as the top predictors and suggested that temporal variables and comorbidities played a central role in short-term survival. Conclusion: Combining survival analysis with machine learning provides both inferential and predictive insights into the mortality risk of diabetic COVID-19 patients. More importantly, results show that traditional survival analyzes with modern machine learning yields accurate and interpretable predictions that can support personalized interventions tailored to patients with COVID-19 and comorbid diabetes: such as prioritizing early clinical monitoring, individualized treatment plans, or risk-informed hospital admission decisions, and guide a more efficient allocation of healthcare resources. © © 2026 Vargas-Santiago, León-Velasco, Monroy and Quezada-García.

publication date

  • January 1, 2026