Explainable Machine Learning-Based Approach to Identify People at Risk of Diabetes using Physical Activity Monitoring

Research output: Contribution to journal › Journal article › Research › peer-review

Abstract

Objective: This study investigated whether patterns derived from physical activity monitoring (PAM) can identify individuals at risk of type 2 diabetes mellitus (T2DM) through an at-home screening approach based on machine learning. Methods: Data from the 2011–2014 National Health and Nutrition Examination Survey (NHANES) were analyzed, focusing on the PAM component. The primary target was diabetes, defined as HbA1c ≥ 6.5% (48 mmol/mol); the secondary target additionally included individuals with prediabetes, defined as HbA1c ≥ 5.7% (39 mmol/mol). Features derived from PAM, along with age, were used as inputs to an XGBoost classification model. SHapley Additive exPlanations (SHAP) was used to improve the interpretability of the models. Results: The study included 7532 subjects with both PAM and HbA1c data. The model that included only PAM features had a test dataset ROC-AUC of 0.74 (95% CI = 0.72–0.76). When the PAM features were combined with age, the model's ROC-AUC increased to 0.79 (95% CI = 0.78–0.80) in the test dataset. For the secondary target of prediabetes, the XGBoost model had a test dataset ROC-AUC of 0.80 (95% CI = 0.79–0.81). Conclusions: Objective quantification of physical activity through PAM yields information that can be used to identify individuals with undiagnosed diabetes and prediabetes.
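The evaluation described in the abstract (HbA1c-based targets, ROC-AUC with a 95% confidence interval) can be illustrated with a minimal standard-library sketch. It is not the authors' pipeline: the abstract does not say how the confidence intervals were computed, so the percentile bootstrap here is an assumption, and the function names, HbA1c values, and scores are hypothetical placeholders rather than NHANES data or XGBoost output.

```python
import random

DIABETES_HBA1C = 6.5     # percent; primary target threshold stated in the abstract
PREDIABETES_HBA1C = 5.7  # percent; secondary target threshold stated in the abstract


def label_from_hba1c(hba1c_percent, threshold):
    """Return 1 if HbA1c is at or above the threshold, else 0."""
    return 1 if hba1c_percent >= threshold else 0


def roc_auc(labels, scores):
    """ROC-AUC as the fraction of positive/negative pairs ranked correctly (ties count 0.5)."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative label")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


def bootstrap_auc_ci(labels, scores, n_boot=1000, alpha=0.05, seed=0):
    """Percentile bootstrap interval for ROC-AUC (assumed method, not taken from the paper)."""
    if len(set(labels)) < 2:
        raise ValueError("need both classes to bootstrap ROC-AUC")
    rng = random.Random(seed)
    n = len(labels)
    aucs = []
    while len(aucs) < n_boot:
        idx = [rng.randrange(n) for _ in range(n)]
        ys = [labels[i] for i in idx]
        if len(set(ys)) < 2:
            continue  # this resample lacks one class; draw another
        aucs.append(roc_auc(ys, [scores[i] for i in idx]))
    aucs.sort()
    lo_idx = max(0, int((alpha / 2) * n_boot))
    hi_idx = min(n_boot - 1, int((1 - alpha / 2) * n_boot) - 1)
    return aucs[lo_idx], aucs[hi_idx]


# Hypothetical toy data: HbA1c values (percent) and classifier scores.
hba1c = [5.2, 5.8, 6.7, 5.5, 7.1, 6.4, 5.0, 6.9]
scores = [0.1, 0.4, 0.8, 0.2, 0.9, 0.75, 0.3, 0.7]
labels = [label_from_hba1c(h, DIABETES_HBA1C) for h in hba1c]

auc = roc_auc(labels, scores)
ci_low, ci_high = bootstrap_auc_ci(labels, scores, n_boot=500)
print(f"ROC-AUC = {auc:.2f} (95% CI = {ci_low:.2f}-{ci_high:.2f})")
```

Passing `PREDIABETES_HBA1C` instead of `DIABETES_HBA1C` to `label_from_hba1c` yields labels for the secondary target. The pairwise AUC is quadratic in sample size, which is fine for a toy example but would be replaced by a rank-based computation for a cohort of several thousand subjects.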
Original language: English
Article number: 1
Journal: BioMedInformatics
Volume: 5
Issue number: 1
ISSN: 2673-7426
DOIs
Publication status: Published - Mar 2025

Keywords

  • XGBoost
  • physical activity monitoring
  • prediabetes
  • prediction
  • screening
  • type 2 diabetes mellitus
