Journal of Oral Science Research ›› 2026, Vol. 42 ›› Issue (5): 426-432.DOI: 10.13701/j.cnki.kqyxyj.2026.05.011

Previous Articles     Next Articles

Machine Learning-based Construction of Risk Prediction Model for Root Caries in Elderly Patients Aged 60 Years and Above in Shanghai

WANG Hui1*, ZHANG Yi2   

  1. 1. Department of Stomatology, Shanghai Yangpu District Central Hospital, Shanghai 200090, China;
    2. Product R&D Department, an Information Technology Enterp, Shanghai 200090, China
  • Received:2025-08-18 Published:2026-05-25

Abstract: Objective: To construct a risk prediction model for root caries in the elderly using machine learning, identify risk factors, and provide a basis for clinical screening and prevention. Methods: 4,826 patients aged ≥60 from Shanghai Yangpu District Central Hospital (2024-2025) were included. Data were collected via oral examinations, clinical records, and questionnaires. Risk factors were screened by multivariate Logistic regression. The dataset was split into training (70%) and validation (30%) sets. Four machine learning models were built using Python and evaluated effectiveness based on the area under curve (AUC). Results: The prevalence of root caries was 45.3%. Multivariate analysis showed 7 factors (e.g., age, diabetes) were significantly associated (P<0.05). The four models had AUC 0.849-0.894, with Logistic regression optimal and plaque index contributed most. Conclusion: Logistic regression had good predictive performance. Intervening on key risk factors can reduce root caries incidence in the elderly and improve oral health management.

Key words: root caries in the elderly, machine learning, Logistic regression, risk factors, prediction model