口腔医学研究 ›› 2026, Vol. 42 ›› Issue (5): 426-432.DOI: 10.13701/j.cnki.kqyxyj.2026.05.011

• 龋病学研究 • 上一篇    下一篇

基于机器学习的上海某医院60岁及以上就诊老人根面龋风险预测模型构建

王辉1*, 张翼2   

  1. 1.上海市杨浦区中心医院口腔科 上海 200090;
    2.信息技术领域企业 产品研发部 200090
  • 收稿日期:2025-08-18 发布日期:2026-05-25
  • 通讯作者: *王辉,E-mail:745930620@qq.com
  • 作者简介:王辉(1996~ ),女,上海人,口腔医师,硕士,主要从事口腔临床相关研究。

Machine Learning-based Construction of Risk Prediction Model for Root Caries in Elderly Patients Aged 60 Years and Above in Shanghai

WANG Hui1*, ZHANG Yi2   

  1. 1. Department of Stomatology, Shanghai Yangpu District Central Hospital, Shanghai 200090, China;
    2. Product R&D Department, an Information Technology Enterp, Shanghai 200090, China
  • Received:2025-08-18 Published:2026-05-25

摘要: 目的:采用机器学习算法构建老年根面龋风险预测模型,识别根面龋发病的危险因素,为临床筛查高危患者及制定预防策略提供依据。方法:选取2024~2025年上海杨浦区中心医院4826例60岁及以上的患者,经口腔检查、临床数据收集及问卷收集信息。多因素Logistic回归分析筛选危险因素,数据集7∶3分训练集与验证集,Python构建4种机器学习模型,以曲线下面积(area under curve,AUC)评估效果。结果:根面龋患病率45.3%。多因素分析显示年龄、糖尿病等7项因素关联显著(P<0.05)。4种模型AUC为0.849~0.894,Logistic回归最优,菌斑指数贡献最高。结论:Logistic回归模型预测效能良好,针对关键危险因素干预可降低老年根面龋发生率,提升口腔健康管理水平。

关键词: 老年根面龋, 机器学习, Logistic回归, 危险因素, 预测模型

Abstract: Objective: To construct a risk prediction model for root caries in the elderly using machine learning, identify risk factors, and provide a basis for clinical screening and prevention. Methods: 4,826 patients aged ≥60 from Shanghai Yangpu District Central Hospital (2024-2025) were included. Data were collected via oral examinations, clinical records, and questionnaires. Risk factors were screened by multivariate Logistic regression. The dataset was split into training (70%) and validation (30%) sets. Four machine learning models were built using Python and evaluated effectiveness based on the area under curve (AUC). Results: The prevalence of root caries was 45.3%. Multivariate analysis showed 7 factors (e.g., age, diabetes) were significantly associated (P<0.05). The four models had AUC 0.849-0.894, with Logistic regression optimal and plaque index contributed most. Conclusion: Logistic regression had good predictive performance. Intervening on key risk factors can reduce root caries incidence in the elderly and improve oral health management.

Key words: root caries in the elderly, machine learning, Logistic regression, risk factors, prediction model