Journal of Oral Science Research ›› 2025, Vol. 41 ›› Issue (1): 16-20.DOI: 10.13701/j.cnki.kqyxyj.2025.01.004

Previous Articles     Next Articles

Research on Classification of Benign and Malignant Oral Lesions Using ViT-B Deep Learning Model

CUI Yuchen1, XIE Yuandong2, WU Yumiao1, NIU Lingxiao3, CHANG Luguangda1, ZHU Xianchun1*   

  1. 1. Department of Orthodontics, Hospital of Stomatology, Jilin University, Changchun 130021, China;
    2. Jilin Provincial Key Laboratory of Tooth Development and Bone Remodeling, Changchun 130021, China;
    3. Department of Oral and Maxillofacial Surgery, School and Hospital of Stomatology, Jilin University, Changchun 130021, China
  • Received:2024-08-13 Online:2025-01-28 Published:2025-01-24

Abstract: Objective: To analyze the performance of ViT-B model in detecting oral benign and malignant lesions based on deep learning algorithms. Methods: A public dataset containing images of oral benign and malignant lesions was used, with preprocessing and data augmentation applied. The data was randomly divided into training, validation, and test sets in a 7∶2∶1 ratio. Five deep learning models, including ViT-B, VGG16, ResNet101, DenseNet121, and EfficientNetV2, were selected for training and evaluation. The generalization ability of the ViT-B model was evaluated using external data, and the model was analyzed based on the visualization of attention weights. Results: The ViT-B model demonstrated the best performance among five models, with an area under the receiver operating characteristic curve (AUC) of 0.9715 and an accuracy of 91.00%. The model effectively distinguished between images of oral benign and malignant lesions, demonstrating strong generalization ability and clinical applicability. Conclusion: The ViT-B model performs well in the recognition of oral benign and malignant lesions, supporting the early detection and accurate diagnosis of oral cancer.

Key words: oral cancer, oral lesions, deep learning, ViT-B