Abstract:Objective To evaluate the application value of a multi‑label classification model (Endosmart) based on ResNeSt‑50 network for gastroscopic site recognition. Methods A total of 10 172 gastroscopic images involving 24 gastric regions and 13 617 labels were enrolled from Tianjin Medical University General Hospital Airport Hospital and Tianjin Nankai Hospital from January 2018 to July 2022. Of these, 8 501 images were used to develop the Endosmart via deep learning, enabling simultaneous multi‑label prediction of multiple anatomical sites from a single gastroscopic image. Internal validation was performed using 1 671 gastroscopic images. External validation was conducted using 100 gastroscopic video clips from Tianjin Medical University General Hospital from August 2022 to December 2024. The precision, recall, and F1‑score of Endosmart for gastric site recognition were assessed in both the internal image and external video validation sets. The overall recognition performance of Endosmart was further compared with those of 4 mid‑career endoscopists in the video validation set. Results In the internal image validation set, the Endosmart achieved an average precision of 94.2% (95%CI: 92.6%‑95.5%), a recall of 90.3% (95%CI: 88.7%‑91.7%), and an F1‑score of 86.2% (95%CI: 84.2%‑88.2%) for gastric anatomical site recognition at the image level. In the external video validation set, the precision, recall and macro‑average F1‑score of Endosmart for comprehensive recognition of gastric anatomical sites were 95.5% (95%CI: 94.0%‑96.9%), 93.7% (95%CI: 92.4%‑95.0%) and 94.5% (95%CI: 93.3%‑95.6%), respectively. While the 4 mid‑career endoscopists yielded an average precision of 95.5% (95%CI: 93.9%‑96.9%), a recall of 90.7% (95%CI: 89.1%‑91.8%), and a macro-average F1-score of 91.7% (95%CI: 90.3%-93.9%). In the gastric anatomical site recognition task, the per-video micro F1-score of the Endosmart was 94.6% (95%CI: 93.5%-95.6%), compared with the average level of 92.1% (95%CI: 90.6%-93.5%) among the 4 endoscopists. The difference in recognition performance between the model and endoscopists was statistically significant (P0.001). Conclusion The ResNeSt-50-based Endosmart multi-label classification model demonstrates excellent performance in the recognition of anatomical sites from gastroscopic images and videos, and shows superior accuracy to those of mid-career endoscopists. This model can serve as an auxiliary tool for gastroscopy to monitor examination completeness, reduce blind spots, and improve examination quality.