Chrome Extension
WeChat Mini Program
Use on ChatGLM

DBPF-net: Dual-Branch Structural Feature Extraction Reinforcement Network for Ocular Surface Disease Image Classification

FRONTIERS IN MEDICINE(2024)

Nanjing Univ Aeronaut & Astronaut | Jinan Univ

Cited 1|Views21
Abstract
Pterygium and subconjunctival hemorrhage are two common types of ocular surface diseases that can cause distress and anxiety in patients. In this study, 2855 ocular surface images were collected in four categories: normal ocular surface, subconjunctival hemorrhage, pterygium to be observed, and pterygium requiring surgery. We propose a diagnostic classification model for ocular surface diseases, dual-branch network reinforced by PFM block (DBPF-Net), which adopts the conformer model with two-branch architectural properties as the backbone of a four-way classification model for ocular surface diseases. In addition, we propose a block composed of a patch merging layer and a FReLU layer (PFM block) for extracting spatial structure features to further strengthen the feature extraction capability of the model. In practice, only the ocular surface images need to be input into the model to discriminate automatically between the disease categories. We also trained the VGG16, ResNet50, EfficientNetB7, and Conformer models, and evaluated and analyzed the results of all models on the test set. The main evaluation indicators were sensitivity, specificity, F1-score, area under the receiver operating characteristics curve (AUC), kappa coefficient, and accuracy. The accuracy and kappa coefficient of the proposed diagnostic model in several experiments were averaged at 0.9789 and 0.9681, respectively. The sensitivity, specificity, F1-score, and AUC were, respectively, 0.9723, 0.9836, 0.9688, and 0.9869 for diagnosing pterygium to be observed, and, respectively, 0.9210, 0.9905, 0.9292, and 0.9776 for diagnosing pterygium requiring surgery. The proposed method has high clinical reference value for recognizing these four types of ocular surface images.
More
Translated text
Key words
subconjunctival hemorrhage,pterygium,visual recognition,deep learning,computer aided diagnosis
PDF
Bibtex
AI Read Science
AI Summary
AI Summary is the key point extracted automatically understanding the full text of the paper, including the background, methods, results, conclusions, icons and other key content, so that you can get the outline of the paper at a glance.
Example
Background
Key content
Introduction
Methods
Results
Related work
Fund
Key content
  • Pretraining has recently greatly promoted the development of natural language processing (NLP)
  • We show that M6 outperforms the baselines in multimodal downstream tasks, and the large M6 with 10 parameters can reach a better performance
  • We propose a method called M6 that is able to process information of multiple modalities and perform both single-modal and cross-modal understanding and generation
  • The model is scaled to large model with 10 billion parameters with sophisticated deployment, and the 10 -parameter M6-large is the largest pretrained model in Chinese
  • Experimental results show that our proposed M6 outperforms the baseline in a number of downstream tasks concerning both single modality and multiple modalities We will continue the pretraining of extremely large models by increasing data to explore the limit of its performance
Try using models to generate summary,it takes about 60s
Must-Reading Tree
Example
Generate MRT to find the research sequence of this paper
Related Papers

Multi-resolution Visual Mamba with Multi-Directional Selective Mechanism for Retinal Disease Detection

Qiankun Zuo, Zhengkun Shi, Bo Liu, Na Ping, Jiangtao Wang, Xi Cheng, Kexin Zhang,Jia Guo, Yixian Wu,Jin Hong
FRONTIERS IN CELL AND DEVELOPMENTAL BIOLOGY 2024

被引用1

Data Disclaimer
The page data are from open Internet sources, cooperative publishers and automatic analysis results through AI technology. We do not make any commitments and guarantees for the validity, accuracy, correctness, reliability, completeness and timeliness of the page data. If you have any questions, please contact us by email: report@aminer.cn
Chat Paper

要点】:本研究提出了一种用于眼部表面疾病图像分类的双分支结构特征提取增强网络(DBPF-Net),通过PFM块增强了特征提取能力,在识别四种眼部表面疾病上具有高临床参考价值。

方法】:研究采用基于conformer模型的两分支架构作为基础,引入了由补丁合并层和FReLU层构成的PFM块,以强化模型的空间结构特征提取能力。

实验】:使用了2855张眼部表面图像,分为正常、结膜下出血、需观察的翼状胬肉和需手术的翼状胬肉四类,训练并比较了VGG16、ResNet50、EfficientNetB7和Conformer模型,实验结果显示所提模型的平均准确度和Kappa系数分别为0.9789和0.9681,对需观察和需手术的翼状胬肉的敏感度、特异性、F1分数和AUC均表现出较高值。