Electronic Science and Technology ›› 2025, Vol. 38 ›› Issue (3): 47-59.doi: 10.16180/j.cnki.issn1007-7820.2025.03.007

Previous Articles     Next Articles

Integration of CNN and Transformer for Retinal OCT Image Fluid Segmentation Method

CHEN Yuyang(), LI Feng   

  1. School of Optical-Electrical and Computer Engineering,University of Shanghai for Science and Technology,Shanghai 200093,China
  • Received:2023-09-03 Revised:2023-10-05 Online:2025-03-15 Published:2025-03-11
  • Supported by:
    National Key R&D Program of China(2020YFC2008704)

Abstract:

In view of the problems such as small size, heterogeneous shape and fuzzy details of the fluid accumulation area, this study integrates CNN(Convolutional Neural Networks) and Transformer to propose an innovative multi-branch segmentation network. The network consists of full convolutional path, Transformer path and CNN-Transformer fusion path. The fully convolutional path is used to capture detailed features of the lesion area, while the Transformer path extracts multi-scale non-local feature information with long-range dependencies. The fusion path takes advantage of both CNN and Transformer to make up for the shortcomings of other branches. The features of the three branches are integrated through the prediction head to generate the final segmentation map. The performance of retinal effusion segmentation is tested on Kermany, UMN and DUKE data sets for intraretinal effusion and subretinal effusion. The experimental results show that the Dice coefficient of the proposed method is 86.63%, the crossover ratio is 77.02%, the sensitivity is 89.47%, and the accuracy is 85.51%, which proves its effectiveness and provides a feasible solution for the automatic segmentation of retinal effusion.

Key words: retinal OCT images, convolutional neural network, Transformer, segmentation network, IRF, SRF, retinal effusion, attention mechanism

CLC Number: 

  • TP391