Volume 19 , Issue 2 , PP: 151-169, 2025 | Cite this article as | XML | Html | PDF | Full Length Article
Archana E. 1 , Geetha S. 2 , Victo Sudha George G. 3
Doi: https://doi.org/10.54216/FPA.190212
Glaucoma is a dangerous eye illness that greatly reduces the sharpness of a person's vision. If not caught early enough, this retinal disorder can damage the optic nerve head (ONH) and cause permanent blindness. Automated glaucoma diagnosis now has tool support thanks to recent advances in deep learning besides the convenience of computing resources. The low reliability of generic convolutional neural networks has prevented their widespread usage in medical procedures, even if deep learning has improved illness diagnosis using medical pictures. While there has been a rise in the use of deep learning for glaucoma classification, very few studies have tested whether or not the models are easy to understand and interpret, which bodes well for their future use. Medical picture feature extraction using Vision Transformers is showcased in this study utilising an EvoTransform: Advanced Evolutionary Algorithm Integration in Transformer Networks named as (EvoTAEA). Combining the powers of Convolutional Neural Networks with Vision Transformers, the suggested EAT Former architecture takes advantage of their data pattern recognition in addition adaptability capabilities. The classification accuracy is enhanced by using the Wild Geese Migration Optimizer (WGMO) to fine-tune the parameters of the proposed feature extraction. The design makes use of new parts, such as the Multi-Scale Region Aggregation, Global and Local Interaction, and Enhanced EA based Transformer blocks with Feed-Forward networks. For dynamically simulating non-standard places, it also presents the Modulated Deformable MSA module. Important components of the Vision Transformer (ViT) model are covered in the study, including patch-based processing, Multi-Head Attention mechanism, and positional context inclusion. In order to give an inductive bias, it presents the Multi-Scale Region Aggregation module, which combines data from several receptive fields. The MSA-based global module is improved by the Global and Local Interaction module, which adds a local path for extracting discriminative local info. An approach to glaucoma diagnosis that integrates ResNet-50, DenseNet-201, and Xception is suggested in the study. Two publicly available datasets, ORIGA and ACRIMA, are used to evaluate the trials. This model can help with the automated diagnosis of glaucoma using fundus pictures.
EvoTransform: Advanced Evolutionary Algorithm , Wild Geese Migration Optimizer , Multi-Scale Region Aggregation , Fundus Images , DenseNet-201 , Glaucoma detection
[1] R. Fan et al., “Detecting glaucoma from fundus photographs using deep learning without convolutions: Transformer for improved generalization,” Ophthalmology Science, vol. 3, no. 1, p. 100233, 2023.
[2] R. Hemelings et al., “A generalizable deep learning regression model for automated glaucoma screening from fundus images,” NPJ Digital Medicine, vol. 6, no. 1, p. 112, 2023.
[3] A. Shoukat, S. Akbar, S. A. Hassan, S. Iqbal, A. Mehmood, and Q. M. Ilyas, “Automatic diagnosis of glaucoma from retinal images using deep learning approach,” Diagnostics, vol. 13, no. 10, p. 1738, 2023.
[4] R. Thanki, “A deep neural network and machine learning approach for retinal fundus image classification,” Healthcare Analytics, vol. 3, p. 100140, 2023.
[5] L. J. Coan et al., “Automatic detection of glaucoma via fundus imaging and artificial intelligence: A review,” Survey of Ophthalmology, vol. 68, no. 1, pp. 17–41, 2023.
[6] A. Elmoufidi, A. Skouta, S. Jai-Andaloussi, and O. Ouchetto, “CNN with multiple inputs for automatic glaucoma assessment using fundus images,” International Journal of Image and Graphics, vol. 23, no. 1, p. 2350012, 2023.
[7] B. Gunapriya, T. Rajesh, A. Thirumalraj, and B. Manjunatha, “LW-CNN-based extraction with optimized encoder-decoder model for detection of diabetic retinopathy,” Frontier Scientific Publishing Pte. Ltd., vol. 1095, 2023.
[8] S. Santhosh and D. V. Babu, “Retinal glaucoma detection from digital fundus images using deep learning approach,” in Proc. 7th Int. Conf. Comput. Methodol. Commun. (ICCMC), Feb. 2023, pp. 68–72.
[9] V. Kurilová et al., “Detecting glaucoma from fundus images using ensemble learning,” Journal of Electrical Engineering, vol. 74, no. 4, pp. 328–335, 2023.
[10] V. A. Devi, A. Thirumalraj, B. P. Kavin, and G. H. Seng, “Securing the predicted disease data using transfer learning in cloud-based healthcare 5.0,” in Intelligent Systems and Industrial Internet of Things for Sustainable Development, Chapman and Hall/CRC, pp. 101–117.
[11] R. Shanthakumari et al., “Glaucoma detection using fundus images using deep learning,” in Proc. Int. Conf. Circuit Power Comput. Technol. (ICCPCT), Aug. 2023, pp. 1887–1894.
[12] D. Chen et al., “Applications of artificial intelligence and deep learning in glaucoma,” Asia-Pacific Journal of Ophthalmology, vol. 12, no. 1, pp. 80–93, 2023.
[13] R. K. Patel and M. Kashyap, “Automated screening of glaucoma stages from retinal fundus images using BPS and LBP based GLCM features,” International Journal of Imaging Systems and Technology, vol. 33, no. 1, pp. 246–261, 2023.
[14] T. Shyamalee, D. Meedeniya, G. Lim, and M. Karunarathne, “Automated tool support for glaucoma identification with explainability using fundus images,” IEEE Access, 2024.
[15] D. Das, D. R. Nayak, S. V. Bhandary, and U. R. Acharya, “CDAM-Net: Channel shuffle dual attention based multi-scale CNN for efficient glaucoma detection using fundus images,” Engineering Applications of Artificial Intelligence, vol. 133, p. 108454, 2024.
[16] G. S. Nugraha, A. Juliansyah, and M. Tajuddin, “Glaucoma detection based on texture feature of neuro retinal rim area in retinal fundus image,” International Journal of Health and Information System, vol. 1, no. 3, pp. 117–127, 2024.
[17] M. Govindan, V. K. Dhakshnamurthy, K. Sreerangan, M. D. Nagarajan, and S. K. Rajamanickam, “A framework for early detection of glaucoma in retinal fundus images using deep learning,” Engineering Proceedings, vol. 62, no. 1, p. 3, 2024.
[18] L. K. Singh, M. Khanna, S. Thawkar, and R. Singh, “A novel hybridized feature selection strategy for the effective prediction of glaucoma in retinal fundus images,” Multimedia Tools and Applications, vol. 83, no. 15, pp. 46087–46159, 2024.
[19] X. R. Gao, F. Wu, P. T. Yuhas, R. K. Rasel, and M. Chiariglione, “Automated vertical cup-to-disc ratio determination from fundus images for glaucoma detection,” Scientific Reports, vol. 14, no. 1, p. 4494, 2024.
[20] K. Gunasekaran et al., “An efficient cardiovascular disease prediction using multi-scale weighted feature fusion-based convolutional neural network with residual gated recurrent unit,” Computer Methods in Biomechanics and Biomedical Engineering, vol. 27, no. 9, pp. 1181–1205, 2024.
[21] A. Diaz-Pinto et al., “CNNs for automatic glaucoma assessment using fundus images: An extensive validation,” Biomed. Eng. OnLine, vol. 18, no. 1, pp. 1–19, Dec. 2019.
[22] J. Sivaswamy et al., “Drishti-GS: Retinal image dataset for optic nerve head (ONH) segmentation,” in Proc. IEEE 11th Int. Symp. Biomed. Imag. (ISBI), Apr. 2014, pp. 53–56.
[23] W. Wang et al., “Pyramid vision transformer: A versatile backbone for dense prediction without convolutions,” in Proc. IEEE/CVF Int. Conf. Comput. Vis., 2021, pp. 568–578.
[24] A. Dosovitskiy et al., “An image is worth 16×16 words: Transformers for image recognition at scale,” arXiv preprint arXiv: 2010.11929, 2020.
[25] F. Chollet, “Xception: Deep learning with depthwise separable convolutions,” in Proc. IEEE Conf. Comput. Vis. Pattern Recognit., Honolulu, HI, USA, 21–26 July 2017, pp. 1251–1258.