Detailed Information

Attention-Aided Generative Learning for Multi-Scale Multi-Modal Fundus Image Translation (open access)

Authors
Pham, Van-Nguyen; Le, Duc-Tai; Bum, Junghyun; Lee, Eun Jung; Han, Jong Chul; Choo, Hyunseung
Issue Date
2023
Publisher
Institute of Electrical and Electronics Engineers (IEEE)
Keywords
Optical imaging; Task analysis; Diseases; Ophthalmology; Mathematical models; Deep learning; Biomedical optical imaging; Conventional fundus images; deep learning; generative learning; ophthalmology; unpaired image-to-image translation; ultra wide-field fundus images
Citation
IEEE ACCESS, v.11, pp.51701 - 51711
Indexed
SCIE; SCOPUS
Journal Title
IEEE ACCESS
Volume
11
Start Page
51701
End Page
51711
URI
https://scholarworks.bwise.kr/skku/handle/2021.sw.skku/106502
DOI
10.1109/ACCESS.2023.3278596
ISSN
2169-3536
Abstract
Conventional fundus images (CFIs) and ultra-widefield fundus images (UFIs) are two fundamental image modalities in ophthalmology. While CFIs provide a detailed view of the optic nerve head and the posterior pole of an eye, their clinical use is associated with high costs and patient inconvenience due to the requirement of good pupil dilation. On the other hand, UFIs capture peripheral lesions, but their image quality is sensitive to factors such as pupil size, eye position, and eyelashes, leading to greater variability between examinations compared to CFIs. The widefield retina view of UFIs offers the theoretical possibility of generating CFIs from available UFIs to reduce patient examination costs. A recent study has shown the feasibility of this approach by leveraging deep learning techniques for the UFI-to-CFI translation task. However, the technique suffers from the heterogeneous scales of the image modalities and variations in the brightness of the training data. In this paper, we address these issues with a novel framework consisting of three stages: cropping, enhancement, and translation. The first stage is an optic disc-centered cropping strategy that helps to alleviate the scale difference between the two image domains. The second stage mitigates the variation in training data brightness and unifies the mask between the two modalities. In the last stage, we introduce an attention-aided generative learning model to translate a given UFI into the CFI domain. Our experimental results demonstrate the success of the proposed method on 1,011 UFIs, with 99.8% of the generated CFIs evaluated as good quality and usable. Expert evaluations confirm significant visual quality improvements in the generated CFIs compared to the UFIs, ranging from 10% to 80% for features such as optic nerve structure, vascular distribution, and drusen. Furthermore, using generated CFIs in an AI-based diagnosis system for age-related macular degeneration results in superior accuracy compared to UFIs and competitive performance relative to real CFIs. These results showcase the potential of our approach for automatic disease diagnosis and monitoring.
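
The abstract describes the framework only at a high level; as a reading aid, here is a minimal sketch of how the three stages (disc-centered cropping, brightness enhancement, attention-aided translation) could fit together, assuming a PyTorch implementation. Everything below is illustrative: the function names, the 512-pixel crop, the per-channel normalization standing in for the paper's enhancement stage, the single spatial attention gate, and the toy generator are assumptions rather than the authors' released code, and locating the optic-disc center `disc_xy` (a separate detection step) is outside the sketch.

```python
# Hedged sketch of a three-stage UFI-to-CFI pipeline; all names and
# design choices here are illustrative assumptions, not the paper's code.
import torch
import torch.nn as nn


def disc_centered_crop(ufi: torch.Tensor, disc_xy: tuple[int, int],
                       size: int = 512) -> torch.Tensor:
    """Stage 1: crop a square patch centered on the optic disc so the
    UFI patch roughly matches a CFI's field of view (assumes size <= H, W)."""
    _, h, w = ufi.shape
    x, y = disc_xy
    half = size // 2
    top = min(max(y - half, 0), h - size)
    left = min(max(x - half, 0), w - size)
    return ufi[:, top:top + size, left:left + size]


def normalize_brightness(img: torch.Tensor) -> torch.Tensor:
    """Stage 2 (simplified stand-in for the enhancement stage): rescale each
    channel to zero mean / unit variance to damp brightness variation."""
    mean = img.mean(dim=(1, 2), keepdim=True)
    std = img.std(dim=(1, 2), keepdim=True).clamp_min(1e-5)
    return (img - mean) / std


class AttentionGate(nn.Module):
    """One plausible reading of "attention-aided": features reweighted by a
    learned per-pixel sigmoid mask."""
    def __init__(self, channels: int):
        super().__init__()
        self.mask = nn.Sequential(nn.Conv2d(channels, 1, 1), nn.Sigmoid())

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return x * self.mask(x)


class Generator(nn.Module):
    """Stage 3: a toy encoder / attention / decoder generator for the
    UFI -> CFI translation."""
    def __init__(self, ch: int = 32):
        super().__init__()
        self.enc = nn.Sequential(
            nn.Conv2d(3, ch, 3, padding=1), nn.ReLU(),
            nn.Conv2d(ch, ch, 3, padding=1), nn.ReLU())
        self.attn = AttentionGate(ch)
        self.dec = nn.Conv2d(ch, 3, 3, padding=1)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return torch.tanh(self.dec(self.attn(self.enc(x))))


if __name__ == "__main__":
    ufi = torch.rand(3, 1024, 1024)                  # placeholder UFI
    patch = disc_centered_crop(ufi, disc_xy=(700, 500))
    patch = normalize_brightness(patch)
    fake_cfi = Generator()(patch.unsqueeze(0))       # -> 1 x 3 x 512 x 512
    print(fake_cfi.shape)
```

In the paper's unpaired setting, a generator like this would be trained adversarially (e.g., with a CycleGAN-style cycle-consistency objective against a CFI discriminator) rather than applied untrained as in this demonstration.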
Appears in Collections
Computing and Informatics > Computer Science and Engineering > 1. Journal Articles
Medicine > Department of Medicine > 1. Journal Articles
Information and Communication Engineering > School of Electronic and Electrical Engineering > 1. Journal Articles
