61 research outputs found

    ARA-net: an attention-aware retinal atrophy segmentation network coping with fundus images

    Get PDF
    BackgroundAccurately detecting and segmenting areas of retinal atrophy are paramount for early medical intervention in pathological myopia (PM). However, segmenting retinal atrophic areas based on a two-dimensional (2D) fundus image poses several challenges, such as blurred boundaries, irregular shapes, and size variation. To overcome these challenges, we have proposed an attention-aware retinal atrophy segmentation network (ARA-Net) to segment retinal atrophy areas from the 2D fundus image.MethodsIn particular, the ARA-Net adopts a similar strategy as UNet to perform the area segmentation. Skip self-attention connection (SSA) block, comprising a shortcut and a parallel polarized self-attention (PPSA) block, has been proposed to deal with the challenges of blurred boundaries and irregular shapes of the retinal atrophic region. Further, we have proposed a multi-scale feature flow (MSFF) to challenge the size variation. We have added the flow between the SSA connection blocks, allowing for capturing considerable semantic information to detect retinal atrophy in various area sizes.ResultsThe proposed method has been validated on the Pathological Myopia (PALM) dataset. Experimental results demonstrate that our method yields a high dice coefficient (DICE) of 84.26%, Jaccard index (JAC) of 72.80%, and F1-score of 84.57%, which outperforms other methods significantly.ConclusionOur results have demonstrated that ARA-Net is an effective and efficient approach for retinal atrophic area segmentation in PM

    A method for quantifying sectoral optic disc pallor in fundus photographs and its association with peripapillary RNFL thickness

    Full text link
    Purpose: To develop an automatic method of quantifying optic disc pallor in fundus photographs and determine associations with peripapillary retinal nerve fibre layer (pRNFL) thickness. Methods: We used deep learning to segment the optic disc, fovea, and vessels in fundus photographs, and measured pallor. We assessed the relationship between pallor and pRNFL thickness derived from optical coherence tomography scans in 118 participants. Separately, we used images diagnosed by clinical inspection as pale (N=45) and assessed how measurements compared to healthy controls (N=46). We also developed automatic rejection thresholds, and tested the software for robustness to camera type, image format, and resolution. Results: We developed software that automatically quantified disc pallor across several zones in fundus photographs. Pallor was associated with pRNFL thickness globally (\b{eta} = -9.81 (SE = 3.16), p < 0.05), in the temporal inferior zone (\b{eta} = -29.78 (SE = 8.32), p < 0.01), with the nasal/temporal ratio (\b{eta} = 0.88 (SE = 0.34), p < 0.05), and in the whole disc (\b{eta} = -8.22 (SE = 2.92), p < 0.05). Furthermore, pallor was significantly higher in the patient group. Lastly, we demonstrate the analysis to be robust to camera type, image format, and resolution. Conclusions: We developed software that automatically locates and quantifies disc pallor in fundus photographs and found associations between pallor measurements and pRNFL thickness. Translational relevance: We think our method will be useful for the identification, monitoring and progression of diseases characterized by disc pallor/optic atrophy, including glaucoma, compression, and potentially in neurodegenerative disorders.Comment: 44 pages, 20 figures, 7 tables, submitte

    Machine Learning Approaches for Automated Glaucoma Detection using Clinical Data and Optical Coherence Tomography Images

    Full text link
    Glaucoma is a multi-factorial, progressive blinding optic-neuropathy. A variety of factors, including genetics, vasculature, anatomy, and immune factors, are involved. Worldwide more than 80 million people are affected by glaucoma, and around 300,000 in Australia, where 50% remain undiagnosed. Untreated glaucoma can lead to blindness. Early detection by Artificial intelligence (AI) is crucial to accelerate the diagnosis process and can prevent further vision loss. Many proposed AI systems have shown promising performance for automated glaucoma detection using two-dimensional (2D) data. However, only a few studies had optimistic outcomes for glaucoma detection and staging. Moreover, the automated AI system still faces challenges in diagnosing at the cliniciansโ€™ level due to the lack of interpretability of the ML algorithms and integration of multiple clinical data. AI technology would be welcomed by doctors and patients if the "black box" notion is overcome by developing an explainable, transparent AI system with similar pathological markers used by clinicians as the sign of early detection and progression of glaucomatous damage. Therefore, the thesis aimed to develop a comprehensive AI model to detect and stage glaucoma by incorporating a variety of clinical data and utilising advanced data analysis and machine learning (ML) techniques. The research first focuses on optimising glaucoma diagnostic features by combining structural, functional, demographic, risk factor, and optical coherence tomography (OCT) features. The significant features were evaluated using statistical analysis and trained in ML algorithms to observe the detection performance. Three crucial structural ONH OCT features: cross-sectional 2D radial B-scan, 3D vascular angiography and temporal-superior-nasal-inferior-temporal (TSNIT) B-scan, were analysed and trained in explainable deep learning (DL) models for automated glaucoma prediction. The explanation behind the decision making of DL models were successfully demonstrated using the feature visualisation. The structural features or distinguished affected regions of TSNIT OCT scans were precisely localised for glaucoma patients. This is consistent with the concept of explainable DL, which refers to the idea of making the decision-making processes of DL models transparent and interpretable to humans. However, artifacts and speckle noise often result in misinterpretation of the TSNIT OCT scans. This research also developed an automated DL model to remove the artifacts and noise from the OCT scans, facilitating error-free retinal layers segmentation, accurate tissue thickness estimation and image interpretation. Moreover, to monitor and grade glaucoma severity, the visual field (VF) test is commonly followed by clinicians for treatment and management. Therefore, this research uses the functional features extracted from VF images to train ML algorithms for staging glaucoma from early to advanced/severe stages. Finally, the selected significant features were used to design and develop a comprehensive AI model to detect and grade glaucoma stages based on the data quantity and availability. In the first stage, a DL model was trained with TSNIT OCT scans, and its output was combined with significant structural and functional features and trained in ML models. The best-performed ML model achieved an area under the curve (AUC): 0.98, an accuracy of 97.2%, a sensitivity of 97.9%, and a specificity of 96.4% for detecting glaucoma. The model achieved an overall accuracy of 90.7% and an F1 score of 84.0% for classifying normal, early, moderate, and advanced-stage glaucoma. In conclusion, this thesis developed and proposed a comprehensive, evidence-based AI model that will solve the screening problem for large populations and relieve experts from manually analysing a slew of patient data and associated misinterpretation problems. Moreover, this thesis demonstrated three structural OCT features that could be added as excellent diagnostic markers for precise glaucoma diagnosis

    Towards non-vascular fundus image analysis and disease detection

    Get PDF
    Assessment of retinal fundus image is very informative and preventive in early ocular disease detection. This non-invasive assessment of fundus images also helps in the early diagnosis of vascular diseases. This unique combination help in the early diagnosis of diseases. Applying image enhancement techniques with advanced Deep learning techniques helps to overcome such a challenging problem. Most Deep learning models give a diagnosis without attention to underlying pathological abnormalities. In this thesis, we tried to solve the problem in the same way as ophthalmologists and experts in the field approach the problem. We created models that can detect an Optic disc, Optic cup, and vascular regions in the image. This work can be integrated into any ocular disease detection, such as glaucoma, and vascular disease detection, such as diabetes. Extensive work is applied for better sampling when all models were suffering from a lack of data in the medical imaging field. The entire work on the retinal fundus image was in 2d images. In the extension of this work, we applied our knowledge to 3d MRI-Brain images. We attempt to predict attention scores in children, which is a big factor in the detection of kids with ADHD. But both work on fundus images and brain MRI images are under the umbrella of medical imaging. We believe this advancement in this line of research can be very valuable for future researchers in the area of automated medical imaging, especially in automated retinal disease diagnosis

    ๋”ฅ๋Ÿฌ๋‹์„ ์ด์šฉํ•œ ๋…น๋‚ด์žฅ ์ง„๋‹จ ๋ณด์กฐ ์‹œ์Šคํ…œ

    Get PDF
    ํ•™์œ„๋…ผ๋ฌธ (๋ฐ•์‚ฌ) -- ์„œ์šธ๋Œ€ํ•™๊ต ๋Œ€ํ•™์› : ๊ณต๊ณผ๋Œ€ํ•™ ํ˜‘๋™๊ณผ์ • ๋ฐ”์ด์˜ค์—”์ง€๋‹ˆ์–ด๋ง์ „๊ณต, 2021. 2. ๊น€ํฌ์ฐฌ.๋ณธ ๋…ผ๋ฌธ์—์„œ๋Š” ๋”ฅ ๋Ÿฌ๋‹ ๊ธฐ๋ฐ˜์˜ ์ง„๋‹จ ๋ณด์กฐ ์‹œ์Šคํ…œ์„ ์ œ์•ˆํ•˜์˜€๋‹ค. ์ƒˆ๋กœ์šด ๋ฐฉ๋ฒ•์ด ๋…น๋‚ด์žฅ ๋ฐ์ดํ„ฐ์— ์ ์šฉ๋˜์—ˆ๊ณ  ๊ฒฐ๊ณผ๋ฅผ ํ‰๊ฐ€ํ•˜์˜€๋‹ค. ์ฒซ๋ฒˆ์งธ ์—ฐ๊ตฌ์—์„œ๋Š” ์ŠคํŽ™ํŠธ๋Ÿผ์˜์—ญ ๋น›๊ฐ„์„ญ๋‹จ์ธต์ดฌ์˜๊ธฐ(SD-OCT)๋ฅผ ๋”ฅ ๋Ÿฌ๋‹ ๋ถ„๋ฅ˜ ๊ธฐ๋ฅผ ์ด์šฉํ•ด ๋ถ„์„ํ•˜์˜€๋‹ค. ์ŠคํŽ™ํŠธ๋Ÿผ์˜์—ญ ๋น›๊ฐ„์„ญ๋‹จ์ธต์ดฌ์˜๊ธฐ๋Š” ๋…น๋‚ด์žฅ์œผ๋กœ ์ธํ•œ ๊ตฌ์กฐ์  ์†์ƒ์„ ํ‰๊ฐ€ํ•˜๊ธฐ ์œ„ํ•ด ์‚ฌ์šฉํ•˜๋Š” ์žฅ๋น„์ด๋‹ค. ๋ถ„๋ฅ˜ ์•Œ๊ณ ๋ฆฌ์ฆ˜์€ ํ•ฉ์„ฑ ๊ณฑ ์‹ ๊ฒฝ๋ง์„ ์ด์šฉํ•ด ๊ฐœ๋ฐœ ๋˜์—ˆ์œผ๋ฉฐ, ์ŠคํŽ™ํŠธ๋Ÿผ์˜์—ญ ๋น›๊ฐ„์„ญ๋‹จ์ธต์ดฌ์˜๊ธฐ์˜ ๋ง๋ง‰์‹ ๊ฒฝ์„ฌ์œ ์ธต(RNFL)๊ณผ ํ™ฉ๋ฐ˜๋ถ€ ์‹ ๊ฒฝ์ ˆ์„ธํฌ๋‚ด๋ง์ƒ์ธต (GCIPL) ์‚ฌ์ง„์„ ์ด์šฉํ•ด ํ•™์Šตํ–ˆ๋‹ค. ์ œ์•ˆํ•œ ๋ฐฉ๋ฒ•์€ ๋‘๊ฐœ์˜ ์ด๋ฏธ์ง€๋ฅผ ์ž…๋ ฅ์œผ๋กœ ๋ฐ›๋Š” ์ด์ค‘์ž…๋ ฅํ•ฉ์„ฑ๊ณฑ์‹ ๊ฒฝ๋ง(DICNN)์ด๋ฉฐ, ๋”ฅ ๋Ÿฌ๋‹ ๋ถ„๋ฅ˜์—์„œ ํšจ๊ณผ์ ์ธ ๊ฒƒ์œผ๋กœ ์•Œ๋ ค์ ธ ์žˆ๋‹ค. ์ด์ค‘์ž…๋ ฅํ•ฉ์„ฑ๊ณฑ์‹ ๊ฒฝ๋ง์€ ๋ง๋ง‰์‹ ๊ฒฝ์„ฌ์œ ์ธต ๊ณผ ์‹ ๊ฒฝ์ ˆ์„ธํฌ์ธต ์˜ ๋‘๊ป˜ ์ง€๋„๋ฅผ ์ด์šฉํ•˜์—ฌ ํ•™์Šต ๋์œผ๋ฉฐ, ํ•™์Šต๋œ ๋„คํŠธ์›Œํฌ๋Š” ๋…น๋‚ด์žฅ๊ณผ ์ •์ƒ ๊ตฐ์„ ๊ตฌ๋ถ„ํ•œ๋‹ค. ์ด์ค‘์ž…๋ ฅํ•ฉ์„ฑ๊ณฑ์‹ ๊ฒฝ๋ง์€ ์ •ํ™•๋„์™€ ์ˆ˜์‹ ๊ธฐ๋™์ž‘ํŠน์„ฑ๊ณก์„ ํ•˜๋ฉด์  (AUC)์œผ๋กœ ํ‰๊ฐ€ ๋˜์—ˆ๋‹ค. ๋ง๋ง‰์‹ ๊ฒฝ์„ฌ์œ ์ธต๊ณผ ์‹ ๊ฒฝ์ ˆ์„ธํฌ์ธต ๋‘๊ป˜ ์ง€๋„๋กœ ํ•™์Šต๋œ ์„ค๊ณ„ํ•œ ๋”ฅ ๋Ÿฌ๋‹ ๋ชจ๋ธ์„ ์กฐ๊ธฐ ๋…น๋‚ด์žฅ๊ณผ ์ •์ƒ ๊ตฐ์„ ๋ถ„๋ฅ˜ํ•˜๋Š” ์„ฑ๋Šฅ์„ ํ‰๊ฐ€ํ•˜๊ณ  ๋น„๊ตํ•˜์˜€๋‹ค. ์„ฑ๋Šฅํ‰๊ฐ€ ๊ฒฐ๊ณผ ์ด์ค‘์ž…๋ ฅํ•ฉ์„ฑ๊ณฑ์‹ ๊ฒฝ๋ง์€ ์กฐ๊ธฐ ๋…น๋‚ด์žฅ์„ ๋ถ„๋ฅ˜ํ•˜๋Š”๋ฐ 0.869์˜ ์ˆ˜์‹ ๊ธฐ๋™์ž‘ํŠน์„ฑ๊ณก์„ ์˜๋„“์ด์™€ 0.921์˜ ๋ฏผ๊ฐ๋„, 0.756์˜ ํŠน์ด๋„๋ฅผ ๋ณด์˜€๋‹ค. ๋‘๋ฒˆ์งธ ์—ฐ๊ตฌ์—์„œ๋Š” ๋”ฅ ๋Ÿฌ๋‹์„ ์ด์šฉํ•ด ์‹œ์‹ ๊ฒฝ์œ ๋‘์‚ฌ์ง„์˜ ํ•ด์ƒ๋„์™€ ๋Œ€๋น„, ์ƒ‰๊ฐ, ๋ฐ๊ธฐ๋ฅผ ๋ณด์ •ํ•˜๋Š” ๋ฐฉ๋ฒ•์„ ์ œ์•ˆํ•˜์˜€๋‹ค. ์‹œ์‹ ๊ฒฝ์œ ๋‘์‚ฌ์ง„์€ ๋…น๋‚ด์žฅ์„ ์ง„๋‹จํ•˜๋Š”๋ฐ ์žˆ์–ด ํšจ๊ณผ์ ์ธ ๊ฒƒ์œผ๋กœ ์•Œ๋ ค์ ธ ์žˆ๋‹ค. ํ•˜์ง€๋งŒ, ๋…น๋‚ด์žฅ์˜ ์ง„๋‹จ์—์„œ ํ™˜์ž์˜ ๋‚˜, ์ž‘์€ ๋™๊ณต, ๋งค์ฒด ๋ถˆํˆฌ๋ช…์„ฑ ๋“ฑ์œผ๋กœ ์ธํ•ด ํ‰๊ฐ€๊ฐ€ ์–ด๋ ค์šด ๊ฒฝ์šฐ๊ฐ€ ์žˆ๋‹ค. ์ดˆ ํ•ด์ƒ๋„์™€ ๋ณด์ • ์•Œ๊ณ ๋ฆฌ์ฆ˜์€ ์ดˆ ํ•ด์ƒ๋„ ์ ๋Œ€์ ์ƒ์„ฑ์‹ ๊ฒฝ๋ง์„ ํ†ตํ•ด ๊ฐœ๋ฐœ๋˜์—ˆ๋‹ค. ์›๋ณธ ๊ณ ํ•ด์ƒ๋„์˜ ์‹œ์‹ ๊ฒฝ ์œ ๋‘ ์‚ฌ์ง„์€ ์ €ํ•ด์ƒ๋„ ์‚ฌ์ง„์œผ๋กœ ์ถ•์†Œ๋˜๊ณ , ๋ณด์ •๋œ ๊ณ ํ•ด์ƒ๋„ ์‹œ์‹ ๊ฒฝ์œ ๋‘์‚ฌ์ง„์œผ๋กœ ๋ณด์ • ๋˜๋ฉฐ, ๋ณด์ •๋œ ์‚ฌ์ง„์€ ์‹œ์‹ ๊ฒฝ์—ฌ๋ฐฑ์˜ ๊ฐ€์‹œ์„ฑ๊ณผ ๊ทผ์ฒ˜ ํ˜ˆ๊ด€์„ ์ž˜ ๋ณด์ด๋„๋ก ํ›„์ฒ˜๋ฆฌ ์•Œ๊ณ ๋ฆฌ์ฆ˜์„ ์ด์šฉํ•œ๋‹ค. ์ €ํ•ด์ƒ๋„์ด๋ฏธ์ง€๋ฅผ ๋ณด์ •๋œ ๊ณ ํ•ด์ƒ๋„์ด๋ฏธ์ง€๋กœ ๋ณต์›ํ•˜๋Š” ๊ณผ์ •์„ ์ดˆํ•ด์ƒ๋„์ ๋Œ€์ ์‹ ๊ฒฝ๋ง์„ ํ†ตํ•ด ํ•™์Šตํ•œ๋‹ค. ์„ค๊ณ„ํ•œ ๋„คํŠธ์›Œํฌ๋Š” ์‹ ํ˜ธ ๋Œ€ ์žก์Œ ๋น„(PSNR)๊ณผ ๊ตฌ์กฐ์ ์œ ์‚ฌ์„ฑ(SSIM), ํ‰๊ท ํ‰๊ฐ€์ (MOS)๋ฅผ ์ด์šฉํ•ด ํ‰๊ฐ€ ๋˜์—ˆ๋‹ค. ํ˜„์žฌ์˜ ์—ฐ๊ตฌ๋Š” ๋”ฅ ๋Ÿฌ๋‹์ด ์•ˆ๊ณผ ์ด๋ฏธ์ง€๋ฅผ 4๋ฐฐ ํ•ด์ƒ๋„์™€ ๊ตฌ์กฐ์ ์ธ ์„ธ๋ถ€ ํ•ญ๋ชฉ์ด ์ž˜ ๋ณด์ด๋„๋ก ๊ฐœ์„ ํ•  ์ˆ˜ ์žˆ๋‹ค๋Š” ๊ฒƒ์„ ๋ณด์—ฌ์ฃผ์—ˆ๋‹ค. ํ–ฅ์ƒ๋œ ์‹œ์‹ ๊ฒฝ์œ ๋‘ ์‚ฌ์ง„์€ ์‹œ์‹ ๊ฒฝ์˜ ๋ณ‘๋ฆฌํ•™์ ์ธ ํŠน์„ฑ์˜ ์ง„๋‹จ ์ •ํ™•๋„๋ฅผ ๋ช…ํ™•ํžˆ ํ–ฅ์ƒ์‹œํ‚จ๋‹ค. ์„ฑ๋Šฅํ‰๊ฐ€๊ฒฐ๊ณผ ํ‰๊ท  PSNR์€ 25.01 SSIM์€ 0.75 MOS๋Š” 4.33์œผ๋กœ ๋‚˜ํƒ€๋‚ฌ๋‹ค. ์„ธ๋ฒˆ์งธ ์—ฐ๊ตฌ์—์„œ๋Š” ํ™˜์ž ์ •๋ณด์™€ ์•ˆ๊ณผ ์˜์ƒ(์‹œ์‹ ๊ฒฝ์œ ๋‘ ์‚ฌ์ง„๊ณผ ๋ถ‰์€์ƒ‰์ด ์—†๋Š” ๋ง๋ง‰์‹ ๊ฒฝ์„ฌ์œ ์ธต ์‚ฌ์ง„)์„ ์ด์šฉํ•ด ๋…น๋‚ด์žฅ ์˜์‹ฌ ํ™˜์ž๋ฅผ ๋ถ„๋ณ„ํ•˜๊ณ  ๋…น๋‚ด์žฅ ์˜์‹ฌ ํ™˜์ž์˜ ๋ฐœ๋ณ‘ ์—ฐ์ˆ˜๋ฅผ ์˜ˆ์ธกํ•˜๋Š” ๋”ฅ ๋Ÿฌ๋‹ ๋ชจ๋ธ์„ ๊ฐœ๋ฐœํ•˜์˜€๋‹ค. ์ž„์ƒ ๋ฐ์ดํ„ฐ๋“ค์€ ๋…น๋‚ด์žฅ์„ ์ง„๋‹จํ•˜๊ฑฐ๋‚˜ ์˜ˆ์ธกํ•˜๋Š”๋ฐ ์œ ์šฉํ•œ ์ •๋ณด๋“ค์„ ๊ฐ€์ง€๊ณ  ์žˆ๋‹ค. ํ•˜์ง€๋งŒ, ์–ด๋–ป๊ฒŒ ๋‹ค์–‘ํ•œ ์œ ํ˜•์˜ ์ž„์ƒ์ •๋ณด๋“ค์„ ์กฐํ•ฉํ•˜๋Š” ๊ฒƒ์ด ๊ฐ๊ฐ์˜ ํ™˜์ž๋“ค์— ๋Œ€ํ•ด ์ž ์žฌ์ ์ธ ๋…น๋‚ด์žฅ์„ ์˜ˆ์ธกํ•˜๋Š”๋ฐ ์–ด๋–ค ์˜ํ–ฅ์„ ์ฃผ๋Š”์ง€์— ๋Œ€ํ•œ ์—ฐ๊ตฌ๊ฐ€ ์ง„ํ–‰ ๋œ ์ ์ด ์—†๋‹ค. ๋…น๋‚ด์žฅ ์˜ ์‹ฌ์ž ๋ถ„๋ฅ˜์™€ ๋ฐœ๋ณ‘ ๋…„ ์ˆ˜ ์˜ˆ์ธก์€ ํ•ฉ์„ฑ ๊ณฑ ์ž๋™ ์ธ์ฝ”๋”(CAE)๋ฅผ ๋น„ ์ง€๋„์  ํŠน์„ฑ ์ถ”์ถœ ๊ธฐ๋กœ ์‚ฌ์šฉํ•˜๊ณ , ๊ธฐ๊ณ„ํ•™์Šต ๋ถ„๋ฅ˜ ๊ธฐ์™€ ํšŒ๊ท€๊ธฐ๋ฅผ ํ†ตํ•ด ์ง„ํ–‰ํ•˜์˜€๋‹ค. ์„ค๊ณ„ํ•œ ๋ชจ๋ธ์€ ์ •ํ™•๋„์™€ ํ‰๊ท ์ œ๊ณฑ์˜ค์ฐจ(MSE)๋ฅผ ํ†ตํ•ด ํ‰๊ฐ€ ๋˜์—ˆ์œผ๋ฉฐ, ์ด๋ฏธ์ง€ ํŠน์ง•๊ณผ ํ™˜์ž ํŠน์ง•์€ ์กฐํ•ฉํ–ˆ์„ ๋•Œ ๋…น๋‚ด์žฅ ์˜์‹ฌ ํ™˜์ž ๋ถ„๋ฅ˜์™€ ๋ฐœ๋ณ‘ ๋…„ ์ˆ˜ ์˜ˆ์ธก์˜ ์„ฑ๋Šฅ์ด ์ด๋ฏธ์ง€ ํŠน์ง•๊ณผ ํ™˜์ž ํŠน์ง•์„ ๊ฐ๊ฐ ์ผ์„ ๋•Œ๋ณด๋‹ค ์„ฑ๋Šฅ์ด ์ข‹์•˜๋‹ค. ์ •๋‹ต๊ณผ์˜ MSE๋Š” 2.613์œผ๋กœ ๋‚˜ํƒ€๋‚ฌ๋‹ค. ๋ณธ ์—ฐ๊ตฌ์—์„œ๋Š” ๋”ฅ ๋Ÿฌ๋‹์„ ์ด์šฉํ•ด ๋…น๋‚ด์žฅ ๊ด€๋ จ ์ž„์ƒ ๋ฐ์ดํ„ฐ ์ค‘ ๋ง๋ง‰์‹ ๊ฒฝ์„ฌ์œ ์ธต, ์‹ ๊ฒฝ์ ˆ์„ธํฌ์ธต ์‚ฌ์ง„์„ ๋…น๋‚ด์žฅ ์ง„๋‹จ์— ์ด์šฉ๋˜์—ˆ๊ณ , ์‹œ์‹ ๊ฒฝ์œ ๋‘ ์‚ฌ์ง„์€ ์‹œ์‹ ๊ฒฝ์˜ ๋ณ‘๋ฆฌํ•™์ ์ธ ์ง„๋‹จ ์ •ํ™•๋„๋ฅผ ๋†’์˜€๊ณ , ํ™˜์ž ์ •๋ณด๋Š” ๋ณด๋‹ค ์ •ํ™•ํ•œ ๋…น๋‚ด์žฅ ์˜์‹ฌ ํ™˜์ž ๋ถ„๋ฅ˜์™€ ๋ฐœ๋ณ‘ ๋…„ ์ˆ˜ ์˜ˆ์ธก์— ์ด์šฉ๋˜์—ˆ๋‹ค. ํ–ฅ์ƒ๋œ ๋…น๋‚ด์žฅ ์ง„๋‹จ ์„ฑ๋Šฅ์€ ๊ธฐ์ˆ ์ ์ด๊ณ  ์ž„์ƒ์ ์ธ ์ง€ํ‘œ๋“ค์„ ํ†ตํ•ด ๊ฒ€์ฆ๋˜์—ˆ๋‹ค.This paper presents deep learning-based methods for improving glaucoma diagnosis support systems. Novel methods were applied to glaucoma clinical cases and the results were evaluated. In the first study, a deep learning classifier for glaucoma diagnosis based on spectral-domain optical coherence tomography (SD-OCT) images was proposed and evaluated. Spectral-domain optical coherence tomography (SD-OCT) is commonly employed as an imaging modality for the evaluation of glaucomatous structural damage. The classification model was developed using convolutional neural network (CNN) as a base, and was trained with SD-OCT retinal nerve fiber layer (RNFL) and macular ganglion cell-inner plexiform layer (GCIPL) images. The proposed network architecture, termed Dual-Input Convolutional Neural Network (DICNN), showed great potential as an effective classification algorithm based on two input images. DICNN was trained with both RNFL and GCIPL thickness maps that enabled it to discriminate between normal and glaucomatous eyes. The performance of the proposed DICNN was evaluated with accuracy and area under the receiver operating characteristic curve (AUC), and was compared to other methods using these metrics. Compared to other methods, the proposed DICNN model demonstrated high diagnostic ability for the discrimination of early-stage glaucoma patients in normal subjects. AUC, sensitivity and specificity was 0.869, 0.921, 0.756 respectively. In the second study, a deep-learning method for increasing the resolution and improving the legibility of Optic-disc Photography(ODP) was proposed. ODP has been proven to be useful for optic nerve evaluation in glaucoma. But in clinical practice, limited patient cooperation, small pupil or media opacities can limit the performance of ODP. A model to enhance the resolution of ODP images, termed super-resolution, was developed using Super Resolution Generative Adversarial Network(SR-GAN). To train this model, high-resolution original ODP images were transformed into two counterparts: (1) down-scaled low-resolution ODPs, and (2) compensated high-resolution ODPs with enhanced visibility of the optic disc margin and surrounding retinal vessels which were produced using a customized image post-processing algorithm. The SR-GAN was trained to learn and recognize the differences between these two counterparts. The performance of the network was evaluated using Peak Signal to Noise Ratio (PSNR), Structural Similarity (SSIM), and Mean Opinion Score (MOS). The proposed study demonstrated that deep learning can be applied to create a generative model that is capable of producing enhanced ophthalmic images with 4x resolution and with improved structural details. The proposed method can be used to enhance ODPs and thereby significantly increase the detection accuracy of optic disc pathology. The average PSNR, SSIM and MOS was 25.01, 0.75, 4.33 respectively In the third study, a deep-learning model was used to classify suspected glaucoma and to predict subsequent glaucoma onset-year in glaucoma suspects using clinical data and retinal images (ODP & Red-free Fundus RNFL Photo). Clinical data contains useful information about glaucoma diagnosis and prediction. However, no study has been undertaken to investigate how combining different types of clinical information would be helpful for predicting the subsequent course of glaucoma in an individual patient. For this study, image features extracted using Convolutional Auto Encoder (CAE) along with clinical features were used for glaucoma suspect classification and onset-year prediction. The performance of the proposed model was evaluated using accuracy and Mean Squared Error (MSE). Combing the CAE extracted image features and clinical features improved glaucoma suspect classification and on-set year prediction performance as compared to using the image features and patient features separately. The average MSE between onset-year and predicted onset year was 2.613 In this study, deep learning methodology was applied to clinical images related to glaucoma. DICNN with RNFL and GCIPL images were used for classification of glaucoma, SR-GAN with ODP images were used to increase detection accuracy of optic disc pathology, and CAE & machine learning algorithm with clinical data and retinal images was used for glaucoma suspect classification and onset-year predication. The improved glaucoma diagnosis performance was validated using both technical and clinical parameters. The proposed methods as a whole can significantly improve outcomes of glaucoma patients by early detection, prediction and enhancing detection accuracy.Contents Abstract i Contents iv List of Tables vii List of Figures viii Chapter 1 General Introduction 1 1.1 Glaucoma 1 1.2 Deep Learning for Glaucoma Diagnosis 3 1.4 Thesis Objectives 3 Chapter 2 Dual-Input Convolutional Neural Network for Glaucoma Diagnosis using Spectral-Domain Optical Coherence Tomography 6 2.1 Introduction 6 2.1.1 Background 6 2.1.2 Related Work 7 2.2 Methods 8 2.2.1 Study Design 8 2.2.2 Dataset 9 2.2.3 Dual-Input Convolutional Neural Network (DICNN) 15 2.2.4 Training Environment 18 2.2.5 Statistical Analysis 19 2.3 Results 20 2.3.1 DICNN Performance 20 2.3.1 Grad-CAM for DICNN 34 2.4 Discussion 37 2.4.1 Research Significance 37 2.4.2 Limitations 40 2.5 Conclusion 42 Chapter 3 Deep-learning-based enhanced optic-disc photography 43 3.1 Introduction 43 3.1.1 Background 43 3.1.2 Needs 44 3.1.3 Related Work 45 3.2 Methods 46 3.2.1 Study Design 46 3.2.2 Dataset 46 3.2.2.1 Details on Customized Image Post-Processing Algorithm 47 3.2.3 SR-GAN Network 50 3.2.3.1 Design of Generative Adversarial Network 50 3.2.3.2 Loss Functions 55 3.2.4 Assessment of Clinical Implications of Enhanced ODPs 58 3.2.5 Statistical Analysis 60 3.2.6 Hardware Specifications & Software Specifications 60 3.3 Results 62 3.3.1 Training Loss of Modified SR-GAN 62 3.3.2 Performance of Final Network 66 3.3.3 Clinical Validation of Enhanced ODP by MOS comparison 77 3.3.4 Comparison of DH-Detection Accuracy 79 3.4 Discussion 80 3.4.1 Research Significance 80 3.4.2 Limitations 85 3.5 Conclusion 88 Chapter 4 Deep Learning Based Prediction of Glaucoma Onset Using Retinal Image and Patient Data 89 4.1 Introduction 89 4.1.1 Background 89 4.1.2 Related Work 90 4.2 Methods 90 4.2.1 Study Design 90 4.2.2 Dataset 91 4.2.3 Design of Overall System 94 4.2.4 Design of Convolutional Auto Encoder 95 4.2.5 Glaucoma Suspect Classification 97 4.2.6 Glaucoma Onset-Year Prediction 97 4.3 Result 99 4.3.1 Performance of Designed CAE 99 4.3.2 Performance of Designed Glaucoma Suspect Classification 101 4.3.3 Performance of Designed Glaucoma Onset-Year Prediction 105 4.4 Discussion 110 4.4.1 Research Significance 110 4.4.2 Limitations 110 4.5 Conclusion 111 Chapter 5 Summary and Future Works 112 5.1 Thesis Summary 112 5.2 Limitations and Future Works 113 Bibliography 115 Abstract in Korean 127 Acknowledgement 130Docto

    Efficient Pyramid Channel Attention Network for Pathological Myopia Detection

    Full text link
    Pathological myopia (PM) is the leading ocular disease for impaired vision and blindness worldwide. The key to detecting PM as early as possible is to detect informative features in global and local lesion regions, such as fundus tessellation, atrophy and maculopathy. However, applying classical convolutional neural networks (CNNs) to efficiently highlight global and local lesion context information in feature maps is quite challenging. To tackle this issue, we aim to fully leverage the potential of global and local lesion information with attention module design. Based on this, we propose an efficient pyramid channel attention (EPCA) module, which dynamically explores the relative importance of global and local lesion context information in feature maps. Then we combine the EPCA module with the backbone network to construct EPCA-Net for automatic PM detection based on fundus images. In addition, we construct a PM dataset termed PM-fundus by collecting fundus images of PM from publicly available datasets (e.g., the PALM dataset and ODIR dataset). The comprehensive experiments are conducted on three datasets, demonstrating that our EPCA-Net outperforms state-of-the-art methods in detecting PM. Furthermore, motivated by the recent pretraining-and-finetuning paradigm, we attempt to adapt pre-trained natural image models for PM detection by freezing them and treating the EPCA module and other attention modules as the adapters. The results show that our method with the pretraining-and-finetuning paradigm achieves competitive performance through comparisons to part of methods with traditional fine-tuning methods with fewer tunable parameters.Comment: 12 page
    • โ€ฆ
    corecore