169 research outputs found

    YONA: You Only Need One Adjacent Reference-frame for Accurate and Fast Video Polyp Detection

    Full text link
    Accurate polyp detection is essential for assisting clinical rectal cancer diagnoses. Colonoscopy videos contain richer information than still images, making them a valuable resource for deep learning methods. Great efforts have been made to conduct video polyp detection through multi-frame temporal/spatial aggregation. However, unlike common fixed-camera video, the camera-moving scene in colonoscopy videos can cause rapid video jitters, leading to unstable training for existing video detection models. Additionally, the concealed nature of some polyps and the complex background environment further hinder the performance of existing video detectors. In this paper, we propose the \textbf{YONA} (\textbf{Y}ou \textbf{O}nly \textbf{N}eed one \textbf{A}djacent Reference-frame) method, an efficient end-to-end training framework for video polyp detection. YONA fully exploits the information of one previous adjacent frame and conducts polyp detection on the current frame without multi-frame collaborations. Specifically, for the foreground, YONA adaptively aligns the current frame's channel activation patterns with its adjacent reference frames according to their foreground similarity. For the background, YONA conducts background dynamic alignment guided by inter-frame difference to eliminate the invalid features produced by drastic spatial jitters. Moreover, YONA applies cross-frame contrastive learning during training, leveraging the ground truth bounding box to improve the model's perception of polyp and background. Quantitative and qualitative experiments on three public challenging benchmarks demonstrate that our proposed YONA outperforms previous state-of-the-art competitors by a large margin in both accuracy and speed.Comment: 11 pages, 3 figures, Accepted by MICCAI202

    An Efficient Approach for Polyps Detection in Endoscopic Videos Based on Faster R-CNN

    Full text link
    Polyp has long been considered as one of the major etiologies to colorectal cancer which is a fatal disease around the world, thus early detection and recognition of polyps plays a crucial role in clinical routines. Accurate diagnoses of polyps through endoscopes operated by physicians becomes a challenging task not only due to the varying expertise of physicians, but also the inherent nature of endoscopic inspections. To facilitate this process, computer-aid techniques that emphasize fully-conventional image processing and novel machine learning enhanced approaches have been dedicatedly designed for polyp detection in endoscopic videos or images. Among all proposed algorithms, deep learning based methods take the lead in terms of multiple metrics in evolutions for algorithmic performance. In this work, a highly effective model, namely the faster region-based convolutional neural network (Faster R-CNN) is implemented for polyp detection. In comparison with the reported results of the state-of-the-art approaches on polyps detection, extensive experiments demonstrate that the Faster R-CNN achieves very competing results, and it is an efficient approach for clinical practice.Comment: 6 pages, 10 figures,2018 International Conference on Pattern Recognitio

    Colo-SCRL: Self-Supervised Contrastive Representation Learning for Colonoscopic Video Retrieval

    Full text link
    Colonoscopic video retrieval, which is a critical part of polyp treatment, has great clinical significance for the prevention and treatment of colorectal cancer. However, retrieval models trained on action recognition datasets usually produce unsatisfactory retrieval results on colonoscopic datasets due to the large domain gap between them. To seek a solution to this problem, we construct a large-scale colonoscopic dataset named Colo-Pair for medical practice. Based on this dataset, a simple yet effective training method called Colo-SCRL is proposed for more robust representation learning. It aims to refine general knowledge from colonoscopies through masked autoencoder-based reconstruction and momentum contrast to improve retrieval performance. To the best of our knowledge, this is the first attempt to employ the contrastive learning paradigm for medical video retrieval. Empirical results show that our method significantly outperforms current state-of-the-art methods in the colonoscopic video retrieval task.Comment: Accepted by ICME 202

    μž„μƒμˆ κΈ° ν–₯상을 μœ„ν•œ λ”₯λŸ¬λ‹ 기법 연ꡬ: λŒ€μž₯λ‚΄μ‹œκ²½ 진단 및 λ‘œλ΄‡μˆ˜μˆ  술기 평가에 적용

    Get PDF
    ν•™μœ„λ…Όλ¬Έ (박사) -- μ„œμšΈλŒ€ν•™κ΅ λŒ€ν•™μ› : κ³΅κ³ΌλŒ€ν•™ ν˜‘λ™κ³Όμ • μ˜μš©μƒμ²΄κ³΅ν•™μ „κ³΅, 2020. 8. 김희찬.This paper presents deep learning-based methods for improving performance of clinicians. Novel methods were applied to the following two clinical cases and the results were evaluated. In the first study, a deep learning-based polyp classification algorithm for improving clinical performance of endoscopist during colonoscopy diagnosis was developed. Colonoscopy is the main method for diagnosing adenomatous polyp, which can multiply into a colorectal cancer and hyperplastic polyps. The classification algorithm was developed using convolutional neural network (CNN), trained with colorectal polyp images taken by a narrow-band imaging colonoscopy. The proposed method is built around an automatic machine learning (AutoML) which searches for the optimal architecture of CNN for colorectal polyp image classification and trains the weights of the architecture. In addition, gradient-weighted class activation mapping technique was used to overlay the probabilistic basis of the prediction result on the polyp location to aid the endoscopists visually. To verify the improvement in diagnostic performance, the efficacy of endoscopists with varying proficiency levels were compared with or without the aid of the proposed polyp classification algorithm. The results confirmed that, on average, diagnostic accuracy was improved and diagnosis time was shortened in all proficiency groups significantly. In the second study, a surgical instruments tracking algorithm for robotic surgery video was developed, and a model for quantitatively evaluating the surgeons surgical skill based on the acquired motion information of the surgical instruments was proposed. The movement of surgical instruments is the main component of evaluation for surgical skill. Therefore, the focus of this study was develop an automatic surgical instruments tracking algorithm, and to overcome the limitations presented by previous methods. The instance segmentation framework was developed to solve the instrument occlusion issue, and a tracking framework composed of a tracker and a re-identification algorithm was developed to maintain the type of surgical instruments being tracked in the video. In addition, algorithms for detecting the tip position of instruments and arm-indicator were developed to acquire the movement of devices specialized for the robotic surgery video. The performance of the proposed method was evaluated by measuring the difference between the predicted tip position and the ground truth position of the instruments using root mean square error, area under the curve, and Pearsons correlation analysis. Furthermore, motion metrics were calculated from the movement of surgical instruments, and a machine learning-based robotic surgical skill evaluation model was developed based on these metrics. These models were used to evaluate clinicians, and results were similar in the developed evaluation models, the Objective Structured Assessment of Technical Skill (OSATS), and the Global Evaluative Assessment of Robotic Surgery (GEARS) evaluation methods. In this study, deep learning technology was applied to colorectal polyp images for a polyp classification, and to robotic surgery videos for surgical instruments tracking. The improvement in clinical performance with the aid of these methods were evaluated and verified.λ³Έ 논문은 μ˜λ£Œμ§„μ˜ μž„μƒμˆ κΈ° λŠ₯λ ₯을 ν–₯μƒμ‹œν‚€κΈ° μœ„ν•˜μ—¬ μƒˆλ‘œμš΄ λ”₯λŸ¬λ‹ 기법듀을 μ œμ•ˆν•˜κ³  λ‹€μŒ 두 가지 싀둀에 λŒ€ν•΄ μ μš©ν•˜μ—¬ κ·Έ κ²°κ³Όλ₯Ό ν‰κ°€ν•˜μ˜€λ‹€. 첫 번째 μ—°κ΅¬μ—μ„œλŠ” λŒ€μž₯λ‚΄μ‹œκ²½μœΌλ‘œ κ΄‘ν•™ 진단 μ‹œ, λ‚΄μ‹œκ²½ μ „λ¬Έμ˜μ˜ 진단 λŠ₯λ ₯을 ν–₯μƒμ‹œν‚€κΈ° μœ„ν•˜μ—¬ λ”₯λŸ¬λ‹ 기반의 μš©μ’… λΆ„λ₯˜ μ•Œκ³ λ¦¬μ¦˜μ„ κ°œλ°œν•˜κ³ , λ‚΄μ‹œκ²½ μ „λ¬Έμ˜μ˜ 진단 λŠ₯λ ₯ ν–₯상 μ—¬λΆ€λ₯Ό κ²€μ¦ν•˜κ³ μž ν•˜μ˜€λ‹€. λŒ€μž₯λ‚΄μ‹œκ²½ κ²€μ‚¬λ‘œ μ•”μ’…μœΌλ‘œ 증식할 수 μžˆλŠ” μ„ μ’…κ³Ό 과증식성 μš©μ’…μ„ μ§„λ‹¨ν•˜λŠ” 것은 μ€‘μš”ν•˜λ‹€. λ³Έ μ—°κ΅¬μ—μ„œλŠ” ν˜‘λŒ€μ—­ μ˜μƒ λ‚΄μ‹œκ²½μœΌλ‘œ μ΄¬μ˜ν•œ λŒ€μž₯ μš©μ’… μ˜μƒμœΌλ‘œ ν•©μ„±κ³± 신경망을 ν•™μŠ΅ν•˜μ—¬ λΆ„λ₯˜ μ•Œκ³ λ¦¬μ¦˜μ„ κ°œλ°œν•˜μ˜€λ‹€. μ œμ•ˆν•˜λŠ” μ•Œκ³ λ¦¬μ¦˜μ€ μžλ™ κΈ°κ³„ν•™μŠ΅ (AutoML) λ°©λ²•μœΌλ‘œ, λŒ€μž₯ μš©μ’… μ˜μƒμ— μ΅œμ ν™”λœ ν•©μ„±κ³± 신경망 ꡬ쑰λ₯Ό μ°Ύκ³  μ‹ κ²½λ§μ˜ κ°€μ€‘μΉ˜λ₯Ό ν•™μŠ΅ν•˜μ˜€λ‹€. λ˜ν•œ 기울기-κ°€μ€‘μΉ˜ 클래슀 ν™œμ„±ν™” 맡핑 기법을 μ΄μš©ν•˜μ—¬ κ°œλ°œν•œ ν•©μ„±κ³± 신경망 결과의 ν™•λ₯ μ  κ·Όκ±°λ₯Ό μš©μ’… μœ„μΉ˜μ— μ‹œκ°μ μœΌλ‘œ λ‚˜νƒ€λ‚˜λ„λ‘ ν•¨μœΌλ‘œ λ‚΄μ‹œκ²½ μ „λ¬Έμ˜μ˜ 진단을 돕도둝 ν•˜μ˜€λ‹€. λ§ˆμ§€λ§‰μœΌλ‘œ, μˆ™λ ¨λ„ κ·Έλ£Ήλ³„λ‘œ λ‚΄μ‹œκ²½ μ „λ¬Έμ˜κ°€ μš©μ’… λΆ„λ₯˜ μ•Œκ³ λ¦¬μ¦˜μ˜ κ²°κ³Όλ₯Ό μ°Έκ³ ν•˜μ˜€μ„ λ•Œ 진단 λŠ₯λ ₯이 ν–₯μƒλ˜μ—ˆλŠ”μ§€ 비ꡐ μ‹€ν—˜μ„ μ§„ν–‰ν•˜μ˜€κ³ , λͺ¨λ“  κ·Έλ£Ήμ—μ„œ μœ μ˜λ―Έν•˜κ²Œ 진단 정확도가 ν–₯μƒλ˜κ³  진단 μ‹œκ°„μ΄ λ‹¨μΆ•λ˜μ—ˆμŒμ„ ν™•μΈν•˜μ˜€λ‹€. 두 번째 μ—°κ΅¬μ—μ„œλŠ” λ‘œλ΄‡μˆ˜μˆ  λ™μ˜μƒμ—μ„œ μˆ˜μˆ λ„κ΅¬ μœ„μΉ˜ 좔적 μ•Œκ³ λ¦¬μ¦˜μ„ κ°œλ°œν•˜κ³ , νšλ“ν•œ μˆ˜μˆ λ„κ΅¬μ˜ μ›€μ§μž„ 정보λ₯Ό λ°”νƒ•μœΌλ‘œ 수술자의 μˆ™λ ¨λ„λ₯Ό μ •λŸ‰μ μœΌλ‘œ ν‰κ°€ν•˜λŠ” λͺ¨λΈμ„ μ œμ•ˆν•˜μ˜€λ‹€. μˆ˜μˆ λ„κ΅¬μ˜ μ›€μ§μž„μ€ 수술자의 λ‘œλ΄‡μˆ˜μˆ  μˆ™λ ¨λ„λ₯Ό ν‰κ°€ν•˜κΈ° μœ„ν•œ μ£Όμš”ν•œ 정보이닀. λ”°λΌμ„œ λ³Έ μ—°κ΅¬λŠ” λ”₯λŸ¬λ‹ 기반의 μžλ™ μˆ˜μˆ λ„κ΅¬ 좔적 μ•Œκ³ λ¦¬μ¦˜μ„ κ°œλ°œν•˜μ˜€μœΌλ©°, λ‹€μŒ 두가지 μ„ ν–‰μ—°κ΅¬μ˜ ν•œκ³„μ μ„ κ·Ήλ³΅ν•˜μ˜€λ‹€. μΈμŠ€ν„΄μŠ€ λΆ„ν•  (Instance Segmentation) ν”„λ ˆμž„μ›μ„ κ°œλ°œν•˜μ—¬ 폐색 (Occlusion) 문제λ₯Ό ν•΄κ²°ν•˜μ˜€κ³ , 좔적기 (Tracker)와 μž¬μ‹λ³„ν™” (Re-Identification) μ•Œκ³ λ¦¬μ¦˜μœΌλ‘œ κ΅¬μ„±λœ 좔적 ν”„λ ˆμž„μ›μ„ κ°œλ°œν•˜μ—¬ λ™μ˜μƒμ—μ„œ μΆ”μ ν•˜λŠ” μˆ˜μˆ λ„κ΅¬μ˜ μ’…λ₯˜κ°€ μœ μ§€λ˜λ„λ‘ ν•˜μ˜€λ‹€. λ˜ν•œ λ‘œλ΄‡μˆ˜μˆ  λ™μ˜μƒμ˜ νŠΉμˆ˜μ„±μ„ κ³ λ €ν•˜μ—¬ μˆ˜μˆ λ„κ΅¬μ˜ μ›€μ§μž„μ„ νšλ“ν•˜κΈ°μœ„ν•΄ μˆ˜μˆ λ„κ΅¬ 끝 μœ„μΉ˜μ™€ λ‘œλ΄‡ νŒ”-인디케이터 (Arm-Indicator) 인식 μ•Œκ³ λ¦¬μ¦˜μ„ κ°œλ°œν•˜μ˜€λ‹€. μ œμ•ˆν•˜λŠ” μ•Œκ³ λ¦¬μ¦˜μ˜ μ„±λŠ₯은 μ˜ˆμΈ‘ν•œ μˆ˜μˆ λ„κ΅¬ 끝 μœ„μΉ˜μ™€ μ •λ‹΅ μœ„μΉ˜ κ°„μ˜ 평균 제곱근 였차, 곑선 μ•„λž˜ 면적, ν”Όμ–΄μŠ¨ μƒκ΄€λΆ„μ„μœΌλ‘œ ν‰κ°€ν•˜μ˜€λ‹€. λ§ˆμ§€λ§‰μœΌλ‘œ, μˆ˜μˆ λ„κ΅¬μ˜ μ›€μ§μž„μœΌλ‘œλΆ€ν„° μ›€μ§μž„ μ§€ν‘œλ₯Ό κ³„μ‚°ν•˜κ³  이λ₯Ό λ°”νƒ•μœΌλ‘œ κΈ°κ³„ν•™μŠ΅ 기반의 λ‘œλ΄‡μˆ˜μˆ  μˆ™λ ¨λ„ 평가 λͺ¨λΈμ„ κ°œλ°œν•˜μ˜€λ‹€. κ°œλ°œν•œ 평가 λͺ¨λΈμ€ 기쑴의 Objective Structured Assessment of Technical Skill (OSATS), Global Evaluative Assessment of Robotic Surgery (GEARS) 평가 방법과 μœ μ‚¬ν•œ μ„±λŠ₯을 λ³΄μž„μ„ ν™•μΈν•˜μ˜€λ‹€. λ³Έ 논문은 μ˜λ£Œμ§„μ˜ μž„μƒμˆ κΈ° λŠ₯λ ₯을 ν–₯μƒμ‹œν‚€κΈ° μœ„ν•˜μ—¬ λŒ€μž₯ μš©μ’… μ˜μƒκ³Ό λ‘œλ΄‡μˆ˜μˆ  λ™μ˜μƒμ— λ”₯λŸ¬λ‹ κΈ°μˆ μ„ μ μš©ν•˜κ³  κ·Έ μœ νš¨μ„±μ„ ν™•μΈν•˜μ˜€μœΌλ©°, ν–₯후에 μ œμ•ˆν•˜λŠ” 방법이 μž„μƒμ—μ„œ μ‚¬μš©λ˜κ³  μžˆλŠ” 진단 및 평가 λ°©λ²•μ˜ λŒ€μ•ˆμ΄ 될 κ²ƒμœΌλ‘œ κΈ°λŒ€ν•œλ‹€.Chapter 1 General Introduction 1 1.1 Deep Learning for Medical Image Analysis 1 1.2 Deep Learning for Colonoscipic Diagnosis 2 1.3 Deep Learning for Robotic Surgical Skill Assessment 3 1.4 Thesis Objectives 5 Chapter 2 Optical Diagnosis of Colorectal Polyps using Deep Learning with Visual Explanations 7 2.1 Introduction 7 2.1.1 Background 7 2.1.2 Needs 8 2.1.3 Related Work 9 2.2 Methods 11 2.2.1 Study Design 11 2.2.2 Dataset 14 2.2.3 Preprocessing 17 2.2.4 Convolutional Neural Networks (CNN) 21 2.2.4.1 Standard CNN 21 2.2.4.2 Search for CNN Architecture 22 2.2.4.3 Searched CNN Training 23 2.2.4.4 Visual Explanation 24 2.2.5 Evaluation of CNN and Endoscopist Performances 25 2.3 Experiments and Results 27 2.3.1 CNN Performance 27 2.3.2 Results of Visual Explanation 31 2.3.3 Endoscopist with CNN Performance 33 2.4 Discussion 45 2.4.1 Research Significance 45 2.4.2 Limitations 47 2.5 Conclusion 49 Chapter 3 Surgical Skill Assessment during Robotic Surgery by Deep Learning-based Surgical Instrument Tracking 50 3.1 Introduction 50 3.1.1 Background 50 3.1.2 Needs 51 3.1.3 Related Work 52 3.2 Methods 56 3.2.1 Study Design 56 3.2.2 Dataset 59 3.2.3 Instance Segmentation Framework 63 3.2.4 Tracking Framework 66 3.2.4.1 Tracker 66 3.2.4.2 Re-identification 68 3.2.5 Surgical Instrument Tip Detection 69 3.2.6 Arm-Indicator Recognition 71 3.2.7 Surgical Skill Prediction Model 71 3.3 Experiments and Results 78 3.3.1 Performance of Instance Segmentation Framework 78 3.3.2 Performance of Tracking Framework 82 3.3.3 Evaluation of Surgical Instruments Trajectory 83 3.3.4 Evaluation of Surgical Skill Prediction Model 86 3.4 Discussion 90 3.4.1 Research Significance 90 3.4.2 Limitations 92 3.5 Conclusion 96 Chapter 4 Summary and Future Works 97 4.1 Thesis Summary 97 4.2 Limitations and Future Works 98 Bibliography 100 Abstract in Korean 116 Acknowledgement 119Docto

    Endoscopic Polyp Segmentation Using a Hybrid 2D/3D CNN

    Get PDF
    Colonoscopy is the gold standard for early diagnosis and pre-emptive treatment of colorectal cancer by detecting and removing colonic polyps. Deep learning approaches to polyp detection have shown potential for enhancing polyp detection rates. However, the majority of these systems are developed and evaluated on static images from colonoscopies, whilst applied treatment is performed on a real-time video feed. Non-curated video data includes a high proportion of low-quality frames in comparison to selected images but also embeds temporal information that can be used for more stable predictions. To exploit this, a hybrid 2D/3D convolutional neural network architecture is presented. The network is used to improve polyp detection by encompassing spatial and temporal correlation of the predictions while preserving real-time detections. Extensive experiments show that the hybrid method outperforms a 2D baseline. The proposed architecture is validated on videos from 46 patients. The results show that real-world clinical implementations of automated polyp detection can benefit from the hybrid algorithm

    Enhancing endoscopic navigation and polyp detection using artificial intelligence

    Get PDF
    Colorectal cancer (CRC) is one most common and deadly forms of cancer. It has a very high mortality rate if the disease advances to late stages however early diagnosis and treatment can be curative is hence essential to enhancing disease management. Colonoscopy is considered the gold standard for CRC screening and early therapeutic treatment. The effectiveness of colonoscopy is highly dependent on the operator’s skill, as a high level of hand-eye coordination is required to control the endoscope and fully examine the colon wall. Because of this, detection rates can vary between different gastroenterologists and technology have been proposed as solutions to assist disease detection and standardise detection rates. This thesis focuses on developing artificial intelligence algorithms to assist gastroenterologists during colonoscopy with the potential to ensure a baseline standard of quality in CRC screening. To achieve such assistance, the technical contributions develop deep learning methods and architectures for automated endoscopic image analysis to address both the detection of lesions in the endoscopic image and the 3D mapping of the endoluminal environment. The proposed detection models can run in real-time and assist visualization of different polyp types. Meanwhile the 3D reconstruction and mapping models developed are the basis for ensuring that the entire colon has been examined appropriately and to support quantitative measurement of polyp sizes using the image during a procedure. Results and validation studies presented within the thesis demonstrate how the developed algorithms perform on both general scenes and on clinical data. The feasibility of clinical translation is demonstrated for all of the models on endoscopic data from human participants during CRC screening examinations

    An objective validation of polyp and instrument segmentation methods in colonoscopy through Medico 2020 polyp segmentation and MedAI 2021 transparency challenges

    Full text link
    Automatic analysis of colonoscopy images has been an active field of research motivated by the importance of early detection of precancerous polyps. However, detecting polyps during the live examination can be challenging due to various factors such as variation of skills and experience among the endoscopists, lack of attentiveness, and fatigue leading to a high polyp miss-rate. Deep learning has emerged as a promising solution to this challenge as it can assist endoscopists in detecting and classifying overlooked polyps and abnormalities in real time. In addition to the algorithm's accuracy, transparency and interpretability are crucial to explaining the whys and hows of the algorithm's prediction. Further, most algorithms are developed in private data, closed source, or proprietary software, and methods lack reproducibility. Therefore, to promote the development of efficient and transparent methods, we have organized the "Medico automatic polyp segmentation (Medico 2020)" and "MedAI: Transparency in Medical Image Segmentation (MedAI 2021)" competitions. We present a comprehensive summary and analyze each contribution, highlight the strength of the best-performing methods, and discuss the possibility of clinical translations of such methods into the clinic. For the transparency task, a multi-disciplinary team, including expert gastroenterologists, accessed each submission and evaluated the team based on open-source practices, failure case analysis, ablation studies, usability and understandability of evaluations to gain a deeper understanding of the models' credibility for clinical deployment. Through the comprehensive analysis of the challenge, we not only highlight the advancements in polyp and surgical instrument segmentation but also encourage qualitative evaluation for building more transparent and understandable AI-based colonoscopy systems

    Spatio-temporal classification for polyp diagnosis

    Get PDF
    Colonoscopy remains the gold standard investigation for colorectal cancer screening as it offers the opportunity to both detect and resect pre-cancerous polyps. Computer-aided polyp characterisation can determine which polyps need polypectomy and recent deep learning-based approaches have shown promising results as clinical decision support tools. Yet polyp appearance during a procedure can vary, making automatic predictions unstable. In this paper, we investigate the use of spatio-temporal information to improve the performance of lesions classification as adenoma or non-adenoma. Two methods are implemented showing an increase in performance and robustness during extensive experiments both on internal and openly available benchmark datasets
    • …
    corecore