892 research outputs found

    Clinically Applicable Segmentation of Head and Neck Anatomy for Radiotherapy: Deep Learning Algorithm Development and Validation Study

    Get PDF
    BACKGROUND: Over half a million individuals are diagnosed with head and neck cancer each year globally. Radiotherapy is an important curative treatment for this disease, but it requires manual time to delineate radiosensitive organs at risk. This planning process can delay treatment while also introducing interoperator variability, resulting in downstream radiation dose differences. Although auto-segmentation algorithms offer a potentially time-saving solution, the challenges in defining, quantifying, and achieving expert performance remain. OBJECTIVE: Adopting a deep learning approach, we aim to demonstrate a 3D U-Net architecture that achieves expert-level performance in delineating 21 distinct head and neck organs at risk commonly segmented in clinical practice. METHODS: The model was trained on a data set of 663 deidentified computed tomography scans acquired in routine clinical practice and with both segmentations taken from clinical practice and segmentations created by experienced radiographers as part of this research, all in accordance with consensus organ at risk definitions. RESULTS: We demonstrated the model's clinical applicability by assessing its performance on a test set of 21 computed tomography scans from clinical practice, each with 21 organs at risk segmented by 2 independent experts. We also introduced surface Dice similarity coefficient, a new metric for the comparison of organ delineation, to quantify the deviation between organ at risk surface contours rather than volumes, better reflecting the clinical task of correcting errors in automated organ segmentations. The model's generalizability was then demonstrated on 2 distinct open-source data sets, reflecting different centers and countries to model training. CONCLUSIONS: Deep learning is an effective and clinically applicable technique for the segmentation of the head and neck anatomy for radiotherapy. With appropriate validation studies and regulatory approvals, this system could improve the efficiency, consistency, and safety of radiotherapy pathways

    IMPROVING RADIOTHERAPY WORKFLOW: EVALUATION AND IMPLEMENTATION OF DEEP LEARNING AUTO-SEGMENTATION IN A MULTI-USER ENVIRONMENT, AND DEVELOPMENT OF AUTOMATIC CONTOUR QUALITY ASSURANCE SYSTEM

    Get PDF
    Radiotherapy is a frequently used therapeutic modality for cancer patients. Accurately contouring of tumors and organs at risk (OARs) is critical for developing optimal treatment plans in radiotherapy, especially after the implementation of Intensity-modulated radiation therapy (IMRT) and Stereotactic Body Radiation Therapy (SBRT). The manual contouring process is time-consuming and suffers from inter-observer variations. However, manual contouring is often hindered by laborious clinical duties, leading to reduced effectiveness, and increased segmentation errors due to fatigue. Additionally, online adaptive radiation therapy(ART), which has been shown to benefit patient outcomes, places higher demands on contouring and quality assurance (QA) speed. Recently, deep learning auto-segmentation (DLAS) has emerged as an accurate tool for contouring in many anatomical sites. However, DLAS\u27s black-box nature has limited its widespread clinical implementation. Robust evaluations are required prior to the clinical implementation. In this thesis, we present our comprehensive validation approach for assessing the clinical acceptability of DLAS contours in the male pelvis region for automated prostate treatment planning. We then evaluated the DLAS model\u27s capacity for continuous improvement and generalizability and successfully adopted it in a multi-user environment. Additionally, we provided an implementation workflow for this software that can be used by other clinical users. Manual reviewing contour is a time-consuming process that is prone to errors and omissions, leading to dosimetric uncertainties and lower quality of radiation treatment. To assist with the manual contour review process, an automated contouring QA tool is necessary. We proposed a machine learning-based methodology for an automated contour quality assurance system that detects errors in manual contouring, using the precise DLAS contour as a reference. Moreover, we established a knowledge-based contour QA system that can localize and categorize contour errors for improved accuracy and efficiency. Overall, this dissertation provides a more comprehensive understanding of DLAS in a clinical multi-user environment, which will improve the quality and safety of the radiotherapy workflow

    Image-guided adaptive photon and proton radiotherapy

    Get PDF

    Machine learning-based detection of aberrant deep learning segmentations of target and organs at risk for prostate radiotherapy using a secondary segmentation algorithm

    Get PDF
    Objective. The output of a deep learning (DL) auto-segmentation application should be reviewed, corrected if needed and approved before being used clinically. This verification procedure is labour-intensive, time-consuming and user-dependent, which potentially leads to significant errors with impact on the overall treatment quality. Additionally, when the time needed to correct auto-segmentations approaches the time to delineate target and organs at risk from scratch, the usability of the DL model can be questioned. Therefore, an automated quality assurance framework was developed with the aim to detect in advance aberrant auto-segmentations. Approach. Five organs (prostate, bladder, anorectum, femoral head left and right) were auto-delineated on CT acquisitions for 48 prostate patients by an in-house trained primary DL model. An experienced radiation oncologist assessed the correctness of the model output and categorised the auto-segmentations into two classes whether minor or major adaptations were needed. Subsequently, an independent, secondary DL model was implemented to delineate the same structures as the primary model. Quantitative comparison metrics were calculated using both models' segmentations and used as input features for a machine learning classification model to predict the output quality of the primary model. Main results. For every organ, the approach of independent validation by the secondary model was able to detect primary auto-segmentations that needed major adaptation with high sensitivity (recall = 1) based on the calculated quantitative metrics. The surface DSC and APL were found to be the most indicated parameters in comparison to standard quantitative metrics for the time needed to adapt auto-segmentations. Significance. This proposed method includes a proof of concept for the use of an independent DL segmentation model in combination with a ML classifier to improve time saving during QA of auto-segmentations. The integration of such system into current automatic segmentation pipelines can increase the efficiency of the radiotherapy contouring workflow

    An investigation into the risk of population bias in deep learning autocontouring

    Get PDF
    Background and Purpose: To date, data used in the development of Deep Learning-based automatic contouring (DLC) algorithms have been largely sourced from single geographic populations. This study aimed to evaluate the risk of population-based bias by determining whether the performance of an autocontouring system is impacted by geographic population.Materials and methods: 80 Head Neck CT deidentified scans were collected from four clinics in Europe (n = 2) and Asia (n = 2). A single observer manually delineated 16 organs-at-risk in each. Subsequently, the data was contoured using a DLC solution, and trained using single institution (European) data. Autocontours were compared to manual delineations using quantitative measures. A Kruskal-Wallis test was used to test for any difference between populations. Clinical acceptability of automatic and manual contours to observers from each participating institution was assessed using a blinded subjective evaluation.Results: Seven organs showed a significant difference in volume between groups. Four organs showed statistical differences in quantitative similarity measures. The qualitative test showed greater variation in acceptance of contouring between observers than between data from different origins, with greater acceptance by the South Korean observers.Conclusion: Much of the statistical difference in quantitative performance could be explained by the difference in organ volume impacting the contour similarity measures and the small sample size. However, the qualitative assessment suggests that observer perception bias has a greater impact on the apparent clinical acceptability than quantitatively observed differences. This investigation of potential geographic bias should extend to more patients, populations, and anatomical regions in the future.</p
    corecore