295 research outputs found

    Expected-value bias in routine third-trimester growth scans.

    Get PDF
    OBJECTIVES: Operators performing fetal growth scans are usually aware of the gestational age of the pregnancy, which may lead to expected-value bias when performing biometric measurements. We aimed to evaluate the incidence of expected-value bias in routine fetal growth scans and assess its impact on standard biometric measurements. METHODS: We collected prospectively full-length video recordings of routine ultrasound growth scans coupled with operator eye tracking. Expected value was defined as the gestational age at the time of the scan, based on the estimated due date that was established at the dating scan. Expected-value bias was defined as occurring when the operator looked at the measurement box on the screen during the process of caliper adjustment before saving a measurement. We studied the three standard biometric planes on which measurements of head circumference (HC), abdominal circumference (AC) and femur length (FL) are obtained. We evaluated the incidence of expected-value bias and quantified the impact of biased measurements. RESULTS: We analyzed 272 third-trimester growth scans, performed by 16 operators, during which a total of 1409 measurements (354 HC, 703 AC and 352 FL; including repeat measurements) were obtained. Expected-value bias occurred in 91.4% of the saved standard biometric plane measurements (85.0% for HC, 92.9% for AC and 94.9% for FL). The operators were more likely to adjust the measurements towards the expected value than away from it (47.7% vs 19.7% of measurements; P < 0.001). On average, measurements were corrected by 2.3 ± 5.6, 2.4 ± 10.4 and 3.2 ± 10.4 days of gestation towards the expected gestational age for the HC, AC, and FL measurements, respectively. Additionally, we noted a statistically significant reduction in measurement variance once the operator was biased (P = 0.026). Comparing the lowest and highest possible estimated fetal weight (using the smallest and largest biased HC, AC and FL measurements), we noted that the discordance, in percentage terms, was 10.1% ± 6.5%, and that in 17% (95% CI, 12-21%) of the scans, the fetus could be considered as small-for-gestational age or appropriate-for-gestational age if using the smallest or largest possible measurements, respectively. Similarly, in 13% (95% CI, 9-16%) of scans, the fetus could be considered as large-for-gestational age or appropriate-for-gestational age if using the largest or smallest possible measurements, respectively. CONCLUSIONS: During routine third-trimester growth scans, expected-value bias frequently occurs and significantly changes standard biometric measurements obtained. © 2019 the Authors. Ultrasound in Obstetrics & Gynecology published by John Wiley & Sons Ltd on behalf of the International Society of Ultrasound in Obstetrics and Gynecology

    Efficient Ultrasound Image Analysis Models with Sonographer Gaze Assisted Distillation.

    Get PDF
    Recent automated medical image analysis methods have attained state-of-the-art performance but have relied on memory and compute-intensive deep learning models. Reducing model size without significant loss in performance metrics is crucial for time and memory-efficient automated image-based decision-making. Traditional deep learning based image analysis only uses expert knowledge in the form of manual annotations. Recently, there has been interest in introducing other forms of expert knowledge into deep learning architecture design. This is the approach considered in the paper where we propose to combine ultrasound video with point-of-gaze tracked for expert sonographers as they scan to train memory-efficient ultrasound image analysis models. Specifically we develop teacher-student knowledge transfer models for the exemplar task of frame classification for the fetal abdomen, head, and femur. The best performing memory-efficient models attain performance within 5% of conventional models that are 1000× larger in size

    Small for Gestational Age Babies After 37 Weeks: An Impact Study of a Risk Stratification Protocol.

    Get PDF
    OBJECTIVES: Although no clear evidence exists, many international guidelines advocate early term delivery of small for gestational age (SGA) fetuses. The aim of this study was to determine whether a protocol that included monitoring SGA fetuses beyond 37 weeks affected perinatal and maternal outcomes. METHODS: The impact of the introduction in 2014 of a protocol for management of SGA, which included risk stratification with surveillance and expectant management after 37 weeks for lower risk babies (Group 2), was compared with the previous strategy, which recommended delivery at around 37 weeks (Group 1). Data from all referred SGA babies over a 39 month period were analyzed. RESULTS: In group 1 there were 138 SGA babies; in group 2 there were 143. The mean gestation at delivery was 37 + 4 and 38 + 2 weeks respectively (p = 0.04). The incidence of neonatal composite adverse outcomes was lower in Group 2 (9% v 22% v; p < 0.01) as was neonatal NNU admission (13% v 42%; p < 0.01). Induction of labour and caesarean section rates were lower, and vaginal delivery (83% v 60%; p < 0.01) was higher in group 2. Most of the differences were due to delayed delivery of SGA babies that were stratified as low risk. CONCLUSIONS: This study suggests that protocol-based management of SGA babies may improve outcomes and that identification of moderate SGA should not alone prompt delivery. Larger numbers are required to assess any impact on perinatal mortality

    Spatio-Temporal Partitioning And Description Of Full-Length Routine Fetal Anomaly Ultrasound Scans

    Get PDF
    This paper considers automatic clinical workflow description of full-length routine fetal anomaly ultrasound scans using deep learning approaches for spatio-temporal video analysis. Multiple architectures consisting of 2D and 2D + t CNN, LSTM, and convolutional LSTM are investigated and compared. The contributions of short-term and long-term temporal changes are studied, and a multi-stream framework analysis is found to achieve the best top-l accuracy =0.77 and top-3 accuracy =0.94. Automated partitioning and characterisation on unlabelled full-length video scans show high correlation (ρ=0.95, p=0.0004) with workflow statistics of manually labelled videos, suggesting practicality of proposed methods

    Self-Supervised Ultrasound to MRI Fetal Brain Image Synthesis

    Full text link
    Fetal brain magnetic resonance imaging (MRI) offers exquisite images of the developing brain but is not suitable for second-trimester anomaly screening, for which ultrasound (US) is employed. Although expert sonographers are adept at reading US images, MR images which closely resemble anatomical images are much easier for non-experts to interpret. Thus in this paper we propose to generate MR-like images directly from clinical US images. In medical image analysis such a capability is potentially useful as well, for instance for automatic US-MRI registration and fusion. The proposed model is end-to-end trainable and self-supervised without any external annotations. Specifically, based on an assumption that the US and MRI data share a similar anatomical latent space, we first utilise a network to extract the shared latent features, which are then used for MRI synthesis. Since paired data is unavailable for our study (and rare in practice), pixel-level constraints are infeasible to apply. We instead propose to enforce the distributions to be statistically indistinguishable, by adversarial learning in both the image domain and feature space. To regularise the anatomical structures between US and MRI during synthesis, we further propose an adversarial structural constraint. A new cross-modal attention technique is proposed to utilise non-local spatial information, by encouraging multi-modal knowledge fusion and propagation. We extend the approach to consider the case where 3D auxiliary information (e.g., 3D neighbours and a 3D location index) from volumetric data is also available, and show that this improves image synthesis. The proposed approach is evaluated quantitatively and qualitatively with comparison to real fetal MR images and other approaches to synthesis, demonstrating its feasibility of synthesising realistic MR images.Comment: IEEE Transactions on Medical Imaging 202

    International standards for early fetal size and pregnancy dating based on ultrasound measurement of crown-rump length in the first trimester of pregnancy.

    Get PDF
    OBJECTIVES: There are no international standards for relating fetal crown-rump length (CRL) to gestational age (GA), and most existing charts have considerable methodological limitations. The INTERGROWTH-21(st) Project aimed to produce the first international standards for early fetal size and ultrasound dating of pregnancy based on CRL measurement. METHODS: Urban areas in eight geographically diverse countries that met strict eligibility criteria were selected for the prospective, population-based recruitment, between 9 + 0 and 13 + 6 weeks' gestation, of healthy well-nourished women with singleton pregnancies at low risk of fetal growth impairment. GA was calculated on the basis of a certain last menstrual period, regular menstrual cycle and lack of hormonal medication or breastfeeding in the preceding 2 months. CRL was measured using strict protocols and quality-control measures. All women were followed up throughout pregnancy until delivery and hospital discharge. Cases of neonatal and fetal death, severe pregnancy complications and congenital abnormalities were excluded from the study. RESULTS: A total of 4607 women were enrolled in the Fetal Growth Longitudinal Study, one of the three main components of the INTERGROWTH-21(st) Project, of whom 4321 had a live singleton birth in the absence of severe maternal conditions or congenital abnormalities detected by ultrasound or at birth. The CRL was measured in 56 women at < 9 + 0 weeks' gestation; these were excluded, resulting in 4265 women who contributed data to the final analysis. The mean CRL and SD increased with GA almost linearly, and their relationship to GA is given by the following two equations (in which GA is in days and CRL in mm): mean CRL = -50.6562 + (0.815118 × GA) + (0.00535302 × GA(2) ); and SD of CRL = -2.21626 + (0.0984894 × GA). GA estimation is carried out according to the two equations: GA = 40.9041 + (3.21585 × CRL(0.5) ) + (0.348956 × CRL); and SD of GA = 2.39102 + (0.0193474 × CRL). CONCLUSIONS: We have produced international prescriptive standards for early fetal linear size and ultrasound dating of pregnancy in the first trimester that can be used throughout the world

    Ultrasound methodology used to construct the fetal growth standards in the INTERGROWTH-21st Project

    Get PDF
    A unified protocol is essential to ensure that fetal ultrasound measurements taken in multicentre research studies are accurate and reproducible. This paper describes the methodology used to take two-dimensional, ultrasound measurements in the longitudinal, fetal growth component of the INTERGROWTH-21st Project. These standardised methods should minimise the systematic errors associated with pooling data from different study sites. They represent a model for carrying out similar research studies in the future

    Machine learning-based analysis of operator pupillary response to assess cognitive workload in clinical ultrasound imaging.

    Get PDF
    INTRODUCTION: Pupillometry, the measurement of eye pupil diameter, is a well-established and objective modality correlated with cognitive workload. In this paper, we analyse the pupillary response of ultrasound imaging operators to assess their cognitive workload, captured while they undertake routine fetal ultrasound examinations. Our experiments and analysis are performed on real-world datasets obtained using remote eye-tracking under natural clinical environmental conditions. METHODS: Our analysis pipeline involves careful temporal sequence (time-series) extraction by retrospectively matching the pupil diameter data with tasks captured in the corresponding ultrasound scan video in a multi-modal data acquisition setup. This is followed by the pupil diameter pre-processing and the calculation of pupillary response sequences. Exploratory statistical analysis of the operator pupillary responses and comparisons of the distributions between ultrasonographic tasks (fetal heart versus fetal brain) and operator expertise (newly-qualified versus experienced operators) are performed. Machine learning is explored to automatically classify the temporal sequences into the corresponding ultrasonographic tasks and operator experience using temporal, spectral, and time-frequency features with classical (shallow) models, and convolutional neural networks as deep learning models. RESULTS: Preliminary statistical analysis of the extracted pupillary response shows a significant variation for different ultrasonographic tasks and operator expertise, suggesting different extents of cognitive workload in each case, as measured by pupillometry. The best-performing machine learning models achieve receiver operating characteristic (ROC) area under curve (AUC) values of 0.98 and 0.80, for ultrasonographic task classification and operator experience classification, respectively. CONCLUSION: We conclude that we can successfully assess cognitive workload from pupil diameter changes measured while ultrasound operators perform routine scans. The machine learning allows the discrimination of the undertaken ultrasonographic tasks and scanning expertise using the pupillary response sequences as an index of the operators' cognitive workload. A high cognitive workload can reduce operator efficiency and constrain their decision-making, hence, the ability to objectively assess cognitive workload is a first step towards understanding these effects on operator performance in biomedical applications such as medical imaging
    corecore