92 research outputs found
The predictive value of the NICE "red traffic lights" in acutely ill children
Objective: Early recognition and treatment of febrile children with serious infections (SI) improves prognosis, however, early detection can be difficult. We aimed to validate the predictive rule-in value of the National Institute for Health and Clinical Excellence (NICE) most severe alarming signs or symptoms to identify SI in children. Design, Setting and Participants: The 16 most severe ("red") features of the NICE traffic light system were validated in seven different primary care and emergency department settings, including 6,260 children presenting with acute illness. Main Outcome Measures: We focussed on the individual predictive value of single red features for SI and their combinations. Results were presented as positive likelihood ratios, sensitivities and specificities. We categorised "general" and "disease-specific" red features. Changes in pre-test probability versus post-test probability for SI were visualised in Fagan nomograms. Results: Almost all red features had rule-in value for SI, but only four individual red features substantially raised the probability of SI in more than one dataset: "does not wake/stay awake", "reduced skin turgor", "non-blanching rash", and "focal neurological signs". The presence of ≥3 red features improved prediction of SI but still lacked strong rule-in value as likelihood ratios were below 5. Conclusions: The rule-in value of the most severe alarming signs or symptoms of the NICE traffic light system for identifying children with SI was limited, even when multiple red features were present. Our study highlights the importance of assessing the predictive value of alarming signs in clinical guidelines prior to widespread implementation in routine practice
Diagnostic accuracy of blood tests of inflammation in paediatric appendicitis: a systematic review and meta-analysis
Objective: Possible childhood appendicitis is a common emergency presentation. The exact value of blood tests is debated. This study sought to determine the diagnostic accuracy of four blood tests (white cell count (WCC), neutrophil(count or percentage), C reactive protein (CRP) and/or procalcitonin) for childhood appendicitis.
Design: A systematic review and diagnostic meta-analysis. Data sources included MEDLINE, EMBASE, Central, Web of Science searched from inception-March 2022 with reference searching and authors contacted for missing/unclear data. Eligibility criteria was studies reporting the diagnostic accuracy of the four blood tests compared to the reference standard (histology or follow-up). Risk of bias was assessed (QUADAS-2), pooled sensitivity and specificity were generated for each test and commonly presented cut-offs. To provide insight into clinical impact, we present strategies using a hypothetical cohort.
Results: 67 studies were included (34 839 children, 13 342 with appendicitis), all in the hospital setting. The most sensitive tests were WCC (≥10 000 cells/µL, 53 studies sensitivity 0.85 (95% CI 0.80 to 0.89)) and absolute neutrophil count (ANC) (≥7500 cells/µL, five studies sensitivity 0.90 (95% CI 0.85 to 0.94)). Combination of WCC or CRP increased sensitivity further(≥10 000 cells/µL or ≥10 mg/L, individual patient data (IPD) of 6 studies, 0.97 (95% CI 0.93 to 0.99)).
Applying results to a hypothetical cohort(1000 children with appendicitis symptoms, of whom 400 have appendicitis) 60 and 40 children would be wrongly discharged based solely on WCC and ANC, respectively, 12 with combination of WCC or CRP.
The most specific tests were CRP alone (≥50 mg/L, 38 studies, specificity 0.87 (95% CI 0.80 to 0.91)) or combined with WCC (≥10 000 cells/µL and ≥50 mg/L, IPD of six studies, 0.93 (95% CI 0.91 to 0.95)).
Conclusions: The best performing single blood tests for ruling-out paediatric appendicitis are WCC or ANC; with accuracy improved combining WCC and CRP. These tests could be used at the point of care in combination with clinical prediction rules. We provide insight into the best cut-offs for clinical application.
PROSPERO registration number: CRD4201708003
Variation in sensitivity and specificity of diverse diagnostic tests across health-care settings:a meta-epidemiological study
Objectives: Diagnostic test accuracy (DTA) may vary among health-care settings, which among other reasons may be due to referral from primary to secondary care. The true magnitude and direction of any difference is not certain. We analyzed the results of meta-analyses of DTA to compare sensitivity and specificity between patients in nonreferred and referred care settings.Study Design and Setting: We systematically searched EBSCOhost MEDLINE for systematic reviews that included at least ten original studies of the same diagnostic test, with at least three studies each performed in nonreferred and referred care. Random-effects models, with setting as a binary covariate, were used to calculate pooled sensitivity and specificity estimates per test. Sensitivity analyses were conducted limiting the analyses to studies from countries with gatekeeping systems only.Results: In total, nine systematic reviews evaluating thirteen diagnostic tests were included. For signs and symptoms (seven tests), the differences in sensitivity and specificity ranged from +0.03 to +0.30 and from −0.12 to +0.03, respectively; for biomarkers (four tests) differences in sensitivity ranged from −0.11 to +0.21 and specificity from −0.01 to −0.19. Differences in sensitivity and specificity for one questionnaire test were +0.1 and −0.07 respectively and for one imaging test were −0.22 and −0.07. Sensitivity analyses limited to countries with gatekeeping health care systems produced similar results.Conclusion: Sensitivity and specificity vary in both direction and magnitude between nonreferred and referred settings, depending on the test and target condition, with no universal patterns governing performance differences.Plain Language Summary: Doctors use diagnostic tests to help assess the likelihood if a patient has a certain condition. However, the accuracy of these tests may vary depending on where they are used—such as in primary care (where patients first seek help) or in specialist care (after being referred by a doctor). We wanted to find out how much test accuracy changes between these settings. To do this, we analyzed previous studies that reviewed the accuracy of different diagnostic tests. We compared how well these tests worked in patients who had not yet been referred to a specialist vs those who had. Our analysis included results from thirteen different diagnostic tests, covering symptoms, biomarkers (such as blood tests), a questionnaire, and an imaging test. We found that test accuracy varied depending on the type of test and the condition being diagnosed. Some tests had higher sensitivity (correctly identifying patients with the disease) or specificity (correctly identifying healthy individuals) in primary care, while in specialist care, the same test could perform better, worse, or similarly. There was no clear pattern that applied to all tests. This suggests that researchers should consider how test accuracy may differ across health-care settings when conducting and interpreting diagnostic test accuracy studies.</p
Accuracy of routine laboratory tests to predict mortality and deterioration to severe or critical COVID-19 in people with SARS-CoV-2
Background: Identifying patients with COVID-19 disease who will deteriorate can be useful to assess whether they should receive intensive care, or whether they can be treated in a less intensive way or through outpatient care. In clinical care, routine laboratory markers, such as C-reactive protein, are used to assess a person's health status. Objectives: To assess the accuracy of routine blood-based laboratory tests to predict mortality and deterioration to severe or critical (from mild or moderate) COVID-19 in people with SARS-CoV-2. Search methods: On 25 August 2022, we searched the Cochrane COVID-19 Study Register, encompassing searches of various databases such as MEDLINE via PubMed, CENTRAL, Embase, medRxiv, and ClinicalTrials.gov. We did not apply any language restrictions. Selection criteria: We included studies of all designs that produced estimates of prognostic accuracy in participants who presented to outpatient services, or were admitted to general hospital wards with confirmed SARS-CoV-2 infection, and studies that were based on serum banks of samples from people. All routine blood-based laboratory tests performed during the first encounter were included. We included any reference standard used to define deterioration to severe or critical disease that was provided by the authors. Data collection and analysis: Two review authors independently extracted data from each included study, and independently assessed the methodological quality using the Quality Assessment of Prognostic Accuracy Studies tool. As studies reported different thresholds for the same test, we used the Hierarchical Summary Receiver Operator Curve model for meta-analyses to estimate summary curves in SAS 9.4. We estimated the sensitivity at points on the SROC curves that corresponded to the median and interquartile range boundaries of specificities in the included studies. Direct and indirect comparisons were exclusively conducted for biomarkers with an estimated sensitivity and 95% CI of ≥ 50% at a specificity of ≥ 50%. The relative diagnostic odds ratio was calculated as a summary of the relative accuracy of these biomarkers. Main results: We identified a total of 64 studies, including 71,170 participants, of which 8169 participants died, and 4031 participants deteriorated to severe/critical condition. The studies assessed 53 different laboratory tests. For some tests, both increases and decreases relative to the normal range were included. There was important heterogeneity between tests and their cut-off values. None of the included studies had a low risk of bias or low concern for applicability for all domains. None of the tests included in this review demonstrated high sensitivity or specificity, or both. The five tests with summary sensitivity and specificity above 50% were: C-reactive protein increase, neutrophil-to-lymphocyte ratio increase, lymphocyte count decrease, d-dimer increase, and lactate dehydrogenase increase. Inflammation. For mortality, summary sensitivity of a C-reactive protein increase was 76% (95% CI 73% to 79%) at median specificity, 59% (low-certainty evidence). For deterioration, summary sensitivity was 78% (95% CI 67% to 86%) at median specificity, 72% (very low-certainty evidence). For the combined outcome of mortality or deterioration, or both, summary sensitivity was 70% (95% CI 49% to 85%) at median specificity, 60% (very low-certainty evidence).
For mortality, summary sensitivity of an increase in neutrophil-to-lymphocyte ratio was 69% (95% CI 66% to 72%) at median specificity, 63% (very low-certainty evidence). For deterioration, summary sensitivity was 75% (95% CI 59% to 87%) at median specificity, 71% (very low-certainty evidence).
For mortality, summary sensitivity of a decrease in lymphocyte count was 67% (95% CI 56% to 77%) at median specificity, 61% (very low-certainty evidence). For deterioration, summary sensitivity of a decrease in lymphocyte count was 69% (95% CI 60% to 76%) at median specificity, 67% (very low-certainty evidence). For the combined outcome, summary sensitivity was 83% (95% CI 67% to 92%) at median specificity, 29% (very low-certainty evidence).
For mortality, summary sensitivity of a lactate dehydrogenase increase was 82% (95% CI 66% to 91%) at median specificity, 60% (very low-certainty evidence). For deterioration, summary sensitivity of a lactate dehydrogenase increase was 79% (95% CI 76% to 82%) at median specificity, 66% (low-certainty evidence). For the combined outcome, summary sensitivity was 69% (95% CI 51% to 82%) at median specificity, 62% (very low-certainty evidence). Hypercoagulability. For mortality, summary sensitivity of a d-dimer increase was 70% (95% CI 64% to 76%) at median specificity of 56% (very low-certainty evidence). For deterioration, summary sensitivity was 65% (95% CI 56% to 74%) at median specificity of 63% (very low-certainty evidence). For the combined outcome, summary sensitivity was 65% (95% CI 52% to 76%) at median specificity of 54% (very low-certainty evidence). To predict mortality, neutrophil-to-lymphocyte ratio increase had higher accuracy compared to d-dimer increase (RDOR (diagnostic Odds Ratio) 2.05, 95% CI 1.30 to 3.24), C-reactive protein increase (RDOR 2.64, 95% CI 2.09 to 3.33), and lymphocyte count decrease (RDOR 2.63, 95% CI 1.55 to 4.46). D-dimer increase had higher accuracy compared to lymphocyte count decrease (RDOR 1.49, 95% CI 1.23 to 1.80), C-reactive protein increase (RDOR 1.31, 95% CI 1.03 to 1.65), and lactate dehydrogenase increase (RDOR 1.42, 95% CI 1.05 to 1.90). Additionally, lactate dehydrogenase increase had higher accuracy compared to lymphocyte count decrease (RDOR 1.30, 95% CI 1.13 to 1.49). To predict deterioration to severe disease, C-reactive protein increase had higher accuracy compared to d-dimer increase (RDOR 1.76, 95% CI 1.25 to 2.50). The neutrophil-to-lymphocyte ratio increase had higher accuracy compared to d-dimer increase (RDOR 2.77, 95% CI 1.58 to 4.84). Lastly, lymphocyte count decrease had higher accuracy compared to d-dimer increase (RDOR 2.10, 95% CI 1.44 to 3.07) and lactate dehydrogenase increase (RDOR 2.22, 95% CI 1.52 to 3.26). Authors' conclusions: Laboratory tests, associated with hypercoagulability and hyperinflammatory response, were better at predicting severe disease and mortality in patients with SARS-CoV-2 compared to other laboratory tests. However, to safely rule out severe disease, tests should have high sensitivity (> 90%), and none of the identified laboratory tests met this criterion. In clinical practice, a more comprehensive assessment of a patient's health status is usually required by, for example, incorporating these laboratory tests into clinical prediction rules together with clinical symptoms, radiological findings, and patient's characteristics.</p
Erratum to: Methods for evaluating medical tests and biomarkers
[This corrects the article DOI: 10.1186/s41512-016-0001-y.]
Accuracy of routine laboratory tests to predict mortality and deterioration to severe or critical COVID-19 in people with SARS-CoV-2
Objectives This is a protocol for a Cochrane Review (diagnostic). The objectives are as follows: To assess the accuracy of routine blood-based laboratory tests to predict mortality and deterioration to severe or critical (from mild or moderate) COVID-19 in people with SARS-CoV-2 infection. Secondary objectives Where data are available, we will investigate whether prognostic accuracy varies according to a specific measurement or test, reference standard, timing of outcome verification, sample type, study design, and setting, including prevalence of the target condition (either by stratified analysis or meta-regression)
Rapid, point‐of‐care antigen and molecular‐based tests for diagnosis of SARS‐CoV‐2 infection
BackgroundAccurate rapid diagnostic tests for SARS‐CoV‐2 infection could contribute to clinical and public health strategies to manage the COVID‐19 pandemic. Point‐of‐care antigen and molecular tests to detect current infection could increase access to testing and early confirmation of cases, and expediate clinical and public health management decisions that may reduce transmission.ObjectivesTo assess the diagnostic accuracy of point‐of‐care antigen and molecular‐based tests for diagnosis of SARS‐CoV‐2 infection. We consider accuracy separately in symptomatic and asymptomatic population groups.Search methodsElectronic searches of the Cochrane COVID‐19 Study Register and the COVID‐19 Living Evidence Database from the University of Bern (which includes daily updates from PubMed and Embase and preprints from medRxiv and bioRxiv) were undertaken on 30 Sept 2020. We checked repositories of COVID‐19 publications and included independent evaluations from national reference laboratories, the Foundation for Innovative New Diagnostics and the Diagnostics Global Health website to 16 Nov 2020. We did not apply language restrictions.Selection criteriaWe included studies of people with either suspected SARS‐CoV‐2 infection, known SARS‐CoV‐2 infection or known absence of infection, or those who were being screened for infection. We included test accuracy studies of any design that evaluated commercially produced, rapid antigen or molecular tests suitable for a point‐of‐care setting (minimal equipment, sample preparation, and biosafety requirements, with results within two hours of sample collection). We included all reference standards that define the presence or absence of SARS‐CoV‐2 (including reverse transcription polymerase chain reaction (RT‐PCR) tests and established diagnostic criteria).Data collection and analysisStudies were screened independently in duplicate with disagreements resolved by discussion with a third author. Study characteristics were extracted by one author and checked by a second; extraction of study results and assessments of risk of bias and applicability (made using the QUADAS‐2 tool) were undertaken independently in duplicate. We present sensitivity and specificity with 95% confidence intervals (CIs) for each test and pooled data using the bivariate model separately for antigen and molecular‐based tests. We tabulated results by test manufacturer and compliance with manufacturer instructions for use and according to symptom status.Main resultsSeventy‐eight study cohorts were included (described in 64 study reports, including 20 pre‐prints), reporting results for 24,087 samples (7,415 with confirmed SARS‐CoV‐2). Studies were mainly from Europe (n = 39) or North America (n = 20), and evaluated 16 antigen and five molecular assays.We considered risk of bias to be high in 29 (37%) studies because of participant selection; in 66 (85%) because of weaknesses in the reference standard for absence of infection; and in 29 (37%) for participant flow and timing. Studies of antigen tests were of a higher methodological quality compared to studies of molecular tests, particularly regarding the risk of bias for participant selection and the index test. Characteristics of participants in 35 (45%) studies differed from those in whom the test was intended to be used and the delivery of the index test in 39 (50%) studies differed from the way in which the test was intended to be used. Nearly all studies (97%) defined the presence or absence of SARS‐CoV‐2 based on a single RT‐PCR result, and none included participants meeting case definitions for probable COVID‐19.Antigen testsForty‐eight studies reported 58 evaluations of antigen tests. Estimates of sensitivity varied considerably between studies. There were differences between symptomatic (72.0%, 95% CI 63.7% to 79.0%; 37 evaluations; 15530 samples, 4410 cases) and asymptomatic participants (58.1%, 95% CI 40.2% to 74.1%; 12 evaluations; 1581 samples, 295 cases). Average sensitivity was higher in the first week after symptom onset (78.3%, 95% CI 71.1% to 84.1%; 26 evaluations; 5769 samples, 2320 cases) than in the second week of symptoms (51.0%, 95% CI 40.8% to 61.0%; 22 evaluations; 935 samples, 692 cases). Sensitivity was high in those with cycle threshold (Ct) values on PCR ≤25 (94.5%, 95% CI 91.0% to 96.7%; 36 evaluations; 2613 cases) compared to those with Ct values >25 (40.7%, 95% CI 31.8% to 50.3%; 36 evaluations; 2632 cases). Sensitivity varied between brands. Using data from instructions for use (IFU) compliant evaluations in symptomatic participants, summary sensitivities ranged from 34.1% (95% CI 29.7% to 38.8%; Coris Bioconcept) to 88.1% (95% CI 84.2% to 91.1%; SD Biosensor STANDARD Q). Average specificities were high in symptomatic and asymptomatic participants, and for most brands (overall summary specificity 99.6%, 95% CI 99.0% to 99.8%).At 5% prevalence using data for the most sensitive assays in symptomatic people (SD Biosensor STANDARD Q and Abbott Panbio), positive predictive values (PPVs) of 84% to 90% mean that between 1 in 10 and 1 in 6 positive results will be a false positive, and between 1 in 4 and 1 in 8 cases will be missed. At 0.5% prevalence applying the same tests in asymptomatic people would result in PPVs of 11% to 28% meaning that between 7 in 10 and 9 in 10 positive results will be false positives, and between 1 in 2 and 1 in 3 cases will be missed.No studies assessed the accuracy of repeated lateral flow testing or self‐testing.Rapid molecular assaysThirty studies reported 33 evaluations of five different rapid molecular tests. Sensitivities varied according to test brand. Most of the data relate to the ID NOW and Xpert Xpress assays. Using data from evaluations following the manufacturer’s instructions for use, the average sensitivity of ID NOW was 73.0% (95% CI 66.8% to 78.4%) and average specificity 99.7% (95% CI 98.7% to 99.9%; 4 evaluations; 812 samples, 222 cases). For Xpert Xpress, the average sensitivity was 100% (95% CI 88.1% to 100%) and average specificity 97.2% (95% CI 89.4% to 99.3%; 2 evaluations; 100 samples, 29 cases). Insufficient data were available to investigate the effect of symptom status or time after symptom onset.Authors' conclusionsAntigen tests vary in sensitivity. In people with signs and symptoms of COVID‐19, sensitivities are highest in the first week of illness when viral loads are higher. The assays shown to meet appropriate criteria, such as WHO's priority target product profiles for COVID‐19 diagnostics (‘acceptable’ sensitivity ≥ 80% and specificity ≥ 97%), can be considered as a replacement for laboratory‐based RT‐PCR when immediate decisions about patient care must be made, or where RT‐PCR cannot be delivered in a timely manner. Positive predictive values suggest that confirmatory testing of those with positive results may be considered in low prevalence settings. Due to the variable sensitivity of antigen tests, people who test negative may still be infected.Evidence for testing in asymptomatic cohorts was limited. Test accuracy studies cannot adequately assess the ability of antigen tests to differentiate those who are infectious and require isolation from those who pose no risk, as there is no reference standard for infectiousness. A small number of molecular tests showed high accuracy and may be suitable alternatives to RT‐PCR. However, further evaluations of the tests in settings as they are intended to be used are required to fully establish performance in practice.Several important studies in asymptomatic individuals have been reported since the close of our search and will be incorporated at the next update of this review. Comparative studies of antigen tests in their intended use settings and according to test operator (including self‐testing) are required.</p
Biallelic loss-of-function variants in PLD1 cause congenital right-sided cardiac valve defects and neonatal cardiomyopathy
Congenital heart disease is the most common type of birth defect, accounting for one-third of all congenital anomalies. Using whole-exome sequencing of 2718 patients with congenital heart disease and a search in GeneMatcher, we identified 30 patients from 21 unrelated families of different ancestries with biallelic phospholipase D1 (PLD1) variants who presented predominantly with congenital cardiac valve defects. We also associated recessive PLD1 variants with isolated neonatal cardiomyopathy. Furthermore, we established that p.I668F is a founder variant among Ashkenazi Jews (allele frequency of ~2%) and describe the phenotypic spectrum of PLD1-associated congenital heart defects. PLD1 missense variants were overrepresented in regions of the protein critical for catalytic activity, and, correspondingly, we observed a strong reduction in enzymatic activity for most of the mutant proteins in an enzymatic assay. Finally, we demonstrate that PLD1 inhibition decreased endothelial-mesenchymal transition, an established pivotal early step in valvulogenesis. In conclusion, our study provides a more detailed understanding of disease mechanisms and phenotypic expression associated with PLD1 loss of function
Thoracic imaging tests for the diagnosis of COVID-19
BACKGROUND: The respiratory illness caused by SARS-CoV-2 infection continues to present diagnostic challenges. Our 2020 edition of this review showed thoracic (chest) imaging to be sensitive and moderately specific in the diagnosis of coronavirus disease 2019 (COVID-19). In this update, we include new relevant studies, and have removed studies with case-control designs, and those not intended to be diagnostic test accuracy studies.OBJECTIVES: To evaluate the diagnostic accuracy of thoracic imaging (computed tomography (CT), X-ray and ultrasound) in people with suspected COVID-19.SEARCH METHODS: We searched the COVID-19 Living Evidence Database from the University of Bern, the Cochrane COVID-19 Study Register, The Stephen B. Thacker CDC Library, and repositories of COVID-19 publications through to 30 September 2020. We did not apply any language restrictions.SELECTION CRITERIA: We included studies of all designs, except for case-control, that recruited participants of any age group suspected to have COVID-19 and that reported estimates of test accuracy or provided data from which we could compute estimates.DATA COLLECTION AND ANALYSIS: The review authors independently and in duplicate screened articles, extracted data and assessed risk of bias and applicability concerns using the QUADAS-2 domain-list. We presented the results of estimated sensitivity and specificity using paired forest plots, and we summarised pooled estimates in tables. We used a bivariate meta-analysis model where appropriate. We presented the uncertainty of accuracy estimates using 95% confidence intervals (CIs).MAIN RESULTS: We included 51 studies with 19,775 participants suspected of having COVID-19, of whom 10,155 (51%) had a final diagnosis of COVID-19. Forty-seven studies evaluated one imaging modality each, and four studies evaluated two imaging modalities each. All studies used RT-PCR as the reference standard for the diagnosis of COVID-19, with 47 studies using only RT-PCR and four studies using a combination of RT-PCR and other criteria (such as clinical signs, imaging tests, positive contacts, and follow-up phone calls) as the reference standard. Studies were conducted in Europe (33), Asia (13), North America (3) and South America (2); including only adults (26), all ages (21), children only (1), adults over 70 years (1), and unclear (2); in inpatients (2), outpatients (32), and setting unclear (17). Risk of bias was high or unclear in thirty-two (63%) studies with respect to participant selection, 40 (78%) studies with respect to reference standard, 30 (59%) studies with respect to index test, and 24 (47%) studies with respect to participant flow. For chest CT (41 studies, 16,133 participants, 8110 (50%) cases), the sensitivity ranged from 56.3% to 100%, and specificity ranged from 25.4% to 97.4%. The pooled sensitivity of chest CT was 87.9% (95% CI 84.6 to 90.6) and the pooled specificity was 80.0% (95% CI 74.9 to 84.3). There was no statistical evidence indicating that reference standard conduct and definition for index test positivity were sources of heterogeneity for CT studies. Nine chest CT studies (2807 participants, 1139 (41%) cases) used the COVID-19 Reporting and Data System (CO-RADS) scoring system, which has five thresholds to define index test positivity. At a CO-RADS threshold of 5 (7 studies), the sensitivity ranged from 41.5% to 77.9% and the pooled sensitivity was 67.0% (95% CI 56.4 to 76.2); the specificity ranged from 83.5% to 96.2%; and the pooled specificity was 91.3% (95% CI 87.6 to 94.0). At a CO-RADS threshold of 4 (7 studies), the sensitivity ranged from 56.3% to 92.9% and the pooled sensitivity was 83.5% (95% CI 74.4 to 89.7); the specificity ranged from 77.2% to 90.4% and the pooled specificity was 83.6% (95% CI 80.5 to 86.4). For chest X-ray (9 studies, 3694 participants, 2111 (57%) cases) the sensitivity ranged from 51.9% to 94.4% and specificity ranged from 40.4% to 88.9%. The pooled sensitivity of chest X-ray was 80.6% (95% CI 69.1 to 88.6) and the pooled specificity was 71.5% (95% CI 59.8 to 80.8). For ultrasound of the lungs (5 studies, 446 participants, 211 (47%) cases) the sensitivity ranged from 68.2% to 96.8% and specificity ranged from 21.3% to 78.9%. The pooled sensitivity of ultrasound was 86.4% (95% CI 72.7 to 93.9) and the pooled specificity was 54.6% (95% CI 35.3 to 72.6). Based on an indirect comparison using all included studies, chest CT had a higher specificity than ultrasound. For indirect comparisons of chest CT and chest X-ray, or chest X-ray and ultrasound, the data did not show differences in specificity or sensitivity.AUTHORS' CONCLUSIONS: Our findings indicate that chest CT is sensitive and moderately specific for the diagnosis of COVID-19. Chest X-ray is moderately sensitive and moderately specific for the diagnosis of COVID-19. Ultrasound is sensitive but not specific for the diagnosis of COVID-19. Thus, chest CT and ultrasound may have more utility for excluding COVID-19 than for differentiating SARS-CoV-2 infection from other causes of respiratory illness. Future diagnostic accuracy studies should pre-define positive imaging findings, include direct comparisons of the various modalities of interest in the same participant population, and implement improved reporting practices.</p
Erratum to: Methods for evaluating medical tests and biomarkers
[This corrects the article DOI: 10.1186/s41512-016-0001-y.]
- …
