178 research outputs found

    Review of Diagnostic Test Accuracy (DTA) studies in older people

    Get PDF
    Diagnostic Test Accuracy (DTA) describes a field of research that aims to assess how well a test is able to detect or exclude a condition of interest. Although geriatric medicine is not as reliant on investigations as other medical disciplines, almost all patient encounters with older adults will involve some form of diagnostic assessment. Thus, understanding the terminology and methods of DTA is essential for any clinician. In this review we use examples based around the diagnosis of dementia to highlight issues in DTA research. Some of these are generic to any DTA research and some are particularly pertinent to older adults. One can apply a test accuracy framework to a clinical question by defining four key components: the condition of interest; the index test(s) (i.e. the assessment(s) of interest); the reference standard (the best available method for assessing the condition of interest) and the population or healthcare setting in which testing takes place. Test accuracy is often described using complementary measures of sensitivity and specificity. However, many other metrics to describe test accuracy are available; in clinical practice predictive values may have greater utility. These and other descriptive statistics can be derived from a two by two table that cross-classifies the index test results with the reference standard results. Test performance and utility is not only determined by accuracy, other measures such as feasibility and acceptability should be considered and may be of particular importance when describing test performance in older adults with physical and cognitive impairments

    Meta-analytic approaches for summarising and comparing the accuracy of medical tests

    Get PDF
    Medical tests are essential for patient care. Evidence-based assessment of the relative accuracy of competing diagnostic tests informs clinical and policy decision making. This thesis addresses questions centred on assessing the reliability and transparency of evidence from systematic reviews and meta-analyses of comparative test accuracy, including validity of meta-analytic methods. Case studies were used to highlight key methodological issues, and provided rationale and context for the thesis. Published systematic reviews of multiple tests were identified and used to provide a descriptive survey of recent practice. Availability of comparative accuracy studies and differences between meta-analyses of direct (head-to-head) and indirect (between-study) comparisons were assessed. Comparative meta-analysis methods were reviewed and those deemed statistically robust were empirically evaluated. Using simulation, performance of hierarchical methods for meta-analysis of a single test was investigated in challenging scenarios (e.g. few studies or sparse data) and implications for test comparisons were considered. Poor statistical methods and incomplete reporting threatens the reliability of comparative reviews. Differences exist between direct and indirect comparisons but direct comparisons were seldom feasible because comparative studies were unavailable. Furthermore, inappropriate use of meta-analytic methods generated misleading results and conclusions. Therefore, recommendations for use of valid methods and a reporting checklist were developed

    Regional cerebral blood flow single photon emission computed tomography for detection of Frontotemporal dementia in people with suspected dementia.

    Get PDF
    BACKGROUND: In the UK, dementia affects 5% of the population aged over 65 years and 25% of those over 85 years. Frontotemporal dementia (FTD) represents one subtype and is thought to account for up to 16% of all degenerative dementias. Although the core of the diagnostic process in dementia rests firmly on clinical and cognitive assessments, a wide range of investigations are available to aid diagnosis.Regional cerebral blood flow (rCBF) single-photon emission computed tomography (SPECT) is an established clinical tool that uses an intravenously injected radiolabelled tracer to map blood flow in the brain. In FTD the characteristic pattern seen is hypoperfusion of the frontal and anterior temporal lobes. This pattern of blood flow is different to patterns seen in other subtypes of dementia and so can be used to differentiate FTD.It has been proposed that a diagnosis of FTD, (particularly early stage), should be made not only on the basis of clinical criteria but using a combination of other diagnostic findings, including rCBF SPECT. However, more extensive testing comes at a financial cost, and with a potential risk to patient safety and comfort. OBJECTIVES: To determine the diagnostic accuracy of rCBF SPECT for diagnosing FTD in populations with suspected dementia in secondary/tertiary healthcare settings and in the differential diagnosis of FTD from other dementia subtypes. SEARCH METHODS: Our search strategy used two concepts: (a) the index test and (b) the condition of interest. We searched citation databases, including MEDLINE (Ovid SP), EMBASE (Ovid SP), BIOSIS (Ovid SP), Web of Science Core Collection (ISI Web of Science), PsycINFO (Ovid SP), CINAHL (EBSCOhost) and LILACS (Bireme), using structured search strategies appropriate for each database. In addition we searched specialised sources of diagnostic test accuracy studies and reviews including: MEDION (Universities of Maastricht and Leuven), DARE (Database of Abstracts of Reviews of Effects) and HTA (Health Technology Assessment) database.We requested a search of the Cochrane Register of Diagnostic Test Accuracy Studies and used the related articles feature in PubMed to search for additional studies. We tracked key studies in citation databases such as Science Citation Index and Scopus to ascertain any further relevant studies. We identified 'grey' literature, mainly in the form of conference abstracts, through the Web of Science Core Collection, including Conference Proceedings Citation Index and Embase. The most recent search for this review was run on the 1 June 2013.Following title and abstract screening of the search results, full-text papers were obtained for each potentially eligible study. These papers were then independently evaluated for inclusion or exclusion. SELECTION CRITERIA: We included both case-control and cohort (delayed verification of diagnosis) studies. Where studies used a case-control design we included all participants who had a clinical diagnosis of FTD or other dementia subtype using standard clinical diagnostic criteria. For cohort studies, we included studies where all participants with suspected dementia were administered rCBF SPECT at baseline. We excluded studies of participants from selected populations (e.g. post-stroke) and studies of participants with a secondary cause of cognitive impairment. DATA COLLECTION AND ANALYSIS: Two review authors extracted information on study characteristics and data for the assessment of methodological quality and the investigation of heterogeneity. We assessed the methodological quality of each study using the QUADAS-2 (Quality Assessment of Diagnostic Accuracy Studies) tool. We produced a narrative summary describing numbers of studies that were found to have high/low/unclear risk of bias as well as concerns regarding applicability. To produce 2 x 2 tables, we dichotomised the rCBF SPECT results (scan positive or negative for FTD) and cross-tabulated them against the results for the reference standard. These tables were then used to calculate the sensitivity and specificity of the index test. Meta-analysis was not performed due to the considerable between-study variation in clinical and methodological characteristics. MAIN RESULTS: Eleven studies (1117 participants) met our inclusion criteria. These consisted of six case-control studies, two retrospective cohort studies and three prospective cohort studies. Three studies used single-headed camera SPECT while the remaining eight used multiple-headed camera SPECT. Study design and methods varied widely. Overall, participant selection was not well described and the studies were judged as having either high or unclear risk of bias. Often the threshold used to define a positive SPECT result was not predefined and the results were reported with knowledge of the reference standard. Concerns regarding applicability of the studies to the review question were generally low across all three domains (participant selection, index test and reference standard).Sensitivities and specificities for differentiating FTD from non-FTD ranged from 0.73 to 1.00 and from 0.80 to 1.00, respectively, for the three multiple-headed camera studies. Sensitivities were lower for the two single-headed camera studies; one reported a sensitivity and specificity of 0.40 (95% confidence interval (CI) 0.05 to 0.85) and 0.95 (95% CI 0.90 to 0.98), respectively, and the other a sensitivity and specificity of 0.36 (95% CI 0.24 to 0.50) and 0.92 (95% CI 0.88 to 0.95), respectively.Eight of the 11 studies which used SPECT to differentiate FTD from Alzheimer's disease used multiple-headed camera SPECT. Of these studies, five used a case-control design and reported sensitivities of between 0.52 and 1.00, and specificities of between 0.41 and 0.86. The remaining three studies used a cohort design and reported sensitivities of between 0.73 and 1.00, and specificities of between 0.94 and 1.00. The three studies that used single-headed camera SPECT reported sensitivities of between 0.40 and 0.80, and specificities of between 0.61 and 0.97. AUTHORS' CONCLUSIONS: At present, we would not recommend the routine use of rCBF SPECT in clinical practice because there is insufficient evidence from the available literature to support this.Further research into the use of rCBF SPECT for differentiating FTD from other dementias is required. In particular, protocols should be standardised, study populations should be well described, the threshold for 'abnormal' scans predefined and clear details given on how scans are analysed. More prospective cohort studies that verify the presence or absence of FTD during a period of follow up should be undertaken

    Rapid diagnostic tests for diagnosing uncomplicated non-falciparum or Plasmodium vivax malaria in endemic countries

    Get PDF
    Background In settings where both Plasmodium vivax and Plasmodium falciparum infection cause malaria, rapid diagnostic tests (RDTs) need to distinguish which species is causing the patients' symptoms, as different treatments are required. Older RDTs incorporated two test lines to distinguish malaria due to P. falciparum, from malaria due to any other Plasmodium species (non-falciparum). These RDTs can be classified according to which antibodies they use: Type 2 RDTs use HRP-2 (for P. falciparum) and aldolase (all species); Type 3 RDTs use HRP-2 (for P. falciparum) and pLDH (all species); Type 4 use pLDH (fromP. falciparum) and pLDH (all species). More recently, RDTs have been developed to distinguish P. vivax parasitaemia by utilizing a pLDH antibody specific to P. vivax. Objectives To assess the diagnostic accuracy of RDTs for detecting non-falciparum or P. vivax parasitaemia in people living in malaria-endemic areas who present to ambulatory healthcare facilities with symptoms suggestive of malaria, and to identify which types and brands of commercial test best detect non-falciparum and P. vivax malaria. Search methods We undertook a comprehensive search of the following databases up to 31 December 2013: Cochrane Infectious Diseases Group Specialized Register; MEDLINE; EMBASE; MEDION; Science Citation Index; Web of Knowledge; African Index Medicus; LILACS; and IndMED. Selection criteria Studies comparing RDTs with a reference standard (microscopy or polymerase chain reaction) in blood samples from a random or consecutive series of patients attending ambulatory health facilities with symptoms suggestive of malaria in non-falciparum endemic areas. Data collection and analysis For each study, two review authors independently extracted a standard set of data using a tailored data extraction form. We grouped comparisons by type of RDT (defined by the combinations of antibodies used), and combined in meta-analysis where appropriate. Average sensitivities and specificities are presented alongside 95% confidence intervals (95% CI). Main results We included 47 studies enrolling 22,862 participants. Patient characteristics, sampling methods and reference standard methods were poorly reported in most studies. RDTs detecting 'non-falciparum' parasitaemia Eleven studies evaluated Type 2 tests compared with microscopy, 25 evaluated Type 3 tests, and 11 evaluated Type 4 tests. In meta-analyses, average sensitivities and specificities were 78% (95% CI 73% to 82%) and 99% (95% CI 97% to 99%) for Type 2 tests, 78% (95% CI 69% to 84%) and 99% (95% CI 98% to 99%) for Type 3 tests, and 89% (95% CI 79% to 95%) and 98% (95% CI 97% to 99%) for Type 4 tests, respectively. Type 4 tests were more sensitive than both Type 2 (P = 0.01) and Type 3 tests (P = 0.03). Five studies compared Type 3 tests with PCR; in meta-analysis, the average sensitivity and specificity were 81% (95% CI 72% to 88%) and 99% (95% CI 97% to 99%) respectively. RDTs detecting P.vivax parasitaemia Eight studies compared pLDH tests to microscopy; the average sensitivity and specificity were 95% (95% CI 86% to 99%) and 99% (95% CI 99% to 100%), respectively. Authors' conclusions RDTs designed to detect P. vivax specifically, whether alone or as part of a mixed infection, appear to be more accurate than older tests designed to distinguish P. falciparum malaria from non-falciparum malaria. Compared to microscopy, these tests fail to detect around 5% ofP. vivax cases. This Cochrane Review, in combination with other published information about in vitro test performance and stability in the field, can assist policy-makers to choose between the available RDTs

    Diagnostic accuracy of screening tests for COPD : a systematic review and meta-analysis

    Get PDF
    BACKGROUND: Chronic obstructive pulmonary disease (COPD) is widely underdiagnosed. A number of studies have evaluated the accuracy of screening tests for COPD, but their findings have not been formally summarised. We therefore sought to determine and compare the diagnostic accuracy of such screening tests in primary care. METHODS: Systematic review and meta-analysis of the diagnostic accuracy of screening tests for COPD confirmed by spirometry in primary care. We searched MEDLINE, EMBASE and other bibliographic databases from 1997 to 2013 for diagnostic accuracy studies that evaluated 1 or more index tests in primary care among individuals aged ≥35 years with no prior diagnosis of COPD. Bivariate meta-analysis of sensitivity and specificity was performed where appropriate. Methodological quality was assessed independently by 2 reviewers using the QUADAS-2 tool. RESULTS: 10 studies were included. 8 assessed screening questionnaires (the COPD Diagnostic Questionnaire (CDQ) was the most evaluated, n=4), 4 assessed handheld flow meters (eg, COPD-6) and 1 assessed their combination. Among ever smokers, the CDQ (score threshold ≥19.5; n=4) had a pooled sensitivity of 64.5% (95% CI 59.9% to 68.8%) and specificity of 65.2% (52.9% to 75.8%), and handheld flow meters (n=3) had a sensitivity of 79.9% (95% CI 74.2% to 84.7%) and specificity of 84.4% (68.9% to 93.0%). Inadequate blinding between index tests and spirometry was the main risk of bias. CONCLUSIONS: Handheld flow meters demonstrated higher test accuracy than the CDQ for COPD screening in primary care. The choice of alternative screening tests within whole screening programmes should now be fully evaluated. PROSPERO REGISTRATION NUMBER: CRD42012002074

    Performance of methods for meta-analysis of diagnostic test accuracy with few studies or sparse data

    Get PDF
    Hierarchical models such as the bivariate and hierarchical summary receiver operating characteristic (HSROC) models are recommended for meta-analysis of test accuracy studies. These models are challenging to fit when there are few studies and/or sparse data (for example zero cells in contingency tables due to studies reporting 100% sensitivity or specificity); the models may not converge, or give unreliable parameter estimates. Using simulation, we investigated the performance of seven hierarchical models incorporating increasing simplifications in scenarios designed to replicate realistic situations for meta-analysis of test accuracy studies. Performance of the models was assessed in terms of estimability (percentage of meta-analyses that successfully converged and percentage where the between study correlation was estimable), bias, mean square error and coverage of the 95% confidence intervals. Our results indicate that simpler hierarchical models are valid in situations with few studies or sparse data. For synthesis of sensitivity and specificity, univariate random effects logistic regression models are appropriate when a bivariate model cannot be fitted. Alternatively, an HSROC model that assumes a symmetric SROC curve (by excluding the shape parameter) can be used if the HSROC model is the chosen meta-analytic approach. In the absence of heterogeneity, fixed effect equivalent of the models can be applied
    • …
    corecore