11 research outputs found

    A case study of an individual participant data meta-analysis of diagnostic accuracy showed that prediction regions represented heterogeneity well

    Get PDF
    The diagnostic accuracy of a screening tool is often characterized by its sensitivity and specificity. An analysis of these measures must consider their intrinsic correlation. In the context of an individual participant data meta-analysis, heterogeneity is one of the main components of the analysis. When using a random-effects meta-analytic model, prediction regions provide deeper insight into the effect of heterogeneity on the variability of estimated accuracy measures across the entire studied population, not just the average. This study aimed to investigate heterogeneity via prediction regions in an individual participant data meta-analysis of the sensitivity and specificity of the Patient Health Questionnaire-9 for screening to detect major depression. From the total number of studies in the pool, four dates were selected containing roughly 25%, 50%, 75% and 100% of the total number of participants. A bivariate random-effects model was fitted to studies up to and including each of these dates to jointly estimate sensitivity and specificity. Two-dimensional prediction regions were plotted in ROC-space. Subgroup analyses were carried out on sex and age, regardless of the date of the study. The dataset comprised 17,436 participants from 58 primary studies of which 2322 (13.3%) presented cases of major depression. Point estimates of sensitivity and specificity did not differ importantly as more studies were added to the model. However, correlation of the measures increased. As expected, standard errors of the logit pooled TPR and FPR consistently decreased as more studies were used, while standard deviations of the random-effects did not decrease monotonically. Subgroup analysis by sex did not reveal important contributions for observed heterogeneity; however, the shape of the prediction regions differed. Subgroup analysis by age did not reveal meaningful contributions to the heterogeneity and the prediction regions were similar in shape. Prediction intervals and regions reveal previously unseen trends in a dataset. In the context of a meta-analysis of diagnostic test accuracy, prediction regions can display the range of accuracy measures in different populations and settings

    The Diagnostic Accuracy of the Patient Health Questionnaire-2 (PHQ-2) Patient Health Questionnaire-8 (PHQ-8) and Patient Health Questionnaire-9 (PHQ-9) for Detecting Major Depression: Protocol for a Systematic Review and Individual Patient Data Meta-analyses.

    No full text
    Background: Major depressive disorder (MDD) may be present in 10%-20% of patients in medical settings. Routine depression screening is sometimes recommended to improve depression management. However, studies of the diagnostic accuracy of depression screening tools have typically used data-driven, exploratory methods to select optimal cutoffs. Often, these studies report results from a small range of cutoff points around whatever cutoff score is most accurate in that given study. When published data are combined in meta-analyses, estimates of accuracy for different cutoff points may be based on data from different studies, rather than data from all studies for each possible cutoff point. As a result, traditional meta-analyses may generate exaggerated estimates of accuracy. Individual patient data (IPD) meta-analyses can address this problem by synthesizing data from all studies for each cutoff score to obtain diagnostic accuracy estimates. The nine-item Patient Health Questionnaire-9 (PHQ-9) and the shorter PHQ-2 and PHQ-8 are commonly recommended for depression screening. Thus, the primary objectives of our IPD meta-analyses are to determine the diagnostic accuracy of the PHQ-9, PHQ-8, and PHQ-2 to detect MDD among adults across all potentially relevant cutoff scores. Secondary analyses involve assessing accuracy accounting for patient factors that may influence accuracy (age, sex, medical comorbidity). Methods/Design: Data sources will include MEDLINE, MEDLINE In-Process & Other Non-Indexed Citations, PsycINFO, and Web of Science. We will include studies that included a Diagnostic and Statistical Manual or International Classification of Diseases diagnosis of MDD based on a validated structured or semi-structured clinical interview administered within 2 weeks of the administration of the PHQ. Two reviewers will independently screen titles and abstracts, perform full article review, and extract study data. Disagreements will be resolved by consensus. Risk of bias will be assessed with the Quality Assessment of Diagnostic Accuracy Studies-2 tool. Bivariate random-effects meta-analysis will be conducted for the full range of plausible cutoff values. Discussion: The proposed IPD meta-analyses will allow us to obtain estimates of the diagnostic accuracy of the PHQ-9, PHQ-8, and PHQ-2

    Diagnostic accuracy of the edinburgh postnatal depression scale (EPDS) for detecting major depression in pregnant and postnatal women: Protocol for a systematic review and individual patient data meta-analyses

    No full text
    Introduction: Studies of the diagnostic accuracy of depression screening tools often used data-driven methods to select optimal cut-offs. Typically, these studies report results from a small range of cut-off points around whatever cut-off score is identified as most accurate. When published data are combined in meta-analyses, estimates of accuracy for different cutoff points may be based on data from different studies, rather than data from all studies for each cut-off point. Thus, traditional meta-analyses may exaggerate accuracy estimates. Individual patient data (IPD) metaanalyses synthesise data from all studies for each cutoff score to obtain accuracy estimates. The 10-item Edinburgh Postnatal Depression Scale (EPDS) is commonly recommended for depression screening in the perinatal period. The primary objective of this IPD meta-analysis is to determine the diagnostic accuracy of the EPDS to detect major depression among women during pregnancy and in the postpartum period across all potentially relevant cut-off scores, accounting for patient factors that may influence accuracy (age, pregnancy vs postpartum). Methods and analysis: Data sources will include Medline, Medline In-Process & Other Non-Indexed Citations, PsycINFO, and Web of Science. Studies that include a diagnosis of major depression based on a validated structured or semistructured clinical interview administered within 2 weeks of (before or after) the administration of the EPDS will be included. Risk of bias will be assessed with the Quality Assessment of Diagnostic Accuracy Studies-2 tool. Bivariate randomeffects meta-analysis will be conducted for the full range of plausible cut-off values. Analyses will evaluate data from pregnancy and the postpartum period separately, as well as combining data from all women in a single model

    Comparison of the Accuracy of the 7-Item HADS Depression Subscale and 14-Item Total HADS for Screening for Major Depression: A Systematic Review and Individual Participant Data Meta-Analysis

    No full text
    The seven-item Hospital Anxiety and Depression Scale Depression subscale (HADS-D) and the total score of the 14-item HADS (HADS-T) are both used for major depression screening. Compared to the HADS-D, the HADS-T includes anxiety items and requires more time to complete. We compared the screening accuracy of the HADS-D and HADS-T for major depression detection. We conducted an individual participant data meta-analysis and fit bivariate random effects models to assess diagnostic accuracy among participants with both HADS-D and HADS-T scores. We identified optimal cutoffs, estimated sensitivity and specificity with 95% confidence intervals, and compared screening accuracy across paired cutoffs via two-stage and individual-level models. We used a 0.05 equivalence margin to assess equivalency in sensitivity and specificity. 20,700 participants (2,285 major depression cases) from 98 studies were included. Cutoffs of >= 7 for the HADS-D (sensitivity 0.79 [0.75, 0.83], specificity 0.78 [0.75, 0.80]) and >= 15 for the HADS-T (sensitivity 0.79 [0.76, 0.82], specificity 0.81 [0.78, 0.83]) minimized the distance to the top-left corner of the receiver operating characteristic curve. Across all sets of paired cutoffs evaluated, differences of sensitivity between HADS-T and HADS-D ranged from -0.05 to 0.01 (0.00 at paired optimal cutoffs), and differences of specificity were within 0.03 for all cutoffs (0.02-0.03). The pattern was similar among outpatients, although the HADS-T was slightly (not nonequivalently) more specific among inpatients. The accuracy of HADS-T was equivalent to the HADS-D for detecting major depression. In most settings, the shorter HADS-D would be preferred.Public Significance Statement The present study suggests that the accuracy of 14-item Hospital Anxiety and Depression Scale (HADS-D) and the seven-item HADS Depression subscale (HADS-D) are equivalent for detecting major depression. Using the seven-item HADS-D for depression screening instead of the full 14-item HADS-T has minimal influence on performance of the measure but would reduce patient and participant burden in most clinical and research settings

    Depression prevalence using the HADS-D compared to SCID major depression classification: An individual participant data meta-analysis

    No full text
    Objectives: Validated diagnostic interviews are required to classify depression status and estimate prevalence of disorder, but screening tools are often used instead. We used individual participant data meta-analysis to compare prevalence based on standard Hospital Anxiety and Depression Scale – depression subscale (HADS-D) cutoffs of ≥8 and ≥11 versus Structured Clinical Interview for DSM (SCID) major depression and determined if an alternative HADS-D cutoff could more accurately estimate prevalence. Methods: We searched Medline, Medline In-Process & Other Non-Indexed Citations via Ovid, PsycINFO, and Web of Science (inception-July 11, 2016) for studies comparing HADS-D scores to SCID major depression status. Pooled prevalence and pooled differences in prevalence for HADS-D cutoffs versus SCID major depression were estimated. Results: 6005 participants (689 SCID major depression cases) from 41 primary studies were included. Pooled prevalence was 24.5% (95% Confidence Interval (CI): 20.5%, 29.0%) for HADS-D ≥8, 10.7% (95% CI: 8.3%, 13.8%) for HADS-D ≥11, and 11.6% (95% CI: 9.2%, 14.6%) for SCID major depression. HADS-D ≥11 was closest to SCID major depression prevalence, but the 95% prediction interval for the difference that could be expected for HADS-D ≥11 versus SCID in a new study was −21.1% to 19.5%. Conclusions: HADS-D ≥8 substantially overestimates depression prevalence. Of all possible cutoff thresholds, HADS-D ≥11 was closest to the SCID, but there was substantial heterogeneity in the difference between HADS-D ≥11 and SCID-based estimates. HADS-D should not be used as a substitute for a validated diagnostic interview. © 2020 Elsevier Inc

    Depression prevalence based on the Edinburgh Postnatal Depression Scale compared to Structured Clinical Interview for DSM DIsorders classification: Systematic review and individual participant data meta-analysis.

    No full text
    Estimates of depression prevalence in pregnancy and postpartum are based on the Edinburgh Postnatal Depression Scale (EPDS) more than on any other method. We aimed to determine if any EPDS cutoff can accurately and consistently estimate depression prevalence in individual studies. We analyzed datasets that compared EPDS scores to Structured Clinical Interview for DSM (SCID) major depression status. Random-effects meta-analysis was used to compare prevalence with EPDS cutoffs versus the SCID. Seven thousand three hundred and fifteen participants (1017 SCID major depression) from 29 primary studies were included. For EPDS cutoffs used to estimate prevalence in recent studies (≥9 to ≥14), pooled prevalence estimates ranged from 27.8% (95% CI: 22.0%-34.5%) for EPDS ≥ 9 to 9.0% (95% CI: 6.8%-11.9%) for EPDS ≥ 14; pooled SCID major depression prevalence was 9.0% (95% CI: 6.5%-12.3%). EPDS ≥14 provided pooled prevalence closest to SCID-based prevalence but differed from SCID prevalence in individual studies by a mean absolute difference of 5.1% (95% prediction interval: -13.7%, 12.3%). EPDS ≥14 approximated SCID-based prevalence overall, but considerable heterogeneity in individual studies is a barrier to using it for prevalence estimation

    Probability of major depression diagnostic classification based on the SCID, CIDI and MINI diagnostic interviews controlling for Hospital Anxiety and Depression Scale – Depression subscale scores: An individual participant data meta-analysis of 73 primary studies

    No full text
    Objective: Two previous individual participant data meta-analyses (IPDMAs) found that different diagnostic interviews classify different proportions of people as having major depression overall or by symptom levels. We compared the odds of major depression classification across diagnostic interviews among studies that administered the Depression subscale of the Hospital Anxiety and Depression Scale (HADS-D). Methods: Data accrued for an IPDMA on HADS-D diagnostic accuracy were analysed. We fit binomial generalized linear mixed models to compare odds of major depression classification for the Structured Clinical Interview for DSM (SCID), Composite International Diagnostic Interview (CIDI), and Mini International Neuropsychiatric Interview (MINI), controlling for HADS-D scores and participant characteristics with and without an interaction term between interview and HADS-D scores. Results: There were 15,856 participants (1942 [12%] with major depression) from 73 studies, including 15,335 (97%) non-psychiatric medical patients, 164 (1%) partners of medical patients, and 357 (2%) healthy adults. The MINI (27 studies, 7345 participants, 1066 major depression cases) classified participants as having major depression more often than the CIDI (10 studies, 3023 participants, 269 cases) (adjusted odds ratio [aOR] = 1.70 (0.84, 3.43)) and the semi-structured SCID (36 studies, 5488 participants, 607 cases) (aOR = 1.52 (1.01, 2.30)). The odds ratio for major depression classification with the CIDI was less likely to increase as HADS-D scores increased than for the SCID (interaction aOR = 0.92 (0.88, 0.96)). Conclusion: Compared to the SCID, the MINI may diagnose more participants as having major depression, and the CIDI may be less responsive to symptom severity. © 2019 Elsevier Inc

    Genome features of Leptospira interrogans serovar Copenhageni

    No full text
    We report novel features of the genome sequence of Leptospira interrogans serovar Copenhageni, a highly invasive spirochete. Leptospira species colonize a significant proportion of rodent populations worldwide and produce life-threatening infections in mammals. Genomic sequence analysis reveals the presence of a competent transport system with 13 families of genes encoding for major transporters including a three-member component efflux system compatible with the long-term survival of this organism. The leptospiral genome contains a broad array of genes encoding regulatory system, signal transduction and methyl-accepting chemotaxis proteins, reflecting the organism's ability to respond to diverse environmental stimuli. The identification of a complete set of genes encoding the enzymes for the cobalamin biosynthetic pathway and the novel coding genes related to lipopolysaccharide biosynthesis should bring new light to the study of Leptospira physiology. Genes related to toxins, lipoproteins and several surface-exposed proteins may facilitate a better understanding of the Leptospira pathogenesis and may serve as potential candidates for vaccine

    Equivalency of the diagnostic accuracy of the PHQ-8 and PHQ-9: A systematic review and individual participant data meta-analysis

    No full text
    Item 9 of the Patient Health Questionnaire-9 (PHQ-9) queries about thoughts of death and self-harm, but not suicidality. Although it is sometimes used to assess suicide risk, most positive responses are not associated with suicidality. The PHQ-8, which omits Item 9, is thus increasingly used in research. We assessed equivalency of total score correlations and the diagnostic accuracy to detect major depression of the PHQ-8 and PHQ-9.Methods We conducted an individual patient data meta-analysis. We fit bivariate random-effects models to assess diagnostic accuracy.Results 16 742 participants (2097 major depression cases) from 54 studies were included. The correlation between PHQ-8 and PHQ-9 scores was 0.996 (95 confidence interval 0.996 to 0.996). The standard cutoff score of 10 for the PHQ-9 maximized sensitivity + specificity for the PHQ-8 among studies that used a semi-structured diagnostic interview reference standard (N = 27). At cutoff 10, the PHQ-8 was less sensitive by 0.02 (-0.06 to 0.00) and more specific by 0.01 (0.00 to 0.01) among those studies (N = 27), with similar results for studies that used other types of interviews (N = 27). For all 54 primary studies combined, across all cutoffs, the PHQ-8 was less sensitive than the PHQ-9 by 0.00 to 0.05 (0.03 at cutoff 10), and specificity was within 0.01 for all cutoffs (0.00 to 0.01).Conclusions PHQ-8 and PHQ-9 total scores were similar. Sensitivity may be minimally reduced with the PHQ-8, but specificity is similar. Copyright © Cambridge University Press 2019
    corecore