10 research outputs found

    A case study of an individual participant data meta-analysis of diagnostic accuracy showed that prediction regions represented heterogeneity well

    Get PDF
    The diagnostic accuracy of a screening tool is often characterized by its sensitivity and specificity. An analysis of these measures must consider their intrinsic correlation. In the context of an individual participant data meta-analysis, heterogeneity is one of the main components of the analysis. When using a random-effects meta-analytic model, prediction regions provide deeper insight into the effect of heterogeneity on the variability of estimated accuracy measures across the entire studied population, not just the average. This study aimed to investigate heterogeneity via prediction regions in an individual participant data meta-analysis of the sensitivity and specificity of the Patient Health Questionnaire-9 for screening to detect major depression. From the total number of studies in the pool, four dates were selected containing roughly 25%, 50%, 75% and 100% of the total number of participants. A bivariate random-effects model was fitted to studies up to and including each of these dates to jointly estimate sensitivity and specificity. Two-dimensional prediction regions were plotted in ROC-space. Subgroup analyses were carried out on sex and age, regardless of the date of the study. The dataset comprised 17,436 participants from 58 primary studies of which 2322 (13.3%) presented cases of major depression. Point estimates of sensitivity and specificity did not differ importantly as more studies were added to the model. However, correlation of the measures increased. As expected, standard errors of the logit pooled TPR and FPR consistently decreased as more studies were used, while standard deviations of the random-effects did not decrease monotonically. Subgroup analysis by sex did not reveal important contributions for observed heterogeneity; however, the shape of the prediction regions differed. Subgroup analysis by age did not reveal meaningful contributions to the heterogeneity and the prediction regions were similar in shape. Prediction intervals and regions reveal previously unseen trends in a dataset. In the context of a meta-analysis of diagnostic test accuracy, prediction regions can display the range of accuracy measures in different populations and settings

    Comparison of Self-Efficacy for Managing Chronic Disease between patients with systemic sclerosis and other chronic conditions: A systematic review

    No full text
    Contains fulltext : 165999.pdf (publisher's version ) (Closed access)The complexity and burden of systemic sclerosis (SSc) pose challenges to developing and sustaining disease management self-efficacy. The objective of this systematic review was to compare scores on a commonly used self-efficacy measure, the Self-Efficacy for Managing Chronic Disease (SEMCD) Scale, between SSc and other diseases. Data sources included the CINAHL, EMBASE, MEDLINE, and Scopus databases, searched through January 25, 2016, and reference lists of included articles and relevant reviews. Studies in any language that reported total SEMCD scores or individual item scores in adult non-psychiatric medical patients were eligible. We identified one eligible non-intervention study of SSc patients (n = 553), 13 other non-intervention studies, and 21 studies with pre-intervention data for patients enrolled in a self-management program or a trial of a program. Of 13 non-intervention studies with published total score means in cancer, cardiovascular disease, Parkinson’s disease, spinal cord injuries, organ transplant candidates and recipients, dialysis, and lupus, SEMCD scores were statistically significantly lower (poorer self-efficacy) in SSc than 6 other disease samples, not significantly different from 6, and significantly higher than lupus patients. Compared to 18 studies of patients in self-management programs or trials with published total score means, SSc patients were similar or lower than 9 samples and significantly higher than 9 samples. Compared to patients with other diseases not enrolled in programs to improve self-efficacy, SSc patients report lower self-efficacy scores than most patient groups. Rigorously tested self-care interventions designed to meet the unique needs of patients with SSc are needed.12 p

    Validation of the Self-Efficacy for Managing Chronic Disease Scale: A Scleroderma Patient-Centered Intervention Network cohort study

    No full text
    Objective: Self-management programs for patients with chronic illnesses, including rheumatic diseases, seek to enhance self-efficacy for performing health management behaviors. No measure of self-efficacy has been validated for patients with systemic sclerosis (SSc; scleroderma). The objective of this study was to assess the validity and internal consistency reliability of the Self-Efficacy for Managing Chronic Disease (SEMCD) scale in SSc. Methods: English-speaking SSc patients enrolled in the Scleroderma Patient-centered Intervention Network Cohort who completed the SEMCD scale at their baseline assessment between March 2014 and June 2015 were included. Patients were enrolled from 21 sites in Canada, the US, and the UK. Confirmatory factor analysis (CFA) was used to evaluate the factor structure of the SEMCD scale. Cronbach's alpha was calculated to assess internal consistency reliability. Hypotheses on the direction and magnitude of Pearson's correlations with psychological and physical outcome measures were formulated and tested to examine convergent validity. Results: A total of 553 patients were included. CFA supported the single-factor structure of the SEMCD scale (Tucker Lewis Index = 0.99, comparative fit index = 0.99, root mean square error of approximation = 0.10). Internal consistency was high (alpha = 0.93), and correlations with measures of psychological and physical functioning were moderate to large (|r| = 0.48-0.67, P < 0.001), confirming study hypotheses. Conclusion: Scores from the SEMCD scale are valid for measuring self-efficacy in patients with SSc, and results support using the scale as an outcome measure to evaluate the effectiveness of self-management programs in SSc

    Diagnostic accuracy of the edinburgh postnatal depression scale (EPDS) for detecting major depression in pregnant and postnatal women: Protocol for a systematic review and individual patient data meta-analyses

    No full text
    Introduction: Studies of the diagnostic accuracy of depression screening tools often used data-driven methods to select optimal cut-offs. Typically, these studies report results from a small range of cut-off points around whatever cut-off score is identified as most accurate. When published data are combined in meta-analyses, estimates of accuracy for different cutoff points may be based on data from different studies, rather than data from all studies for each cut-off point. Thus, traditional meta-analyses may exaggerate accuracy estimates. Individual patient data (IPD) metaanalyses synthesise data from all studies for each cutoff score to obtain accuracy estimates. The 10-item Edinburgh Postnatal Depression Scale (EPDS) is commonly recommended for depression screening in the perinatal period. The primary objective of this IPD meta-analysis is to determine the diagnostic accuracy of the EPDS to detect major depression among women during pregnancy and in the postpartum period across all potentially relevant cut-off scores, accounting for patient factors that may influence accuracy (age, pregnancy vs postpartum). Methods and analysis: Data sources will include Medline, Medline In-Process & Other Non-Indexed Citations, PsycINFO, and Web of Science. Studies that include a diagnosis of major depression based on a validated structured or semistructured clinical interview administered within 2 weeks of (before or after) the administration of the EPDS will be included. Risk of bias will be assessed with the Quality Assessment of Diagnostic Accuracy Studies-2 tool. Bivariate randomeffects meta-analysis will be conducted for the full range of plausible cut-off values. Analyses will evaluate data from pregnancy and the postpartum period separately, as well as combining data from all women in a single model

    Comparison of the Accuracy of the 7-Item HADS Depression Subscale and 14-Item Total HADS for Screening for Major Depression: A Systematic Review and Individual Participant Data Meta-Analysis

    No full text
    The seven-item Hospital Anxiety and Depression Scale Depression subscale (HADS-D) and the total score of the 14-item HADS (HADS-T) are both used for major depression screening. Compared to the HADS-D, the HADS-T includes anxiety items and requires more time to complete. We compared the screening accuracy of the HADS-D and HADS-T for major depression detection. We conducted an individual participant data meta-analysis and fit bivariate random effects models to assess diagnostic accuracy among participants with both HADS-D and HADS-T scores. We identified optimal cutoffs, estimated sensitivity and specificity with 95% confidence intervals, and compared screening accuracy across paired cutoffs via two-stage and individual-level models. We used a 0.05 equivalence margin to assess equivalency in sensitivity and specificity. 20,700 participants (2,285 major depression cases) from 98 studies were included. Cutoffs of >= 7 for the HADS-D (sensitivity 0.79 [0.75, 0.83], specificity 0.78 [0.75, 0.80]) and >= 15 for the HADS-T (sensitivity 0.79 [0.76, 0.82], specificity 0.81 [0.78, 0.83]) minimized the distance to the top-left corner of the receiver operating characteristic curve. Across all sets of paired cutoffs evaluated, differences of sensitivity between HADS-T and HADS-D ranged from -0.05 to 0.01 (0.00 at paired optimal cutoffs), and differences of specificity were within 0.03 for all cutoffs (0.02-0.03). The pattern was similar among outpatients, although the HADS-T was slightly (not nonequivalently) more specific among inpatients. The accuracy of HADS-T was equivalent to the HADS-D for detecting major depression. In most settings, the shorter HADS-D would be preferred.Public Significance Statement The present study suggests that the accuracy of 14-item Hospital Anxiety and Depression Scale (HADS-D) and the seven-item HADS Depression subscale (HADS-D) are equivalent for detecting major depression. Using the seven-item HADS-D for depression screening instead of the full 14-item HADS-T has minimal influence on performance of the measure but would reduce patient and participant burden in most clinical and research settings

    Depression prevalence using the HADS-D compared to SCID major depression classification: An individual participant data meta-analysis

    No full text
    Objectives: Validated diagnostic interviews are required to classify depression status and estimate prevalence of disorder, but screening tools are often used instead. We used individual participant data meta-analysis to compare prevalence based on standard Hospital Anxiety and Depression Scale – depression subscale (HADS-D) cutoffs of ≥8 and ≥11 versus Structured Clinical Interview for DSM (SCID) major depression and determined if an alternative HADS-D cutoff could more accurately estimate prevalence. Methods: We searched Medline, Medline In-Process &amp; Other Non-Indexed Citations via Ovid, PsycINFO, and Web of Science (inception-July 11, 2016) for studies comparing HADS-D scores to SCID major depression status. Pooled prevalence and pooled differences in prevalence for HADS-D cutoffs versus SCID major depression were estimated. Results: 6005 participants (689 SCID major depression cases) from 41 primary studies were included. Pooled prevalence was 24.5% (95% Confidence Interval (CI): 20.5%, 29.0%) for HADS-D ≥8, 10.7% (95% CI: 8.3%, 13.8%) for HADS-D ≥11, and 11.6% (95% CI: 9.2%, 14.6%) for SCID major depression. HADS-D ≥11 was closest to SCID major depression prevalence, but the 95% prediction interval for the difference that could be expected for HADS-D ≥11 versus SCID in a new study was −21.1% to 19.5%. Conclusions: HADS-D ≥8 substantially overestimates depression prevalence. Of all possible cutoff thresholds, HADS-D ≥11 was closest to the SCID, but there was substantial heterogeneity in the difference between HADS-D ≥11 and SCID-based estimates. HADS-D should not be used as a substitute for a validated diagnostic interview. © 2020 Elsevier Inc

    Depression prevalence based on the Edinburgh Postnatal Depression Scale compared to Structured Clinical Interview for DSM DIsorders classification: Systematic review and individual participant data meta-analysis.

    No full text
    Estimates of depression prevalence in pregnancy and postpartum are based on the Edinburgh Postnatal Depression Scale (EPDS) more than on any other method. We aimed to determine if any EPDS cutoff can accurately and consistently estimate depression prevalence in individual studies. We analyzed datasets that compared EPDS scores to Structured Clinical Interview for DSM (SCID) major depression status. Random-effects meta-analysis was used to compare prevalence with EPDS cutoffs versus the SCID. Seven thousand three hundred and fifteen participants (1017 SCID major depression) from 29 primary studies were included. For EPDS cutoffs used to estimate prevalence in recent studies (≥9 to ≥14), pooled prevalence estimates ranged from 27.8% (95% CI: 22.0%-34.5%) for EPDS ≥ 9 to 9.0% (95% CI: 6.8%-11.9%) for EPDS ≥ 14; pooled SCID major depression prevalence was 9.0% (95% CI: 6.5%-12.3%). EPDS ≥14 provided pooled prevalence closest to SCID-based prevalence but differed from SCID prevalence in individual studies by a mean absolute difference of 5.1% (95% prediction interval: -13.7%, 12.3%). EPDS ≥14 approximated SCID-based prevalence overall, but considerable heterogeneity in individual studies is a barrier to using it for prevalence estimation

    Probability of major depression diagnostic classification based on the SCID, CIDI and MINI diagnostic interviews controlling for Hospital Anxiety and Depression Scale – Depression subscale scores: An individual participant data meta-analysis of 73 primary studies

    No full text
    Objective: Two previous individual participant data meta-analyses (IPDMAs) found that different diagnostic interviews classify different proportions of people as having major depression overall or by symptom levels. We compared the odds of major depression classification across diagnostic interviews among studies that administered the Depression subscale of the Hospital Anxiety and Depression Scale (HADS-D). Methods: Data accrued for an IPDMA on HADS-D diagnostic accuracy were analysed. We fit binomial generalized linear mixed models to compare odds of major depression classification for the Structured Clinical Interview for DSM (SCID), Composite International Diagnostic Interview (CIDI), and Mini International Neuropsychiatric Interview (MINI), controlling for HADS-D scores and participant characteristics with and without an interaction term between interview and HADS-D scores. Results: There were 15,856 participants (1942 [12%] with major depression) from 73 studies, including 15,335 (97%) non-psychiatric medical patients, 164 (1%) partners of medical patients, and 357 (2%) healthy adults. The MINI (27 studies, 7345 participants, 1066 major depression cases) classified participants as having major depression more often than the CIDI (10 studies, 3023 participants, 269 cases) (adjusted odds ratio [aOR] = 1.70 (0.84, 3.43)) and the semi-structured SCID (36 studies, 5488 participants, 607 cases) (aOR = 1.52 (1.01, 2.30)). The odds ratio for major depression classification with the CIDI was less likely to increase as HADS-D scores increased than for the SCID (interaction aOR = 0.92 (0.88, 0.96)). Conclusion: Compared to the SCID, the MINI may diagnose more participants as having major depression, and the CIDI may be less responsive to symptom severity. © 2019 Elsevier Inc

    Equivalency of the diagnostic accuracy of the PHQ-8 and PHQ-9: A systematic review and individual participant data meta-analysis

    No full text
    Item 9 of the Patient Health Questionnaire-9 (PHQ-9) queries about thoughts of death and self-harm, but not suicidality. Although it is sometimes used to assess suicide risk, most positive responses are not associated with suicidality. The PHQ-8, which omits Item 9, is thus increasingly used in research. We assessed equivalency of total score correlations and the diagnostic accuracy to detect major depression of the PHQ-8 and PHQ-9.Methods We conducted an individual patient data meta-analysis. We fit bivariate random-effects models to assess diagnostic accuracy.Results 16 742 participants (2097 major depression cases) from 54 studies were included. The correlation between PHQ-8 and PHQ-9 scores was 0.996 (95 confidence interval 0.996 to 0.996). The standard cutoff score of 10 for the PHQ-9 maximized sensitivity + specificity for the PHQ-8 among studies that used a semi-structured diagnostic interview reference standard (N = 27). At cutoff 10, the PHQ-8 was less sensitive by 0.02 (-0.06 to 0.00) and more specific by 0.01 (0.00 to 0.01) among those studies (N = 27), with similar results for studies that used other types of interviews (N = 27). For all 54 primary studies combined, across all cutoffs, the PHQ-8 was less sensitive than the PHQ-9 by 0.00 to 0.05 (0.03 at cutoff 10), and specificity was within 0.01 for all cutoffs (0.00 to 0.01).Conclusions PHQ-8 and PHQ-9 total scores were similar. Sensitivity may be minimally reduced with the PHQ-8, but specificity is similar. Copyright © Cambridge University Press 2019

    Probability of major depression classification based on the SCID, CIDI, and MINI diagnostic interviews: A synthesis of three individual participant data meta-analyses

    No full text
    Introduction: Three previous individual participant data meta-analyses (IPDMAs) reported that, compared to the Structured Clinical Interview for the DSM (SCID), alternative reference standards, primarily the Composite International Diagnostic Interview (CIDI) and the Mini International Neuropsychiatric Interview (MINI), tended to misclassify major depression status, when controlling for depression symptom severity. However, there was an important lack of precision in the results. Objective: To compare the odds of the major depression classification based on the SCID, CIDI, and MINI. Methods: We included and standardized data from 3 IPDMA databases. For each IPDMA, separately, we fitted binomial generalized linear mixed models to compare the adjusted odds ratios (aORs) of major depression classification, controlling for symptom severity and characteristics of participants, and the interaction between interview and symptom severity. Next, we synthesized results using a DerSimonian-Laird random-effects meta-analysis. Results: In total, 69,405 participants (7,574 [11%] with major depression) from 212 studies were included. Controlling for symptom severity and participant characteristics, the MINI (74 studies; 25,749 participants) classified major depression more often than the SCID (108 studies; 21,953 participants; aOR 1.46; 95% confidence interval [CI] 1.11-1.92]). Classification odds for the CIDI (30 studies; 21,703 participants) and the SCID did not differ overall (aOR 1.19; 95% CI 0.79-1.75); however, as screening scores increased, the aOR increased less for the CIDI than the SCID (interaction aOR 0.64; 95% CI 0.52-0.80). Conclusions: Compared to the SCID, the MINI classified major depression more often. The odds of the depression classification with the CIDI increased less as symptom levels increased. Interpretation of research that uses diagnostic interviews to classify depression should consider the interview characteristics. © 202
    corecore