36 research outputs found

    Assessing the equivalence of web-based and paper-and-pencil questionnaires using differential item and test functioning (DIF and DTF) analysis:A case of the Four-Dimensional Symptom Questionnaire (4DSQ)

    Get PDF
    Purpose:  Many paper-and-pencil (P&P) questionnaires have been migrated to electronic platforms. Differential item and test functioning (DIF and DTF) analysis constitutes a superior research design to assess measurement equivalence across modes of administration. The purpose of this study was to demonstrate an item response theory (IRT)-based DIF and DTF analysis to assess the measurement equivalence of a Web-based version and the original P&P format of the Four-Dimensional Symptom Questionnaire (4DSQ), measuring distress, depression, anxiety, and somatization. Methods:  The P&P group (n=2031) and the Web group (n=958) consisted of primary care psychology clients. Unidimensionality and local independence of the 4DSQ scales were examined using IRT and Yen’s Q3. Bifactor modeling was used to assess the scales’ essential unidimensionality. Measurement equivalence was assessed using IRT-based DIF analysis using a 3-stage approach: linking on the latent mean and variance, selection of anchor items, and DIF testing using the Wald test. DTF was evaluated by comparing expected scale scores as a function of the latent trait. Results:  The 4DSQ scales proved to be essentially unidimensional in both modalities. Five items, belonging to the distress and somatization scales, displayed small amounts of DIF. DTF analysis revealed that the impact of DIF on the scale levelwas negligible. Conclusions:  IRT-based DIF and DTF analysis is demonstrated as a way to assess the equivalence of Web-based and P&P questionnaire modalities. Data obtained with the Web-based 4DSQ are equivalent to data obtained with the P&P version

    The aesthetic items scale: A tool for the evaluation of aesthetic outcome after breast reconstruction

    Get PDF
    __Background:__ Valid tools to assess aesthetic outcomes after breast reconstructive surgery are scarce. Previously a professional aesthetic assessment scale was introduced, the Aesthetic Items Scale (AIS). We aim to determine if this method is a valid and reliable tool to assess aesthetic outcome after breast reconstructive surgery. __Methods:__ The study population was consenting women who underwent prophylactic mastectomy with subsequent implant-based breast reconstruction. The aesthetic outcome with regard to breast volume, shape, symmetry, scars, and nipple areola complex was rated on a 5-point scale using standardized photographs to give a summed total score. Photographs were evaluated by the patient, 5 plastic surgeons, and 3 mammography nurses. An overall rating of aesthetic outcome on a 1-10 scale was given separately. We determined the intraclass correlation coefficient and assessed interobserver agreement. To assess validity, we calculated the correlation between total score and overall rating of aesthetic outcome. __Results:__ Interobserver reliability was highest between plastic surgeons for the subitem and overall scores and ranged between 0.56 and 0.82. The summed score of the AIS correlates strongly with the overall rating in professionals but not in patients. __Conclusions:__ The AIS is a valid and reliable method for evaluating aesthetic outcome of breast reconstruction by plastic surgeons. The results indicate that patients judge aesthetic outcome differently, taking into account factors that are not represented in the AIS. Professionals can use this method to evaluate surgical results, but other measurements are needed to map satisfaction of the patient with her breasts

    High and Intensive Care in Psychiatry: Validating the HIC Monitor as a Tool for Assessing the Quality of Psychiatric Intensive Care Units

    Get PDF
    This study aims to validate the HIC monitor as a model-fidelity scale to the High and Intensive Care (HIC) model, a recently developed model for acute psychiatric wards. To assess the psychometric properties of the HIC monitor, 37 audits were held on closed inpatient wards at 20 psychiatric hospitals in the Netherlands. Interrater reliability, construct validity and content validity were examined. Our results suggest that the HIC monitor has good psychometric properties. It can be used as a tool for assessing the implementation of the HIC model on acute psychiatric wards in the Netherlands, and for quality assessment and improvement

    Clinical Usefulness of Tools to Support Decision-making for Palliative Treatment of Metastatic Colorectal Cancer: A Systematic Review

    Get PDF
    A systematic review of the literature was performed to provide a comprehensive overview of the available decision support tools for incurable metastatic colorectal cancer, and to assess their clinical usefulness. We identified 14 tools. The evidence regarding the quality of the information they provide is too limited to currently recommend their use to guide treatment decision-making. Background: Decision-making regarding palliative treatment for patients with metastatic colorectal cancer (mCRC) is complex and comprises numerous decisions. Decision-making should be guided by the premise of maintaining and/or improving patients' quality of life, by patient preference, and by the trade-off between treatment benefits and harm. Decision support systems (DSSs) for clinicians (eg, nomograms) can assist in this process. The present systematic review aimed to pro

    Longitudinal associations between physical load and chronic low back pain in the general population: the Doetichem Cohort Study

    No full text
    Study Design. Prospective cohort study. Objective. We explored long-term associations between physical load exposure and chronic low back pain (LBP) using data from an ongoing population-based cohort study. Summary of Background Data. Physical load in work or daily life is often studied in relation to LBP. Most studies are cross-sectional or have a limited follow-up. Methods. Between 1993 and 2007, 4738 men and women aged 25 to 64 years were measured maximal 3 times with 5-year intervals. Physical load in daily activities (9 items, e. g., awkward postures, mechanical vibration) was assessed by questionnaire at 2 measurements and chronic LBP 3 times. Physical load exposure at both measurements was described. Multivariable logistic regression analyses were adjusted for sex, age, education, work status, body mass index, physical activity, and smoking. Results. Despite stable prevalence rates of physical load exposure, about 50% of the participants exposed changed their exposure during a 5-year period. 7.2% of the participants reported awkward postures at 2 measurements, 8.4% at the first measurement only, and 6.8% at the second measurement only. Among all physical load variables, associations with chronic LBP were found only for awkward postures. An increased risk for incident chronic LBP was found in participants exposed twice to awkward postures. In contrast, only single exposure to awkward postures was associated with persistence of chronic LBP. Conclusion. Awkward postures were associated with chronic LBP in the general population. Exposure to awkward postures at 2 measurements with 5 years in between did increase the risk for incident chronic LBP, but not for persistence of chronic LBP

    [Measurement of patient safety: a systematic review of the reliability and validity of adverse event detection with record review]

    No full text
    Item does not contain fulltextOBJECTIVES: Record review is the most used method to quantify patient safety. We systematically reviewed the reliability and validity of adverse event detection with record review. DESIGN: A systematic review of the literature. METHODS: We searched PubMed, EMBASE, CINAHL, PsycINFO and the Cochrane Library and from their inception through February 2015. We included all studies that aimed to describe the reliability and/or validity of record review. Two reviewers conducted data extraction. We pooled kappa values (kappa) and analysed the differences in subgroups according to number of reviewers, reviewer experience and training level, adjusted for the prevalence of adverse events. RESULTS: In 25 studies, the psychometric data of the Global Trigger Tool (GTT) and the Harvard Medical Practice Study (HMPS) were reported and 24 studies were included for statistical pooling. The inter-raterreliability of the GTT and HMPS showed a pooled kappa of 0.65 and 0.55, respectively. The inter-rater agreement was statistically significantly higher when the group of reviewers within a study consisted of a maximum five reviewers. We found no studies reporting on the validity of the GTT and HMPS. CONCLUSIONS: The reliability of record review is moderate to substantial and improved when a small group of reviewers carried out record review. The validity of the record review method has never been evaluated, while clinical data registries, autopsy or direct observations of patient care are methods that can be used to test concurrent validity

    Injection therapy for subacute and chronic low back pain: an updated Cochrane review.

    No full text
    Item does not contain fulltextSTUDY DESIGN: A systematic review of randomized controlled trials (RCTs). OBJECTIVE: To determine if injection therapy is more effective than placebo or other treatments for patients with subacute or chronic low back pain. SUMMARY OF BACKGROUND DATA: The effectiveness of injection therapy for low back pain is still debatable. Heterogeneity of target tissue, pharmacological agent, and dosage, generally found in RCTs, point to the need for clinically valid comparisons in a literature synthesis. METHODS: We updated the search of the earlier systematic review and searched the Cochrane Central Register of Controlled Trials, MEDLINE, and EMBASE databases up to March 2007 for relevant trials reported in English, French, German, Dutch, and Nordic languages. We also screened references from trials identified. RCTs on the effects of injection therapy involving epidural, facet, or local sites for subacute or chronic low back pain were included. Studies that compared the effects of intradiscal injections, prolotherapy, or ozone therapy with other treatments were excluded unless injection therapy with another pharmaceutical agent (no placebo treatment) was part of one of the treatment arms. Studies about injections in sacroiliac joints and studies evaluating the effects of epidural steroids for radicular pain were also excluded. RESULTS: Eighteen trials (1179 participants) were included in this review. The injection sites varied from epidural sites and facet joints (i.e. intra-articular injections, peri-articular injections and nerve blocks) to local sites (i.e. tender-and trigger points). The drugs that were studied consisted of corticosteroids, local anesthetics, and a variety of other drugs. The methodologic quality of the trials was limited with 10 of 18 trials rated as having a high methodologic quality. Statistical pooling was not possible because of clinical heterogeneity in the trials. Overall, the results indicated that there is no strong evidence for or against the use of any type of injection therapy. CONCLUSION: There is insufficient evidence to support the use of injection therapy in subacute and chronic low-back pain. However, it cannot be ruled out that specific subgroups of patients may respond to a specific type of injection therapy

    Passive imputation and parcel summaries are both valid to handle missing items in studies with many multi-item scales

    No full text
    Previous studies showed that missing data in multi-item scales can best be handled by multiple imputation of item scores. However, when many scales are used, the number of items will become too large for the imputation model to reliably estimate imputations. A solution is to use passive imputation or a parcel summary score that combine and consequently reduce the number of variables in the imputation model. The performance of these methods was evaluated in a simulation study and illustrated in an example. Passive imputation, which updated scale scores from imputed items, and parcel summary scores that use the average over available item scores were compared to using all items simultaneously, imputing total scores of scales and complete-case analysis. Scale scores and coefficient estimates from linear regression were compared to "true" parameters on bias and precision. Passive imputation and using parcel summaries showed smaller bias and more precision than imputing total scores and complete-case analyses. Passive imputation or using parcel summary scores are valid missing data solutions in studies that include many multi-item scales

    From TENDUM to SPICOS, or: how flexible is the TENDUM approach to question answering?

    No full text
    It has generally been thought that it would not take too much effort to change or extend the input language and the subject domain of TENDUM. Now that TENDUM is used to form the 'linguistic' core of the SPICOS system, we are in a position to test this idea of flexibility. After a very brief outline of TENDUM we will first analyse the claim; subsequently , we will inquire to what extent it has been confirmed by the SPICOS experience . By way of illustration , imperative constructions ('commands') are dealt with in some detail

    Family physicians' diagnostic gut feelings are measurable: construct validation of a questionnaire

    Get PDF
    Contains fulltext : 125453.pdf (publisher's version ) (Open Access)BACKGROUND: Family physicians perceive that gut feelings, i.e. a 'sense of reassurance' or a 'sense of alarm', play a substantial role in diagnostic reasoning. A measuring instrument is desirable for further research. Our objective is to validate a questionnaire measuring the presence of gut feelings in diagnostic reasoning. METHODS: We constructed 16 case vignettes from real practice situations and used the accompanying 'sense of reassurance' or the 'sense of alarm' as reference labels. Based on the results of an initial study (26 family physicians), we divided the case vignettes into a group involving a clear role for the sense of reassurance or the sense of alarm and a group involving an ambiguous role. 49 experienced family physicians evaluated each 10 vignettes using the questionnaire. Construct validity was assessed by testing hypotheses and an internal consistency procedure was performed. RESULTS: As hypothesized we found that the correlations between the reference labels and corresponding items were high for the clear-case vignettes (0.59 - 0.72) and low for the ambiguous-case vignettes (0.08 - 0.23). The agreement between the classification in clear sense of reassurance, clear sense of alarm and ambiguous case vignettes as derived from the initial study and the study population's judgments was substantial (Kappa = 0.62). Factor analysis showed one factor with opposites for sense of reassurance and sense of alarm items. The questionnaire's internal consistency was high (0.91). We provided a linguistic validated English-language text of the questionnaire. CONCLUSIONS: The questionnaire appears to be valid. It enables quantitative research into the role of gut feelings and their diagnostic value in family physicians' diagnostic reasoning
    corecore