78 research outputs found

    Interval estimation and optimal design for the within-subject coefficient of variation for continuous and binary variables

    BACKGROUND: In this paper we propose the use of the within-subject coefficient of variation as an index of a measurement's reliability. For continuous variables, based on its maximum likelihood estimation, we derive a variance-stabilizing transformation and discuss confidence interval construction within the framework of a one-way random effects model. We investigate sample size requirements for the within-subject coefficient of variation for continuous and binary variables. METHODS: We investigate the validity of the approximate normal confidence interval by Monte Carlo simulations. In designing a reliability study, a crucial issue is the balance between the number of subjects to be recruited and the number of repeated measurements per subject. We discuss efficiency of estimation and cost considerations for the optimal allocation of the sample resources. The approach is illustrated by an example on Magnetic Resonance Imaging (MRI). We also discuss the issue of sample size estimation for dichotomous responses with two examples. RESULTS: For the continuous variable, we found that the variance-stabilizing transformation improves the asymptotic coverage probabilities of the confidence interval for the within-subject coefficient of variation. For the binary variable, the maximum likelihood estimation and the sample size estimation based on a pre-specified confidence interval width are novel contributions to the literature. CONCLUSION: Using the sample size formulas, we hope to help clinical epidemiologists and practicing statisticians to efficiently design reliability studies using the within-subject coefficient of variation, whether the variable of interest is continuous or binary.
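
The within-subject coefficient of variation described in this abstract can be illustrated with a minimal sketch. It assumes the usual one-way random effects setup (within-subject standard deviation divided by the overall mean) and uses the ANOVA within-subject mean square, which is not necessarily the authors' exact estimator; the function name and the toy data are hypothetical.

```python
import math

def within_subject_cv(data):
    """Estimate the within-subject CV from repeated measurements.

    data: list of per-subject lists of repeated measurements.
    Assumption: sigma_e is estimated by the square root of the
    within-subject mean square and mu by the grand mean.
    """
    n_obs = sum(len(row) for row in data)
    grand_mean = sum(sum(row) for row in data) / n_obs

    # Pool squared deviations of each measurement from its subject mean.
    ss_within = 0.0
    for row in data:
        subject_mean = sum(row) / len(row)
        ss_within += sum((x - subject_mean) ** 2 for x in row)

    # Degrees of freedom: total observations minus number of subjects.
    ms_within = ss_within / (n_obs - len(data))
    return math.sqrt(ms_within) / grand_mean

# Hypothetical data: three subjects, two repeated measurements each.
example = [[10.0, 12.0], [20.0, 22.0], [30.0, 28.0]]
wscv = within_subject_cv(example)
```

A smaller value indicates that repeated measurements on the same subject vary little relative to the overall level of the measurement.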

    Comparison of two dependent within subject coefficients of variation to evaluate the reproducibility of measurement devices

    Background: The within-subject coefficient of variation and intra-class correlation coefficient are commonly used to assess the reliability or reproducibility of interval-scale measurements. Comparison of reproducibility or reliability of measurement devices or methods on the same set of subjects comes down to comparison of dependent reliability or reproducibility parameters. Methods: In this paper, we develop several procedures for testing the equality of two dependent within-subject coefficients of variation computed from the same sample of subjects, which, to the best of our knowledge, has not yet been dealt with in the statistical literature. The Wald test, the likelihood ratio test, and the score test are developed. A simple regression procedure based on results due to Pitman and Morgan is constructed. Furthermore, we evaluate the statistical properties of these methods via extensive Monte Carlo simulations. The methodologies are illustrated on two data sets. The first consists of microarray gene expressions measured by two platforms, the Affymetrix and the Amersham. Because microarray experiments produce expressions for a large number of genes, one would expect the statistical tests to be asymptotically equivalent. To explore the behaviour of the tests in small or moderate sample sizes, we illustrated the methodologies on data from computer-aided tomographic scans of 50 patients. Results: It is shown that the relatively simple Wald's test (WT) is as powerful as the likelihood ratio test (LRT) and that both have consistently greater power than the score test. The regression test holds its empirical levels, and on some occasions is as powerful as the WT and the LRT. Conclusion: A comparison between the reproducibility of two measuring instruments using the same set of subjects leads naturally to a comparison of two correlated indices. The presented methodology overcomes the difficulty noted by data analysts that dependence between datasets would confound any inferences one could make about the differences in measures of reliability and reproducibility. The statistical tests presented in this paper have good properties in terms of statistical power.

    Cross-cultural adaptation and validation of the “spinal cord injury-falls concern scale” in the Italian population

    Study design: Psychometrics study. Objective: The objective of this study was to develop an Italian version of the Spinal Cord Injury-Falls Concern Scale (SCI-FCS) and examine its reliability and validity. Setting: Multicenter study in spinal units in Northern and Southern Italy. The scale was also administered to non-hospitalized outpatient clinic patients. Methods: The original scale was translated from English to Italian using the “Translation and Cultural Adaptation of Patient-Reported Outcomes Measures” guidelines. The reliability and validity of the culturally adapted scale were assessed following the “Consensus-Based Standards for the Selection of Health Status Measurement Instruments” checklist. The internal consistency of the SCI-FCS-I was examined using Cronbach’s alpha coefficient, and its inter-rater and intra-rater reliability using the intraclass correlation coefficient. Concurrent validity was evaluated using Pearson’s correlation coefficient with the Italian version of the short form of the Wheelchair Use Confidence Scale for Manual Wheelchair Users (WheelCon-M-I-short form). Results: The SCI-FCS-I was administered to 124 participants from 1 June to 30 September 2017. The mean ± SD of the SCI-FCS-I score was 16.73 ± 5.88. All SCI-FCS-I items were either identical or similar in meaning to the original version’s items. Cronbach’s α was 0.827 (p < 0.01), the inter-rater reliability was 0.972 (p < 0.01), and the intra-rater reliability was 0.973 (p < 0.01). Pearson’s correlation coefficient of the SCI-FCS-I scores with the WheelCon-M-I-short form was 0.56 (p < 0.01). Conclusions: The SCI-FCS-I was found to be a reliable and valid outcome measure for assessing manual wheelchair users' concerns about falling in the Italian population.
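
The internal-consistency statistic reported in this abstract, Cronbach's alpha, can be sketched from its textbook definition. This is an illustrative computation only; the study's data are not available here, so the function name and the toy scores are hypothetical.

```python
from statistics import variance

def cronbach_alpha(scores):
    """Cronbach's alpha for a respondents-by-items score table.

    scores: list of respondents, each a list of k item scores.
    Uses alpha = k/(k-1) * (1 - sum(item variances) / variance(total score)),
    with sample variances (n - 1 denominator).
    """
    k = len(scores[0])
    # Variance of each item across respondents.
    item_vars = [variance([row[j] for row in scores]) for j in range(k)]
    # Variance of each respondent's total score.
    total_var = variance([sum(row) for row in scores])
    return k / (k - 1) * (1 - sum(item_vars) / total_var)

# Hypothetical toy data: three respondents, two items.
alpha = cronbach_alpha([[1, 1], [2, 3], [3, 2]])
```

Values close to 1 indicate that the items vary together, which is how a figure such as the 0.827 reported above is usually read.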

    Non-Invasive Measurement of Hemoglobin: Assessment of Two Different Point-of-Care Technologies

    Measurement of blood hemoglobin (Hb) concentration is a routine procedure. Using a non-invasive point-of-care device reduces pain and discomfort for the patient and allows time saving in patient care. The aims of the present study were to assess the concordance of Hb levels obtained non-invasively with the Pronto-7 monitor (version 2.1.9, Masimo Corporation, Irvine, USA) or with the NBM-200MP monitor (Orsense, Nes Ziona, Israel) and the values obtained from the usual colorimetric method using blood samples and to determine the source of discordance. We conducted two consecutive prospective open trials enrolling patients presenting in the emergency department of a university hospital. The first was designed to assess Pronto-7™ and the second NBM-200MP™. In each study, the main outcome measure was the agreement between both methods. Independent factors associated with the bias were determined using multiple linear regression. Three hundred patients were prospectively enrolled in each study. For Pronto-7™, the absolute mean difference was 0.56 g.L(-1) (95% confidence interval [CI] 0.41 to 0.69) with an upper agreement limit at 2.94 g.L(-1) (95% CI [2.70;3.19]), a lower agreement limit at -1.84 g.L(-1) (95% CI [-2.08;-1.58]) and an intra-class correlation coefficient at 0.80 (95% CI [0.74;0.84]). The corresponding values for the NBM-200MP™ were 0.21 [0.02;0.39], 3.42 [3.10;3.74], -3.01 [-3.32;-2.69] and 0.69 [0.62;0.75]. Multivariate analysis showed that age and laboratory values of hemoglobin were independently associated with the bias when using Pronto-7™, while perfusion index and laboratory value of hemoglobin were independently associated with the bias when using NBM-200MP™. Despite a relatively limited bias in both cases, the large limits of agreement found in both cases render the clinical usefulness of such devices debatable. For both devices, the bias is independently and inversely associated with the true value of hemoglobin. ClinicalTrials.gov NCT01321580 and NCT01321593
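
The bias and agreement limits quoted in this abstract follow the usual Bland-Altman construction, sketched below under two assumptions: differences are taken as device minus laboratory value, and the limits are bias ± 1.96 standard deviations of the differences. The function name and sample readings are hypothetical, not data from the study.

```python
from statistics import mean, stdev

def bland_altman(device, reference):
    """Return (bias, lower limit, upper limit) of agreement.

    Assumption: difference = device reading - reference reading,
    limits of agreement = bias +/- 1.96 * SD of the differences.
    """
    diffs = [d - r for d, r in zip(device, reference)]
    bias = mean(diffs)
    sd = stdev(diffs)  # sample standard deviation (n - 1 denominator)
    return bias, bias - 1.96 * sd, bias + 1.96 * sd

# Hypothetical paired hemoglobin readings (same unit for both methods).
device = [13.0, 14.5, 12.0, 15.5]
laboratory = [12.0, 14.0, 12.5, 14.0]
bias, lower, upper = bland_altman(device, laboratory)
```

A small bias with wide limits, as reported above, means the methods agree on average but can differ substantially for an individual patient.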

    Frequency of GP communication addressing the patient's resources and coping strategies in medical interviews: a video-based observational study

    Background: There is increasing focus on patient-centred communicative approaches in medical consultations, but few studies have shown the extent to which patients' positive coping strategies and psychological assets are addressed by general practitioners (GPs) on a regular day at the office. This study measures the frequency of GPs' use of questions and comments addressing their patients' coping strategies or resources. Methods: Twenty-four GPs were video-recorded in 145 consultations. The consultations were coded using a modified version of the Roter Interaction Analysis System. In this study, we also developed four additional coding categories based on cognitive therapy and solution-focused therapy: attribution, resources, coping, and solution-focused techniques. The reliability between coders was established, a factor analysis was applied to test the relationship between the communication categories, and a tentative validating exercise was performed by reversed coding. Results: Cohen's kappa was 0.52 between coders. Only 2% of the utterances could be categorized as resource or coping oriented. Six GPs contributed 59% of these utterances. The factor analysis identified two factors, one task oriented and one patient oriented. Conclusion: The frequency of communication about coping and resources was very low. Communication skills training for GPs in this field is required. Further validating studies of this kind of measurement tool are warranted.
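
The inter-coder agreement statistic named in this abstract, Cohen's kappa, corrects raw agreement for the agreement expected by chance. The sketch below uses the standard unweighted definition; the function name, category labels and codings are hypothetical and do not reproduce the study's coding scheme.

```python
from collections import Counter

def cohens_kappa(coder_a, coder_b):
    """Unweighted Cohen's kappa for two coders labelling the same items.

    kappa = (p_observed - p_expected) / (1 - p_expected), where
    p_expected comes from the product of each coder's marginal
    category frequencies.
    """
    n = len(coder_a)
    observed = sum(a == b for a, b in zip(coder_a, coder_b)) / n
    freq_a = Counter(coder_a)
    freq_b = Counter(coder_b)
    # A Counter returns 0 for categories the other coder never used.
    expected = sum(freq_a[c] * freq_b[c] for c in freq_a) / (n * n)
    return (observed - expected) / (1 - expected)

# Hypothetical codings of four utterances by two coders.
coder_1 = ["task", "task", "coping", "task"]
coder_2 = ["task", "coping", "coping", "task"]
kappa = cohens_kappa(coder_1, coder_2)
```

In this toy case the coders agree on three of four utterances, but half of that agreement is expected by chance, so kappa is well below the raw agreement rate.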

    Reproducibility of 3-dimensional ultrasound readings of volume of carotid atherosclerotic plaque

    Background: Non-invasive 3-dimensional (3D) ultrasound (US) has emerged as the predominant approach for evaluating the progression of carotid atherosclerosis and its response to treatment. The aim of this study was to investigate the quality of a central reading procedure concerning plaque volume (PV), measured by 3D US in a multinational US trial. Methods: Two data sets of 45 and 60 3D US patient images of plaques (mean PV, 71.8 and 39.8 μl, respectively) were used. PV was assessed by means of manual planimetry. The intraclass correlation coefficient (ICC) was applied to determine reader variabilities. The repeatability coefficient (RC) and the coefficient of variation (CV) were used to investigate the effect of number of slices (S) in manual planimetry and plaque size on measurement variability. Results: Intra-reader variability was small as reflected by ICCs of 0.985, 0.967 and 0.969 for 3 appointed readers. The ICC value generated between the 3 readers was 0.964, indicating that inter-reader variability was small, too. Subgroup analyses showed that both intra- and inter-reader variabilities were lower for larger than for smaller plaques. Mean CVs were similar for the 5S- and 10S-methods with a RC of 4.7 μl. The RC between both methods as well as the CVs were comparatively lower for larger plaques. Conclusion: By implementing standardised central 3D US reading protocols and strict quality control procedures, highly reliable ultrasonic re-readings of plaque images can be achieved in large multicentre trials.

    Testing for heterogeneity among the components of a binary composite outcome in a clinical trial

    Background: Investigators designing clinical trials often use composite outcomes to overcome many statistical issues. Trialists want to maximize power to show a statistically significant treatment effect and avoid inflation of Type I error rate due to evaluation of multiple individual clinical outcomes. However, if the treatment effect is not similar among the components of this composite outcome, we are left not knowing how to interpret the treatment effect on the composite itself. Given significant heterogeneity among these components, a composite outcome may be judged as being invalid or uninterpretable for estimation of the treatment effect. This paper compares the power of different tests to detect heterogeneity of treatment effect across components of a composite binary outcome. Methods: Simulations were done comparing four different models commonly used to analyze correlated binary data. These models included: logistic regression ignoring correlation, logistic regression weighted by the intra-cluster correlation coefficient, population average logistic regression using generalized estimating equations (GEE), and random effects logistic regression. Results: We found that the population average model based on generalized estimating equations (GEE) had the greatest power across most scenarios. Adequate power to detect possible composite heterogeneity or variation between treatment effects of individual components of a composite outcome was seen when the power for detecting the main study treatment effect for the composite outcome was also reasonably high. Conclusions: It is recommended that authors report tests of composite heterogeneity for composite outcomes and that this accompany the publication of the statistically significant results of the main effect on the composite along with individual components of composite outcomes.

    Reproducibility and day time bias correction of optoelectronic leg volumetry: a prospective cohort study

    Background: Leg edema is a common manifestation of various underlying pathologies. Reliable measurement tools are required to quantify edema and monitor therapeutic interventions. The aim of the present work was to investigate the reproducibility of optoelectronic leg volumetry over a 3-week period and to eliminate daytime-related within-individual variability. Methods: Optoelectronic leg volumetry was performed in 63 hairdressers (mean age 45 ± 16 years, 85.7% female) in standing position twice within a minute for each leg and repeated after 3 weeks. Both lower leg (legBD) and whole limb (limbBF) volumetry were analysed. Reproducibility was expressed as analytical and within-individual coefficients of variation (CVA, CVW), and as intra-class correlation coefficients (ICC). Results: A total of 492 leg volume measurements were analysed. Both legBD and limbBF volumetry were highly reproducible, with CVA of 0.5% and 0.7%, respectively. Within-individual reproducibility of legBD and limbBF volumetry over a 3-week period was high (CVW 1.3% for both; ICC 0.99 for both). At both visits, the second measurement revealed a significantly higher volume than the first, with a mean increase of 7.3 ml ± 14.1 ml (0.33% ± 0.58%) for legBD and 30.1 ml ± 48.5 ml (0.52% ± 0.79%) for limbBF volume. A significant linear correlation was found between absolute and relative leg volume differences and the difference in exact time of day of measurement between the two study visits (P < .001). A time-correction formula derived from this correlation permitted further improvement of CVW. Conclusions: Leg volume changes can be reliably assessed by optoelectronic leg volumetry at a single time point and over a 3-week period. However, volumetry results are biased by orthostatic and daytime-related volume changes. The bias from daytime-related volume changes can be minimized by a time-correction formula.

    The Development and Validation of the Thai-Translated Irrational Performance Beliefs Inventory (T-iPBI)

    © 2018, Springer Science+Business Media, LLC, part of Springer Nature. One of the most commonly employed cognitive-behavioural approaches to psychotherapy is rational-emotive behaviour therapy, but researchers have been troubled by some of the limitations of irrational beliefs psychometrics. As a result, Turner et al. (Eur J Psychol Assess 34:174–180, 2018a. https://doi.org/10.1027/1015-5759/a000314) developed the Irrational Performance Beliefs Inventory (iPBI), a novel measure of irrational beliefs for use within performance domains. However, the linguistic and cross-cultural adaptation of the iPBI into other languages is necessary for its multinational and multicultural use. The purpose of this paper is to develop the Thai-translated version of the iPBI (T-iPBI) and examine the validity and reliability of the T-iPBI. Data retrieved from 166 participants were analysed using SPSS and AMOS software packages. Thirty-three participants completed two follow-up T-iPBI measurements (1- and 3-week repeat assessment). After the linguistic and cross-cultural adaptation processes, the T-iPBI demonstrated excellent levels of reliability, with internal consistency and test–retest reliability, as well as construct, concurrent, and predictive validity. The current findings indicate that the 20-item T-iPBI can be used as a self-assessment instrument to evaluate individuals' irrational performance beliefs in a Thai population. We also highlight the implications of this study and suggest a variety of future research directions that stem from the results.

    Reliability of Therapist Effects in Practice-Based Psychotherapy Research : A Guide for the Planning of Future Studies

    This paper aims to provide researchers with practical information on sample sizes for accurate estimations of therapist effects (TEs). The investigations are based on an integrated sample of 48,648 patients treated by 1800 therapists. Multilevel modeling and resampling were used to realize varying sample size conditions to generate empirical estimates of TEs. Sample size tables, including varying sample size conditions, were constructed and study examples given. This study gives an insight into the potential size of the TE and provides researchers with a practical guide to aid the planning of future studies in this field