Comparison of two dependent within subject coefficients of variation to evaluate the reproducibility of measurement devices

Author: A Donner
A Stuart
A-K Jarvinen
AC Davison
Allan Donner
B Giraudeau
CL Yauk
Dilek Colak
DR Cox
E Bradley
EJG Pitman
G Atkinson
G Dunn
H Quan
H Wang
J Cohen
J Fleiss
J Neyman
L Lin
L Shi
L Tian
LI Lin
M Blodeau
M Shoukri
MM Shoukri
Mohamed M Shoukri
N Draper
Namik Kaya
OC Ukomunne
PK Tan
R Landis
RA Irizarry
RC Gupta
RS Searle
S Weerahandi
SW Turner
W Morgan
WK Fung
WP Kuo
Publication venue: BioMed Central
Publication date: 01/01/2008

Background: The within-subject coefficient of variation and the intra-class correlation coefficient are commonly used to assess the reliability or reproducibility of interval-scale measurements. Comparing the reproducibility or reliability of measurement devices or methods on the same set of subjects comes down to comparing dependent reliability or reproducibility parameters. Methods: In this paper, we develop several procedures for testing the equality of two dependent within-subject coefficients of variation computed from the same sample of subjects, a problem which, to the best of our knowledge, has not yet been dealt with in the statistical literature. The Wald test, the likelihood ratio test, and the score test are developed. A simple regression procedure based on results due to Pitman and Morgan is constructed. Furthermore, we evaluate the statistical properties of these methods via extensive Monte Carlo simulations. The methodologies are illustrated on two data sets; the first consists of microarray gene expressions measured by two platforms, the Affymetrix and the Amersham. Because microarray experiments produce expressions for a large number of genes, one would expect the statistical tests to be asymptotically equivalent. To explore the behaviour of the tests in small or moderate sample sizes, we illustrated the methodologies on data from computer-aided tomographic scans of 50 patients. Results: It is shown that the relatively simple Wald test (WT) is as powerful as the likelihood ratio test (LRT) and that both have consistently greater power than the score test. The regression test holds its empirical levels, and on some occasions is as powerful as the WT and the LRT. Conclusion: A comparison between the reproducibility of two measuring instruments using the same set of subjects leads naturally to a comparison of two correlated indices. The presented methodology overcomes the difficulty noted by data analysts that dependence between datasets would confound any inferences one could make about the differences in measures of reliability and reproducibility. The statistical tests presented in this paper have good properties in terms of statistical power.
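As a rough illustration of the quantity being compared in the abstract above, the sketch below computes a within-subject coefficient of variation for a single device. It assumes the common one-way ANOVA estimator (square root of the pooled within-subject mean square, divided by the grand mean); the function name and the toy readings are hypothetical, and the paper's own estimators and its Wald, likelihood ratio, score, and regression tests are not reproduced here.

```python
from math import sqrt


def within_subject_cv(measurements):
    """Within-subject CV from replicate readings (assumed ANOVA estimator).

    measurements: list of per-subject lists, each holding that subject's
    replicate readings from one device.
    """
    n_total = sum(len(subject) for subject in measurements)
    grand_mean = sum(sum(subject) for subject in measurements) / n_total

    # Pooled sum of squared deviations of each reading from its subject mean.
    ss_within = 0.0
    for subject in measurements:
        subject_mean = sum(subject) / len(subject)
        ss_within += sum((x - subject_mean) ** 2 for x in subject)

    # Within-subject mean square: degrees of freedom = readings - subjects.
    ms_within = ss_within / (n_total - len(measurements))
    return sqrt(ms_within) / grand_mean


# Hypothetical data: three subjects, two replicate readings each.
device_a = [[10.0, 12.0], [20.0, 22.0], [30.0, 32.0]]
wscv_a = within_subject_cv(device_a)
```

Computing this value separately for two devices measured on the same subjects gives the two dependent estimates whose equality the abstract's tests address.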

Scholarship@Western

Cross-cultural adaptation and validation of the “spinal cord injury-falls concern scale” in the Italian population

Author: A Citterio
AL Nelson
Anna Berardi
BM Sakakibara
CL Boswell-Ruys
D Gavin-Dreschnack
D Wild
Donatella Valente
E Butler Forslund
G Galeoto
Giovanni Galeoto
JC Nunnally
K Berg
KS Roaldsen
L Yardley
LB Mokkink
Maria Auxiliadora Marquez
Martina Antonacci
MC Pagliacci
MJ DeVivo
MM Shoukri
PW Rushton
R Vliet Van
RG Cumming
Rita De Santis
RL Kirby
RP Gaal
SC Kirshblum
TV Perneger
V Jørgensen
Valter Santilli
Viviana Ammendola
Publication venue: Springer Science and Business Media LLC
Publication date: 01/01/2018

Study design: Psychometrics study. Objective: The objective of this study was to develop an Italian version of the Spinal Cord Injury-Falls Concern Scale (SCI-FCS) and examine its reliability and validity. Setting: Multicenter study in spinal units in Northern and Southern Italy. The scale was also administered to non-hospitalized outpatient clinic patients. Methods: The original scale was translated from English to Italian using the “Translation and Cultural Adaptation of Patient-Reported Outcomes Measures” guidelines. The reliability and validity of the culturally adapted scale were assessed following the “Consensus-Based Standards for the Selection of Health Status Measurement Instruments” checklist. The internal consistency of the SCI-FCS-I was examined using Cronbach’s alpha coefficient, and its inter-rater and intra-rater reliability were examined using the intraclass correlation coefficient. Concurrent validity was evaluated using Pearson’s correlation coefficient with the Italian version of the short form of the Wheelchair Use Confidence Scale for Manual Wheelchair Users (WheelCon-M-I-short form). Results: The SCI-FCS-I was administered to 124 participants from 1 June to 30 September 2017. The mean ± SD of the SCI-FCS-I score was 16.73 ± 5.88. All SCI-FCS items were either identical or similar in meaning to the original version’s items. Cronbach’s α was 0.827 (p < 0.01), the inter-rater reliability was 0.972 (p < 0.01), and the intra-rater reliability was 0.973 (p < 0.01). Pearson’s correlation coefficient of the SCI-FCS-I scores with the WheelCon-M-I-short form was 0.56 (p < 0.01). Conclusions: The SCI-FCS-I was found to be a reliable and valid outcome measure for assessing manual wheelchair users’ concerns about falling in the Italian population.
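The internal-consistency statistic reported above can be sketched as follows. This is a minimal illustration of the standard Cronbach's alpha formula, k/(k-1) × (1 − sum of item variances / variance of total scores), using sample variances; the function name and the toy responses are hypothetical and are not taken from the study.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's alpha for a respondents-by-items score table.

    scores: list of respondents, each a list of k item scores.
    """
    k = len(scores[0])
    # Sample variance of each item across respondents.
    item_variances = [variance([row[j] for row in scores]) for j in range(k)]
    # Sample variance of each respondent's total score.
    total_variance = variance([sum(row) for row in scores])
    return k / (k - 1) * (1 - sum(item_variances) / total_variance)


# Hypothetical data: three respondents answering two items.
responses = [[1, 2], [2, 3], [3, 4]]
alpha = cronbach_alpha(responses)
```

In this toy table the two items move together perfectly, so alpha reaches its upper bound of 1; real questionnaires, like the scale described above, fall below that.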

Archivio della ricerca- Università di Roma La Sapienza

Non-Invasive Measurement of Hemoglobin: Assessment of Two Different Point-of-Care Technologies

Author: AM Lardi
C Ricos
E Gayat
Emmanuel Matthieu
Etienne Gayat
H Gehring
H Khusun
H von Schenck
JM Bland
JW Severinghaus
Jérôme Aulagnier
Marc Fischler
Mireille Boisson
MM Shoukri
MR Macknet
PM Bossuyt
RD Miller
RG Hahn
Tobias Eckle
Publication venue: Public Library of Science
Publication date: 06/01/2012

Measurement of blood hemoglobin (Hb) concentration is a routine procedure. Using a non-invasive point-of-care device reduces pain and discomfort for the patient and saves time in patient care. The aims of the present study were to assess the concordance of Hb levels obtained non-invasively with the Pronto-7 monitor (version 2.1.9, Masimo Corporation, Irvine, USA) or with the NBM-200MP monitor (Orsense, Nes Ziona, Israel) and the values obtained from the usual colorimetric method using blood samples, and to determine the source of discordance. We conducted two consecutive prospective open trials enrolling patients presenting in the emergency department of a university hospital. The first was designed to assess Pronto-7™ and the second NBM-200MP™. In each study, the main outcome measure was the agreement between the two methods. Independent factors associated with the bias were determined using multiple linear regression. Three hundred patients were prospectively enrolled in each study. For Pronto-7™, the absolute mean difference was 0.56 g·L⁻¹ (95% confidence interval [CI] 0.41 to 0.69) with an upper agreement limit at 2.94 g·L⁻¹ (95% CI [2.70;3.19]), a lower agreement limit at -1.84 g·L⁻¹ (95% CI [-2.08;-1.58]) and an intra-class correlation coefficient at 0.80 (95% CI [0.74;0.84]). The corresponding values for the NBM-200MP™ were 0.21 [0.02;0.39], 3.42 [3.10;3.74], -3.01 [-3.32;-2.69] and 0.69 [0.62;0.75]. Multivariate analysis showed that age and laboratory values of hemoglobin were independently associated with the bias when using Pronto-7™, while perfusion index and laboratory value of hemoglobin were independently associated with the bias when using NBM-200MP™. Despite a relatively limited bias in both cases, the large limits of agreement found in both cases render the clinical usefulness of such devices debatable. For both devices, the bias is independently and inversely associated with the true value of hemoglobin. Trial registration: ClinicalTrials.gov NCT01321580 and NCT01321593.
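The bias and agreement limits quoted above are the kind of quantities produced by a Bland-Altman analysis. The sketch below assumes the usual definition (mean of the paired differences, plus or minus 1.96 sample standard deviations); the function name and the toy readings are hypothetical, and the confidence intervals reported in the abstract are not computed here.

```python
from statistics import mean, stdev


def bland_altman(method_a, method_b, z=1.96):
    """Return (bias, lower limit, upper limit) for paired measurements.

    method_a, method_b: equal-length sequences of readings taken on the
    same patients by two methods.
    """
    differences = [a - b for a, b in zip(method_a, method_b)]
    bias = mean(differences)
    spread = stdev(differences)  # sample standard deviation
    return bias, bias - z * spread, bias + z * spread


# Hypothetical paired readings (device versus laboratory reference).
device = [10.0, 12.0, 14.0, 16.0]
laboratory = [9.0, 12.0, 13.0, 16.0]
bias, lower, upper = bland_altman(device, laboratory)
```

A small bias with wide limits, as in the abstract, means the methods agree on average while individual readings can still differ substantially.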

Public Library of Science (PLOS)

Frequency of GP communication addressing the patient's resources and coping strategies in medical interviews: a video-based observational study

Author: A Coulter
A Di Caccavo
A Ring
Arnstein Finset
C Charles
C Zimmermann
DL Roter
G Affleck
G Greenberg
GJ Carroll
H Boon
I Enzer
J Robinson
J Smedslund
J Svennevig
J Thistlethwaite
JM Bensing
M Stewart
MM Shoukri
N Mead
P De Jong
R Bakeman
S Keith
T Coleman
Trond A Mjaaland
Publication venue: BioMed Central
Publication date: 01/01/2009

Background: There is increasing focus on patient-centred communicative approaches in medical consultations, but few studies have shown the extent to which patients' positive coping strategies and psychological assets are addressed by general practitioners (GPs) on a regular day at the office. This study measures the frequency of GPs' use of questions and comments addressing their patients' coping strategies or resources. Methods: Twenty-four GPs were video-recorded in 145 consultations. The consultations were coded using a modified version of the Roter Interaction Analysis System. In this study, we also developed four additional coding categories based on cognitive therapy and solution-focused therapy: attribution, resources, coping, and solution-focused techniques. The reliability between coders was established, a factor analysis was applied to test the relationship between the communication categories, and a tentative validating exercise was performed by reversed coding. Results: Cohen's kappa was 0.52 between coders. Only 2% of the utterances could be categorized as resource or coping oriented. Six GPs contributed 59% of these utterances. The factor analysis identified two factors, one task oriented and one patient oriented. Conclusion: The frequency of communication about coping and resources was very low. Communication skills training for GPs in this field is required. Further validating studies of this kind of measurement tool are warranted.
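The inter-coder agreement statistic mentioned above can be illustrated with a short sketch. It assumes the standard unweighted Cohen's kappa, (observed agreement − chance agreement) / (1 − chance agreement), for two coders labelling the same utterances; the function name and the example labels are hypothetical and are not the study's data.

```python
from collections import Counter


def cohen_kappa(coder_a, coder_b):
    """Unweighted Cohen's kappa for two coders' labels on the same items."""
    n = len(coder_a)
    # Proportion of items on which the two coders chose the same label.
    observed = sum(a == b for a, b in zip(coder_a, coder_b)) / n
    # Agreement expected by chance from each coder's label frequencies.
    counts_a, counts_b = Counter(coder_a), Counter(coder_b)
    expected = sum(counts_a[label] * counts_b[label] for label in counts_a) / n ** 2
    return (observed - expected) / (1 - expected)


# Hypothetical labels for four utterances.
coder_1 = ["coping", "coping", "task", "task"]
coder_2 = ["coping", "task", "task", "task"]
kappa = cohen_kappa(coder_1, coder_2)  # 0.5 for this toy example
```

Here the coders agree on three of four utterances (0.75), chance agreement is 0.5, and kappa is therefore 0.5.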

NORA - Norwegian Open Research Archives

Reproducibility of 3-dimensional ultrasound readings of volume of carotid atherosclerotic plaque

Author: A Fenster
A Zanchetti
AM Landry
C Palombo
Dieter Schremmer
GS Mintz
J Persson
JD Spence
JM Bland
Klaus O Stumpe
KO Stumpe
M Ludwig
M Naghavi
MA Espeland
Malte Ludwig
MM Shoukri
MW Lorenz
PE Shrout
S Graf
Tomasz Zielinski
U Schminke
Publication venue: BioMed Central
Publication date: 01/01/2008

Background: Non-invasive 3-dimensional (3D) ultrasound (US) has emerged as the predominant approach for evaluating the progression of carotid atherosclerosis and its response to treatment. The aim of this study was to investigate the quality of a central reading procedure concerning plaque volume (PV), measured by 3D US in a multinational US trial. Methods: Two data sets of 45 and 60 3D US patient images of plaques (mean PV, 71.8 and 39.8 μl, respectively) were used. PV was assessed by means of manual planimetry. The intraclass correlation coefficient (ICC) was applied to determine reader variabilities. The repeatability coefficient (RC) and the coefficient of variation (CV) were used to investigate the effect of the number of slices (S) in manual planimetry and of plaque size on measurement variability. Results: Intra-reader variability was small, as reflected by ICCs of 0.985, 0.967 and 0.969 for 3 appointed readers. The ICC value generated between the 3 readers was 0.964, indicating that inter-reader variability was also small. Subgroup analyses showed that both intra- and inter-reader variabilities were lower for larger than for smaller plaques. Mean CVs were similar for the 5S- and 10S-methods, with an RC of 4.7 μl. The RC between both methods as well as the CVs were comparatively lower for larger plaques. Conclusion: By implementing standardised central 3D US reading protocols and strict quality control procedures, highly reliable ultrasonic re-readings of plaque images can be achieved in large multicentre trials.

Testing for heterogeneity among the components of a binary composite outcome in a clinical trial

Author: A Donald
AJ Sankoh
AP Hallstrom
CG Park
D Follmann
DL DeMets
DW Hosmer
E Braunwald
I Ferreira-Gonzalez
Janice Pogue
JD Neaton
JNK Rao
KY Liang
LA Moye
LE Bjorling
Lehana Thabane
MB Leon
MF Huque
MM Shoukri
P McCullagh
P Tugwell
PC Austin
PJ Devereaux
RJ Hardy
RM Califf
S Bergman
S Hariharan
S Ross
Salim Yusuf
SM Davis
The Heart Outcomes Prevention Evaluation (HOPE) Study Investigators
V Berger
VM Montori
Publication venue: BioMed Central
Publication date: 01/01/2010

Background: Investigators designing clinical trials often use composite outcomes to overcome many statistical issues. Trialists want to maximize power to show a statistically significant treatment effect and avoid inflation of the Type I error rate due to evaluation of multiple individual clinical outcomes. However, if the treatment effect is not similar among the components of this composite outcome, we are left not knowing how to interpret the treatment effect on the composite itself. Given significant heterogeneity among these components, a composite outcome may be judged as being invalid or un-interpretable for estimation of the treatment effect. This paper compares the power of different tests to detect heterogeneity of treatment effect across components of a composite binary outcome. Methods: Simulations were done comparing four different models commonly used to analyze correlated binary data. These models included: logistic regression ignoring correlation, logistic regression weighted by the intra-cluster correlation coefficient, population average logistic regression using generalized estimating equations (GEE), and random effects logistic regression. Results: We found that the population average model based on GEE had the greatest power across most scenarios. Adequate power to detect possible composite heterogeneity or variation between treatment effects of individual components of a composite outcome was seen when the power for detecting the main study treatment effect for the composite outcome was also reasonably high. Conclusions: It is recommended that authors report tests of composite heterogeneity for composite outcomes and that this accompany the publication of the statistically significant results of the main effect on the composite along with individual components of composite outcomes.

Reproducibility and day time bias correction of optoelectronic leg volumetry: a prospective cohort study

Author: A Berard
A Rieck
AW Stanton
B Eklof
C Stick
CG Fraser
Claudia Blazek
E Rabe
F Brijker
F Pannier
Felix Amsler
Frédéric Baumann
HN Mayrovitz
Hong H Keo
IO Man
Iris Baumgartner
J Fischbach
JM Bland
JW Ely
L Pellis
M Perrin
M Roustit
M Vayssairat
MJ Jonker
MM Shoukri
N Henschke
PA Sakkinen
Rolf P Engelberger
S Tierney
S Ziegler
Torsten Willenberg
U Müller-Bühl
W Blattler
Werner Blättler
Publication venue: BioMed Central
Publication date: 01/01/2011

Background: Leg edema is a common manifestation of various underlying pathologies. Reliable measurement tools are required to quantify edema and monitor therapeutic interventions. The aim of the present work was to investigate the reproducibility of optoelectronic leg volumetry over a 3-week period and to eliminate daytime-related within-individual variability. Methods: Optoelectronic leg volumetry was performed in 63 hairdressers (mean age 45 ± 16 years, 85.7% female) in standing position twice within a minute for each leg and repeated after 3 weeks. Both lower leg (legBD) and whole limb (limbBF) volumetry were analysed. Reproducibility was expressed as analytical and within-individual coefficients of variation (CVA, CVW), and as intra-class correlation coefficients (ICC). Results: A total of 492 leg volume measurements were analysed. Both legBD and limbBF volumetry were highly reproducible, with CVA of 0.5% and 0.7%, respectively. Within-individual reproducibility of legBD and limbBF volumetry over the 3-week period was high (CVW 1.3% for both; ICC 0.99 for both). At both visits, the second measurement revealed a significantly higher volume compared to the first measurement, with a mean increase of 7.3 ml ± 14.1 ml (0.33% ± 0.58%) for legBD and 30.1 ml ± 48.5 ml (0.52% ± 0.79%) for limbBF volume. A significant linear correlation was found between absolute and relative leg volume differences and the difference in exact time of day of measurement between the two study visits (P < .001). A time-correction formula derived from this relationship permitted further improvement of CVW. Conclusions: Leg volume changes can be reliably assessed by optoelectronic leg volumetry at a single time point and over a 3-week period. However, volumetry results are biased by orthostatic and daytime-related volume changes. The bias for daytime-related volume changes can be minimized by a time-correction formula.

Bern Open Repository and Information System (BORIS)

The Development and Validation of the Thai-Translated Irrational Performance Beliefs Inventory (T-iPBI)

Author: A Ellis
A Szentagotai
A Vîslă
AG Wood
AS Zigmond
CP Chen
D Wild
DA Kenny
DAF Haaga
E Anthoine
G Bernal
G Hofstede
G Si
H Lindner
I Bjelland
JC Nunnally
KF Widaman
L Chang
LI Lega
LT Hu
M Dixon
M. J. Turner
MD Terjesen
MJ Turner
MM Shoukri
MS Allen
NA Ndika
P Burgess
PM Bentler
R DiGiuseppe
RF DeVellis
S Deen
S Mat Roni
T Nilchaikovit
V. Chotpitayasunondh
W Dryden
Publication venue: Springer Science and Business Media LLC
Publication date: 15/06/2019

One of the most commonly employed cognitive-behavioural approaches to psychotherapy is rational-emotive behaviour therapy, but researchers have been troubled by some of the limitations of irrational beliefs psychometrics. As a result, Turner et al. (Eur J Psychol Assess 34:174–180, 2018a. https://doi.org/10.1027/1015-5759/a000314) developed the Irrational Performance Beliefs Inventory (iPBI), a novel measure of irrational beliefs for use within performance domains. However, the linguistic and cross-cultural adaptation of the iPBI into other languages is necessary for its multinational and multicultural use. The purpose of this paper is to develop the Thai-translated version of the iPBI (T-iPBI) and examine its validity and reliability. Data retrieved from 166 participants were analysed using the SPSS and AMOS software packages. Thirty-three participants completed two follow-up T-iPBI measurements (1- and 3-week repeat assessments). After the linguistic and cross-cultural adaptation processes, the T-iPBI demonstrated excellent reliability (internal consistency and test–retest reliability) as well as construct, concurrent, and predictive validity. The current findings indicate that the 20-item T-iPBI can be used as a self-assessment instrument to evaluate individuals’ irrational performance beliefs in a Thai population. We also highlight the implications of this study and suggest a variety of future research directions that stem from the results.

E-space: Manchester Metropolitan University's Research Repository

STORE - Staffordshire Online Repository

Reliability of Therapist Effects in Practice-Based Psychotherapy Research : A Guide for the Planning of Future Studies

Author: AE Kazdin
Anne-Katharina Schiefele
BA Bell
BE Wampold
C Evans
CJM Maas
D Kim
D Saxon
David Saxon
DF Ricks
Dietmar Schulte
F Gao
I Elkin
J Hox
J Okiishi
J Owen
Jaime Delgadillo
Jan Böhnke
JC Gardiner
JC Okiishi
JD Huppert
JF Boswell
JS Lyons
Julian Rubel
K Howard
K Jong De
K Kroenke
KI Howard
L Manea
LE Beutler
LJ Adelson
LR Derogatis
M Moerbeek
Mark Kopta
Michael Barkham
Michael J. Lambert
MJ Lambert
MM Shoukri
P Burton
P Crits-Christoph
RJ Lueger
S Kopta
S Raudenbush
SA Baldwin
SC Musca
SG Hofmann
SL Garfield
SM Eldridge
Stevan L. Nielsen
U Dinger
W Lutz
Wolfgang Lutz
Publication venue: Springer Science and Business Media LLC
Publication date: 05/05/2016

This paper aims to provide researchers with practical information on sample sizes for accurate estimations of therapist effects (TEs). The investigations are based on an integrated sample of 48,648 patients treated by 1800 therapists. Multilevel modeling and resampling were used to realize varying sample size conditions and to generate empirical estimates of TEs. Sample size tables, including varying sample size conditions, were constructed and study examples are given. This study gives an insight into the potential size of the TE and provides researchers with a practical guide to aid the planning of future studies in this field.