3 research outputs found

Robust Statistical Methods for Empirical Software Engineering

Author: A Agresti
A Arcuri
A Vargha
AF Tappenden
Amnart Pohthong
Barbara Kitchenham
BL Welch
D Budgen
David Budgen
DE Stout
DM Erceg-Hurn
DW Zimmerman
E Brunner
GEP Box
GS Mudholkar
HC Kraemer
J Demšar
Jacky Keung
JT Behrens
JW Cohen
K Dejaeger
KK Yuen
L Acion
L Madeyski
Lech Madeyski
LK John
M El-Attar
M Jureczko
MG Akritas
MW Lipsey
N Cliff
NM Razali
P Shrout
PA Whigham
Pearl Brereton
PH Ramsey
R Bergmann
RB D’Agostino
RJ Grissom
RM Price
RR Wilcox
Shirley Gibbs
SL Braver
SS Shapiro
Stuart Charters
T Dybå
T Micceri
T Tian
TC Urdan
VB Kampenes
W Conover
W Viechtbauer
WR Shadish
Publication venue: 'Springer Science and Business Media LLC'

Lessons from Conducting a Distributed Quasi-Experiment

Author: Brereton Pearl
Budgen David
Charters Stuart
Gibbs Shirley
Keung Jacky
Kitchenham Barbara
Pohthong Amnart
Publication date: 12/12/2013

Context: Due to the lack of suitably skilled participants, software engineering experiments often lack the statistical power needed to detect the levels of effect that may be encountered. Aim: To investigate whether this can be remedied by running an experiment across multiple sites, organised as a single study rather than as a set of replications. Method: We performed a 'trial' of the idea using a topic (structured abstracts) that some of us had studied previously and which required no participant training. We used five sites, each with 16 participants. Results: We were able to demonstrate the benefits of increased statistical power (and of structured abstracts). We report on our experiences with designing and conducting the study and identify some key lessons about how future studies of this form might be organised. Conclusions: The distributed model offers a flexible, robust form that is capable of delivering better statistical power than would be achieved by running a set of parallel replicated studies.
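
The power argument in this abstract can be illustrated with a rough calculation. The sketch below is not the authors' analysis: it assumes a simple two-group between-subjects comparison, a hypothetical standardised effect size of d = 0.5, and a normal approximation to the two-sided test. The abstract states neither the design nor the effect size, and site effects are ignored here. It compares one site of 16 participants (8 per group) with the pooled 80 participants (40 per group).

```python
from math import sqrt
from statistics import NormalDist


def approx_power(d, n_per_group, alpha=0.05):
    """Approximate power of a two-sided, two-sample test (normal approximation).

    d           -- assumed standardised mean difference (hypothetical value)
    n_per_group -- participants in each of the two groups
    alpha       -- significance level
    """
    z = NormalDist()
    z_crit = z.inv_cdf(1 - alpha / 2)
    # Expected value of the test statistic under the assumed effect size.
    shift = d * sqrt(n_per_group / 2)
    # Probability of landing in either rejection region.
    return z.cdf(shift - z_crit) + z.cdf(-shift - z_crit)


# Assumed split: half of each sample in each group.
single_site = approx_power(d=0.5, n_per_group=8)    # one site, 16 participants
pooled = approx_power(d=0.5, n_per_group=40)        # five sites, 80 participants

print(f"single site: {single_site:.2f}")
print(f"pooled:      {pooled:.2f}")
```

Under these assumptions the pooled study has substantially higher power than any single site, which is the sample-size effect the abstract relies on. The sketch does not model the abstract's further comparison between a single distributed study and a set of parallel replications.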

Durham Research Online

Crossref

Lincoln University Research Archive

Robust Statistical Methods for Empirical Software Engineering

Author: Brereton Pearl
Budgen David
Charters Stuart
Gibbs Shirley
Keung Jacky
Kitchenham Barbara
Madeyski Lech
Pohthong Amnart
Publication venue: Springer
Publication date: 16/06/2016

There have been many changes in statistical theory in the past 30 years, including increased evidence that non-robust methods may fail to detect important results. The statistical advice available to software engineering researchers needs to be updated to address these issues. This paper aims both to explain the new results in the area of robust analysis methods and to provide a large-scale worked example of the new methods. We summarise the results of analyses of the Type 1 error efficiency and power of standard parametric and non-parametric statistical tests when applied to non-normal data sets. We identify parametric and non-parametric methods that are robust to non-normality. We present an analysis of a large-scale software engineering experiment to illustrate their use. We illustrate the use of kernel density plots, and parametric and non-parametric methods using four different software engineering data sets. We explain why the methods are necessary and the rationale for selecting a specific analysis. We suggest using kernel density plots rather than box plots to visualise data distributions. For parametric analysis, we recommend trimmed means, which can support reliable tests of the differences between the central location of two or more samples. When the distribution of the data differs among groups, or we have ordinal scale data, we recommend non-parametric methods such as Cliff’s δ or a robust rank-based ANOVA-like method.
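
Two of the quantities recommended in this abstract, the trimmed mean and Cliff's δ, are simple enough to sketch in a few lines. The following is a minimal illustration under stated assumptions, not the paper's implementation: the 20% trimming proportion is a common default chosen here for illustration, the function names are hypothetical, and the data are made up. No significance test or confidence interval is computed.

```python
from itertools import product


def trimmed_mean(values, proportion=0.2):
    """Mean after dropping floor(proportion * n) values from each tail."""
    if not 0 <= proportion < 0.5:
        raise ValueError("proportion must be in [0, 0.5)")
    ordered = sorted(values)
    k = int(proportion * len(ordered))
    kept = ordered[k:len(ordered) - k]
    return sum(kept) / len(kept)


def cliffs_delta(xs, ys):
    """Cliff's delta: P(x > y) - P(x < y) over all cross-group pairs."""
    greater = sum(1 for x, y in product(xs, ys) if x > y)
    less = sum(1 for x, y in product(xs, ys) if x < y)
    return (greater - less) / (len(xs) * len(ys))


# Made-up scores; the value 40 is an outlier that inflates the ordinary mean.
group_a = [3, 4, 4, 5, 6, 7, 8, 9, 10, 40]
group_b = [1, 2, 2, 3, 3, 4, 4, 5, 5, 6]

print("mean A:        ", sum(group_a) / len(group_a))
print("trimmed mean A:", trimmed_mean(group_a))
print("Cliff's delta: ", cliffs_delta(group_a, group_b))
```

Cliff's δ ranges from -1 (every value in the first group is below every value in the second) to 1 (the reverse), with 0 indicating complete overlap. It uses only the ordering of the values, which is why it suits ordinal data.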

Durham Research Online

Keele Research Repository

Crossref

Springer - Publisher Connector

Lincoln University Research Archive