An Empirical Study on Dependence Clusters for Effort-Aware Fault-Proneness Prediction

Binkley, D; Harman, M; Islam, S; Krinke, J; Xu, B; Yang, Y; Zhou, Y



Authors: D Binkley, M Harman, S Islam, J Krinke, B Xu, Y Yang, Y Zhou
Publication date: 25 August 2016
Publisher: 31st IEEE/ACM International Conference on Automated Software Engineering (ASE 2016)

Abstract

A dependence cluster is a set of mutually inter-dependent program elements. Prior studies have found that large dependence clusters are prevalent in software systems, and it has been suggested that they have potentially harmful effects on software quality. However, little empirical evidence has been provided to support this claim. The study presented in this paper investigates the relationship between dependence clusters and software quality at the function level, with a focus on effort-aware fault-proneness prediction. First, the investigation analyzes whether larger dependence clusters tend to be more fault-prone. Second, it investigates whether the proportion of faulty functions inside dependence clusters differs significantly from the proportion of faulty functions outside them. Third, it examines whether functions that play a more important role inside dependence clusters are more fault-prone than the others. Finally, based on two groups of functions (those inside and those outside dependence clusters), it considers a segmented fault-proneness prediction model. Our experimental results, based on five well-known open-source systems, show that (1) larger dependence clusters tend to be more fault-prone; (2) the proportion of faulty functions inside dependence clusters is significantly larger than the proportion of faulty functions outside dependence clusters; (3) functions inside dependence clusters that play more important roles are more fault-prone; and (4) our segmented prediction model can significantly improve the effectiveness of effort-aware fault-proneness prediction in both ranking and classification scenarios. These findings help us better understand how dependence clusters influence software quality.
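
To make the opening definition and the second research question more concrete, the following is a minimal sketch and not the tooling used in the study. It assumes that function-level dependence is available as a directed graph, and that a dependence cluster can be approximated as a strongly connected component containing more than one function (so that every member transitively depends on every other). The function names `dependence_clusters` and `faulty_proportions`, and the toy data, are hypothetical and chosen only for illustration.

```python
def dependence_clusters(deps):
    """Return sets of mutually dependent functions.

    Assumption: `deps` maps a function name to the functions it depends on,
    and a cluster is a strongly connected component with more than one member.
    Uses Kosaraju's two-pass algorithm with explicit stacks.
    """
    nodes = set(deps) | {t for targets in deps.values() for t in targets}
    reverse = {n: set() for n in nodes}
    for src, targets in deps.items():
        for dst in targets:
            reverse[dst].add(src)

    # Pass 1: record nodes in order of DFS completion on the forward graph.
    order, seen = [], set()
    for start in sorted(nodes):
        if start in seen:
            continue
        seen.add(start)
        stack = [(start, iter(sorted(deps.get(start, ()))))]
        while stack:
            node, successors = stack[-1]
            nxt = next((n for n in successors if n not in seen), None)
            if nxt is None:
                order.append(node)
                stack.pop()
            else:
                seen.add(nxt)
                stack.append((nxt, iter(sorted(deps.get(nxt, ())))))

    # Pass 2: walk the reversed graph in decreasing completion order;
    # each traversal collects exactly one strongly connected component.
    clusters, assigned = [], set()
    for root in reversed(order):
        if root in assigned:
            continue
        assigned.add(root)
        component, stack = set(), [root]
        while stack:
            node = stack.pop()
            component.add(node)
            for pred in reverse[node]:
                if pred not in assigned:
                    assigned.add(pred)
                    stack.append(pred)
        if len(component) > 1:
            clusters.append(component)
    return clusters


def faulty_proportions(deps, faulty):
    """Return (proportion faulty inside clusters, proportion faulty outside)."""
    nodes = set(deps) | {t for targets in deps.values() for t in targets}
    inside = set().union(*dependence_clusters(deps))
    outside = nodes - inside

    def proportion(group):
        # An empty group has no defined proportion; report 0.0 for simplicity.
        return len(group & faulty) / len(group) if group else 0.0

    return proportion(inside), proportion(outside)


# Hypothetical toy system: `parse` and `lex` depend on each other.
toy_deps = {
    "main": ["parse"],
    "parse": ["lex", "emit"],
    "lex": ["parse"],
    "emit": ["log"],
    "log": [],
}
toy_faulty = {"parse", "log"}
inside_rate, outside_rate = faulty_proportions(toy_deps, toy_faulty)
```

In this toy graph the only cluster is the pair `parse` and `lex`; directly recursive single functions are not counted as clusters under the stated assumption. Whether a difference between the two proportions is significant would still need a statistical test on real data, which this sketch does not attempt.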

