Search CORE

15 research outputs found

Discovering Loners and Phantoms in Commit and Issue Data

Author: Brandtner Martin
Gall Harald
Leitner Philipp
Panichella Sebastiano
Schermann Gerald
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 19/05/2015
Field of study

The interlinking of commit and issue data has become a de-facto standard in software development. Modern issue tracking systems, such as JIRA, automatically interlink commits and issues by the extraction of identifiers (e.g., issue key) from commit messages. However, the conventions for the use of interlinking methodologies vary between software projects. For example, some projects enforce the use of identifiers for every commit while others have less restrictive conventions. In this work, we introduce a model called PaLiMod to enable the analysis of interlinking characteristics in commit and issue data. We surveyed 15 Apache projects to investigate differences and commonalities between linked and non-linked commits and issues. Based on the gathered information, we created a set of heuristics to interlink the residual of non-linked commits and issues. We present the characteristics of Loners and Phantoms in commit and issue data. The results of our evaluation indicate that the proposed PaLiMod model and heuristics enable an automatic interlinking and can indeed reduce the residual of non-linked commits and issues in software projects

Crossref

ZORA

EALink: An Efficient and Accurate Pre-trained Framework for Issue-Commit Link Recovery

Author: Ji Rongrong
Li Hui
Wang Juhong
Wang Yanlin
Wei Zhao
Xu Yong
Zhang Chenyuan
Publication venue
Publication date: 21/08/2023
Field of study

Issue-commit links, as a type of software traceability links, play a vital role in various software development and maintenance tasks. However, they are typically deficient, as developers often forget or fail to create tags when making commits. Existing studies have deployed deep learning techniques, including pretrained models, to improve automatic issue-commit link recovery.Despite their promising performance, we argue that previous approaches have four main problems, hindering them from recovering links in large software projects. To overcome these problems, we propose an efficient and accurate pre-trained framework called EALink for issue-commit link recovery. EALink requires much fewer model parameters than existing pre-trained methods, bringing efficient training and recovery. Moreover, we design various techniques to improve the recovery accuracy of EALink. We construct a large-scale dataset and conduct extensive experiments to demonstrate the power of EALink. Results show that EALink outperforms the state-of-the-art methods by a large margin (15.23%-408.65%) on various evaluation metrics. Meanwhile, its training and inference overhead is orders of magnitude lower than existing methods.Comment: 13 pages, 6 figures, published to AS

arXiv.org e-Print Archive

An Empirical Study of Regression Bug Chains in Linux

Author: Jiang B
Sui Y
Xiao G
Zheng Z
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2020
Field of study

OPUS - University of Technology Sydney

Towards More Accurate Multi-Label Software Behavior Learning

Author: CHEN Zhenyu
LO David
WANG Xinyu
XIA Xin
YANG Feng
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/02/2014
Field of study

NSF

Institutional Knowledge at Singapore Management University

Characterizing and Predicting Blocking Bugs in Open Source Projects

Author: Nagappan Mei
Shihab Emad
Valdivia-Garcia Harold
Publication venue: 'Elsevier BV'
Publication date: 06/04/2018
Field of study

Software engineering researchers have studied specific types of issues such reopened bugs, performance bugs, dormant bugs, etc. However, one special type of severe bugs is blocking bugs. Blocking bugs are software bugs that prevent other bugs from being fixed. These bugs may increase maintenance costs, reduce overall quality and delay the release of the software systems. In this paper, we study blocking bugs in eight open source projects and propose a model to predict them early on. We extract 14 different factors (from the bug repositories) that are made available within 24 hours after the initial submission of the bug reports. Then, we build decision trees to predict whether a bug will be a blocking bugs or not. Our results show that our prediction models achieve F-measures of 21%-54%, which is a two-fold improvement over the baseline predictors. We also analyze the fixes of these blocking bugs to understand their negative impact. We find that fixing blocking bugs requires more lines of code to be touched compared to non-blocking bugs. In addition, our file-level analysis shows that files affected by blocking bugs are more negatively impacted in terms of cohesion, coupling complexity and size than files affected by non-blocking bugs

Crossref

Concordia University Research Repository

Automatic, high accuracy prediction of reopened bugs

Author: LO David
Shihab Emad
Wang Xinyu
Xia Xin
Zhou Bo
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 18/09/2014
Field of study

Crossref

Institutional Knowledge at Singapore Management University

Automated bug report field reassignment and refinement prediction

Author: LO David
SHIHAB Emad
WANG Xinyu
XIA Xin
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/09/2016
Field of study

Date of Publication (online): 26 October 2015</p

Crossref

Institutional Knowledge at Singapore Management University