Search CORE

321 research outputs found

What is the Connection Between Issues, Bugs, and Enhancements? (Lessons Learned from 800+ Software Projects)

Author: Chatterjee S
Chong Hok Yuen CKS
Dickey David A
Hurst Harold Edwin
Junhong Guo
Neil Martin
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 05/09/2018
Field of study

Agile teams juggle multiple tasks so professionals are often assigned to multiple projects, especially in service organizations that monitor and maintain a large suite of software for a large user base. If we could predict changes in project conditions changes, then managers could better adjust the staff allocated to those projects.This paper builds such a predictor using data from 832 open source and proprietary applications. Using a time series analysis of the last 4 months of issues, we can forecast how many bug reports and enhancement requests will be generated next month. The forecasts made in this way only require a frequency count of this issue reports (and do not require an historical record of bugs found in the project). That is, this kind of predictive model is very easy to deploy within a project. We hence strongly recommend this method for forecasting future issues, enhancements, and bugs in a project.Comment: Accepted to 2018 International Conference on Software Engineering, at the software engineering in practice track. 10 pages, 10 figure

arXiv.org e-Print Archive

Crossref

NICHE: A Curated Dataset of Engineered Machine Learning Projects in Python

Author: Lo David
Lok Camellia
Phan Jack
Qi Haodi
Sim Sheng Qin
Tan Constance
Tay Qijin
Thung Ferdian
Wee Fiona
Widyasari Ratnadira
Yang Zhou
Publication venue
Publication date: 27/01/2023
Field of study

Machine learning (ML) has gained much attention and been incorporated into our daily lives. While there are numerous publicly available ML projects on open source platforms such as GitHub, there have been limited attempts in filtering those projects to curate ML projects of high quality. The limited availability of such a high-quality dataset poses an obstacle in understanding ML projects. To help clear this obstacle, we present NICHE, a manually labelled dataset consisting of 572 ML projects. Based on evidences of good software engineering practices, we label 441 of these projects as engineered and 131 as non-engineered. This dataset can help researchers understand the practices that are followed in high-quality ML projects. It can also be used as a benchmark for classifiers designed to identify engineered ML projects.Comment: Accepted by MSR 202

arXiv.org e-Print Archive

Institutional Knowledge at Singapore Management University

FigShare

We Don't Need Another Hero? The Impact of "Heroes" on Software Development

Author: Bier Norman
Cabot Jordi
Cullom Charmayne
Ghotra Baljinder
Goeminne Mathieu
Kocaguneli E.
Morcov Stefan
Pinto Gustavo
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 20/02/2018
Field of study

A software project has "Hero Developers" when 80% of contributions are delivered by 20% of the developers. Are such heroes a good idea? Are too many heroes bad for software quality? Is it better to have more/less heroes for different kinds of projects? To answer these questions, we studied 661 open source projects from Public open source software (OSS) Github and 171 projects from an Enterprise Github. We find that hero projects are very common. In fact, as projects grow in size, nearly all project become hero projects. These findings motivated us to look more closely at the effects of heroes on software development. Analysis shows that the frequency to close issues and bugs are not significantly affected by the presence of project type (Public or Enterprise). Similarly, the time needed to resolve an issue/bug/enhancement is not affected by heroes or project type. This is a surprising result since, before looking at the data, we expected that increasing heroes on a project will slow down howfast that project reacts to change. However, we do find a statistically significant association between heroes, project types, and enhancement resolution rates. Heroes do not affect enhancement resolution rates in Public projects. However, in Enterprise projects, the more heroes increase the rate at which project complete enhancements. In summary, our empirical results call for a revision of a long-held truism in software engineering. Software heroes are far more common and valuable than suggested by the literature, particularly for medium to large Enterprise developments. Organizations should reflect on better ways to find and retain more of these heroesComment: 8 pages + 1 references, Accepted to International conference on Software Engineering - Software Engineering in Practice, 201

arXiv.org e-Print Archive

Crossref