Location of Repository

FIT DATA SELECTION BASED ON PROJECT FEATURES FOR SOFTWARE EFFORT ESTIMATION MODELS

By Koji Toda, Akito Monden and Ken-ichi Matsumoto

Abstract

To construct a better multivariate regression model for software effort estimation, this paper proposes a method to automatically select projects as fit data (a dataset for model construction) from a given project data set based on an estimation target's features. As a result of an experimental evaluation using the ISBSG data set, which is one of the most commonly used project data sets for effort estimation studies, the proposed method showed better estimation performance than the conventional method (of constructing a model using all project data). The median of MRE (Magnitude of Relative Error) was improved from 0.552 to 0.383, and also the median of MER (Magnitude of Error Relative) was improved from 0.457 to 0.381. While regression models were often constructed using all available project data, this paper showed the necessity of fit data selection, and showed that the proposed method is one of the effective and systematic means of doing the selection

Topics: KEY WORDS
Year: 2011
OAI identifier: oai:CiteSeerX.psu:10.1.1.186.185
Provided by: CiteSeerX
Download PDF:
Sorry, we are unable to provide the full text but you may find it at the following location(s):
  • http://citeseerx.ist.psu.edu/v... (external link)
  • Suggested articles


    To submit an update or takedown request for this paper, please submit an Update/Correction/Removal Request.