
    Choosing software metrics for defect prediction: an investigation on feature selection techniques

    The selection of software metrics for building software quality prediction models is a search-based software engineering problem. An exhaustive search for such metrics is usually not feasible due to limited project resources, especially if the number of available metrics is large. Defect prediction models are necessary for aiding project managers in better utilizing valuable project resources for software quality improvement. The efficacy and usefulness of a fault-proneness prediction model is only as good as the quality of the software measurement data. This study focuses on the problem of attribute selection in the context of software quality estimation. A comparative investigation is presented for evaluating our proposed hybrid attribute selection approach, in which feature ranking is first used to reduce the search space, followed by feature subset selection. A total of seven different feature ranking techniques are evaluated, while four different feature subset selection approaches are considered. The models are trained using five commonly used classification algorithms. The case study is based on software metrics and defect data collected from multiple releases of a large real-world software system. The results demonstrate that while some feature ranking techniques performed similarly, the automatic hybrid search algorithm performed the best among the feature subset selection methods. Moreover, the performance of the defect prediction models either improved or remained unchanged when over 85% of the metrics were eliminated. Copyright © 2011 John Wiley & Sons, Ltd.
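    The two-stage idea described in this abstract — a cheap feature ranking pass to shrink the search space, followed by a classifier-driven subset search — can be sketched in a few lines. The snippet below is a minimal, hypothetical rendering of that hybrid scheme using scikit-learn on synthetic data; the specific rankers, subset search algorithms, and classifiers evaluated in the study are not reproduced here.

```python
from sklearn.datasets import make_classification
from sklearn.feature_selection import SelectKBest, SequentialFeatureSelector, f_classif
from sklearn.model_selection import cross_val_score
from sklearn.naive_bayes import GaussianNB

# Synthetic stand-in for a software metrics matrix X and defect labels y.
X, y = make_classification(n_samples=300, n_features=40, n_informative=6, random_state=0)

# Stage 1: feature ranking (here an ANOVA F-score) to shrink the search space.
ranker = SelectKBest(score_func=f_classif, k=15).fit(X, y)
X_ranked = ranker.transform(X)

# Stage 2: feature subset selection over the reduced space (here a greedy forward search).
clf = GaussianNB()
selector = SequentialFeatureSelector(clf, n_features_to_select=5, direction="forward")
X_subset = selector.fit_transform(X_ranked, y)

# Train and evaluate the defect prediction model on the selected subset.
print(cross_val_score(clf, X_subset, y, cv=5, scoring="roc_auc").mean())
```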

    Unconformity-type uranium systems: a comparative review and predictive modelling of critical genetic factors

    A review of descriptive and genetic models is presented for unconformity-type uranium deposits, with particular attention given to spatial representations of key process components of the mineralising system and their mappable expressions. This information formed the basis for the construction of mineral potential models for the world's premier unconformity-style uranium provinces: the Athabasca Basin in Saskatchewan, Canada (>650,000 t U3O8), and the NW McArthur Basin in the Northern Territory, Australia (>450,000 t U3O8). A novel set of ‘edge’ detection routines was used to identify high-contrast zones in gridded geophysical data in support of the mineral potential modelling. This approach to geophysical data processing and interpretation offers a virtually unbiased means of detecting potential basement structures under cover and at a range of scales. Fuzzy logic mineral potential mapping was demonstrated to be a useful tool for delineating areas that have high potential for hosting economic uranium concentrations, utilising all knowledge and incorporating all relevant spatial data available for the project area. The resulting models not only effectively ‘rediscover’ the known uranium mineralisation but also highlight several other areas containing all of the mappable components deemed critical for the accumulation of economic uranium deposits. The intelligence amplification approach to mineral potential modelling presented herein is an example of augmenting expert-driven conceptual targeting with the powerful logic and rationality of modern computing. The result is a targeting tool that captures the current state of geospatial and exploration information and conceptual knowledge pertaining to unconformity-type uranium systems. Importantly, the tool can be readily updated once new information or knowledge comes to hand. As with every targeting tool, these models should not be used in isolation, but as one of several inputs informing exploration decision-making. Nor should they be regarded as ‘treasure maps’, but rather as pointers towards areas of high potential that are worthy of further investigation.
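    Fuzzy-logic mineral potential mapping of the kind described above typically combines normalised evidence grids with fuzzy operators (AND, OR, gamma) to produce a single prospectivity surface. The sketch below is a generic, hypothetical illustration of that overlay step using numpy and invented evidence layers; the actual evidence themes, membership functions, and operator choices behind the Athabasca and McArthur models are not given in the abstract and are not reproduced here.

```python
import numpy as np

rng = np.random.default_rng(0)
shape = (100, 100)                       # illustrative project-area grid
near_unconformity = rng.random(shape)    # fuzzy membership: proximity to the unconformity surface
near_edges = rng.random(shape)           # fuzzy membership: proximity to 'edge'-detected basement structures
alteration = rng.random(shape)           # fuzzy membership: alteration / geochemical evidence

def fuzzy_gamma(*layers, gamma=0.8):
    """Fuzzy gamma operator: a compromise between the algebraic product and algebraic sum."""
    product = np.prod(layers, axis=0)
    algebraic_sum = 1.0 - np.prod([1.0 - layer for layer in layers], axis=0)
    return (algebraic_sum ** gamma) * (product ** (1.0 - gamma))

# Combine the evidence grids into one prospectivity surface; high values mark cells
# where the mappable components of the mineral system coincide.
prospectivity = fuzzy_gamma(near_unconformity, near_edges, alteration, gamma=0.8)
print(prospectivity.mean(), prospectivity.max())
```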

    complexFuzzy: A novel clustering method for selecting training instances of cross-project defect prediction

    Over the last decade, researchers have investigated to what extent cross-project defect prediction (CPDP) offers advantages over traditional defect prediction settings. In CPDP, the training and testing data do not come from the same project; instead, data from different projects are employed. Selecting proper training data plays an important role in the success of CPDP. In this study, a novel clustering method named complexFuzzy is presented for selecting the training data of CPDP. The method is developed by determining membership values with the help of metrics that can be considered indicators of complexity. First, CPDP combinations are created on 29 different data sets. Subsequently, complexFuzzy is evaluated by considering the cluster centers of the data sets and comparing performance measures including the area under the curve (AUC) and the F-measure. The method is superior to five other comparison algorithms in terms of the distance of cluster centers and prediction performance.
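    The abstract does not describe how complexFuzzy actually computes its membership values, so the sketch below only illustrates the general idea of filtering cross-project training instances by their closeness, in a complexity-metric feature space, to structure learned from the target project. It uses plain k-means from scikit-learn as a stand-in for the fuzzy clustering, and synthetic data; names such as source_X and target_X are assumptions for illustration.

```python
import numpy as np
from sklearn.cluster import KMeans
from sklearn.preprocessing import StandardScaler

rng = np.random.default_rng(0)
source_X = rng.normal(size=(500, 10))    # complexity metrics from other projects
source_y = rng.integers(0, 2, size=500)  # defect labels for those source instances
target_X = rng.normal(size=(120, 10))    # unlabeled metrics from the target project

# Scale all instances together, then learn cluster structure on the target project.
scaler = StandardScaler().fit(np.vstack([source_X, target_X]))
km = KMeans(n_clusters=5, n_init=10, random_state=0).fit(scaler.transform(target_X))

# Keep only source instances that lie close to a target-project cluster center
# (here: distance to the nearest center below the median distance).
dists = km.transform(scaler.transform(source_X)).min(axis=1)
keep = dists < np.median(dists)
train_X, train_y = source_X[keep], source_y[keep]
print(train_X.shape)  # the selected cross-project training set
```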

    Enhancing Software Project Outcomes: Using Machine Learning and Open Source Data to Employ Software Project Performance Determinants

    Many factors can influence the ongoing management and execution of technology projects. Some of these elements are known a priori during the project planning phase. Others require real-time data gathering and analysis throughout the lifetime of a project. These real-time project data elements are often neglected, misclassified, or otherwise misinterpreted during the project execution phase, resulting in an increased risk of delays, quality issues, and missed business opportunities. The overarching motivation for this research endeavor is to offer reliable improvements in software technology management and delivery. The primary purpose is to discover and analyze the impact, role, and level of influence of various project-related data on the ongoing management of technology projects. The study leverages open source data regarding software performance attributes. The goal is to temper the subjectivity currently used by project managers (PMs) with quantifiable measures when assessing project execution progress. Modern-day PMs who manage software development projects are charged with an arduous task: they often obtain their inputs from technical leads who tend to be significantly more technical, and they must perform their role subject to the limitations of their own capabilities and competencies while contending with the stresses of the business environment, the policies and procedures dictated by their organizations, and resource constraints. The second purpose of this research study is to propose methods by which conventional project assessment processes can be enhanced using quantitative methods that utilize real-time project execution data. Transferability of academic research to industry application is specifically addressed through a delivery framework that provides meaningful data to industry practitioners.

    Complexity-based classification of software modules

    Software plays a major role in many organizations, and organizational success depends partially on the quality of the software used. In recent years, many researchers have recognized that statistical classification techniques are well suited to developing software quality prediction models. Different statistical software quality models, using complexity metrics as early indicators of software quality, have been proposed in the past. At a high level, the problem of software categorization is to classify software modules as fault-prone or non-fault-prone. The focus of this thesis is two-fold. The first is to study selected classification techniques, including unsupervised and supervised learning algorithms widely used for software categorization. The second is to explore a new unsupervised learning model employing Bayesian and deterministic approaches. In addition, we experimentally evaluate and compare these approaches using a real data set. Our experimental results show that different algorithms lead to statistically significantly different results.
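    As a rough illustration of the two families of techniques the thesis compares, the sketch below trains a supervised classifier on labelled modules and, separately, applies an unsupervised clustering heuristic that labels the higher-complexity cluster as fault-prone. It uses scikit-learn and synthetic data and is not the Bayesian/deterministic model proposed in the thesis.

```python
import numpy as np
from sklearn.cluster import KMeans
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score

# Synthetic stand-in for complexity metrics (X) and fault labels (y) of software modules.
X, y = make_classification(n_samples=400, n_features=8, n_informative=4, random_state=1)

# Supervised: learn the fault-prone / non-fault-prone boundary from labelled modules.
supervised = LogisticRegression(max_iter=1000).fit(X, y)
print("supervised accuracy:", accuracy_score(y, supervised.predict(X)))

# Unsupervised: cluster modules by their metrics, then call the cluster with the
# higher mean metric values fault-prone (a common heuristic, not the thesis's rule).
clusters = KMeans(n_clusters=2, n_init=10, random_state=1).fit_predict(X)
fault_prone = int(X[clusters == 1].mean() > X[clusters == 0].mean())
pred = (clusters == fault_prone).astype(int)
print("unsupervised accuracy:", accuracy_score(y, pred))
```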

    A Review of Metrics and Modeling Techniques in Software Fault Prediction Model Development

    This paper surveys software fault prediction approaches developed with different data analytic techniques reported in the software engineering literature. The study is split into three broad areas: (a) a description of the software metrics suites reported and validated in the literature; (b) a brief outline of previous research published on the development of software fault prediction models based on various analytic techniques, organized using a taxonomy of those techniques; and (c) a review of the advantages of using combinations of metrics, an area that is comparatively new and needs more research effort.