Search CORE

3,311 research outputs found

Software defect prediction using maximal information coefficient and fast correlation-based filter feature selection

Author: Mpofu Bongeka
Publication venue
Publication date: 01/12/2018
Field of study

Software quality ensures that applications that are developed are failure free. Some modern systems are intricate, due to the complexity of their information processes. Software fault prediction is an important quality assurance activity, since it is a mechanism that correctly predicts the defect proneness of modules and classifies modules that saves resources, time and developers’ efforts. In this study, a model that selects relevant features that can be used in defect prediction was proposed. The literature was reviewed and it revealed that process metrics are better predictors of defects in version systems and are based on historic source code over time. These metrics are extracted from the source-code module and include, for example, the number of additions and deletions from the source code, the number of distinct committers and the number of modified lines. In this research, defect prediction was conducted using open source software (OSS) of software product line(s) (SPL), hence process metrics were chosen. Data sets that are used in defect prediction may contain non-significant and redundant attributes that may affect the accuracy of machine-learning algorithms. In order to improve the prediction accuracy of classification models, features that are significant in the defect prediction process are utilised. In machine learning, feature selection techniques are applied in the identification of the relevant data. Feature selection is a pre-processing step that helps to reduce the dimensionality of data in machine learning. Feature selection techniques include information theoretic methods that are based on the entropy concept. This study experimented the efficiency of the feature selection techniques. It was realised that software defect prediction using significant attributes improves the prediction accuracy. A novel MICFastCR model, which is based on the Maximal Information Coefficient (MIC) was developed to select significant attributes and Fast Correlation Based Filter (FCBF) to eliminate redundant attributes. Machine learning algorithms were then run to predict software defects. The MICFastCR achieved the highest prediction accuracy as reported by various performance measures.School of ComputingPh. D. (Computer Science

Unisa Institutional Repository

Adaptive Genetic Algorithm Based Artificial Neural Network for Software Defect Prediction

Author: Prof. Bachala Sathyanarayana
Racharla Suresh Kumar
Publication venue: Global Journals Inc. (US)
Publication date: 04/11/2015
Field of study

To meet the requirement of an efficient software defect prediction,in this paper an evolutionary computing based neural network learning scheme has been developed that alleviates the existing Artificial Neural Network (ANN) limitations such as local minima and convergence issues. To achieve optimal software defect prediction, in this paper, Adaptive-Genetic Algorithm (A-GA) based ANN learning and weightestimation scheme has been developed. Unlike conventional GA, in this paper we have used adaptive crossover and mutation probability parameter that alleviates the issue of disruption towards optimal solution. We have used object oriented software metrics, CK metrics for fault prediction and the proposed Evolutionary Computing Based Hybrid Neural Network (HENN)algorithm has been examined for performance in terms of accuracy, precision, recall, F-measure, completeness etc, where it has performed better as compared to major existing schemes. The proposed scheme exhibited 97.99% prediction accuracy while ensuring optimal precision, Fmeasure and recall

DBBRBF- Convalesce optimization for software defect prediction problem using hybrid distribution base balance instance selection and radial basis Function classifier

Author: Panda Mrutyunjaya
Publication venue
Publication date: 08/06/2018
Field of study

Software is becoming an indigenous part of human life with the rapid development of software engineering, demands the software to be most reliable. The reliability check can be done by efficient software testing methods using historical software prediction data for development of a quality software system. Machine Learning plays a vital role in optimizing the prediction of defect-prone modules in real life software for its effectiveness. The software defect prediction data has class imbalance problem with a low ratio of defective class to non-defective class, urges an efficient machine learning classification technique which otherwise degrades the performance of the classification. To alleviate this problem, this paper introduces a novel hybrid instance-based classification by combining distribution base balance based instance selection and radial basis function neural network classifier model (DBBRBF) to obtain the best prediction in comparison to the existing research. Class imbalanced data sets of NASA, Promise and Softlab were used for the experimental analysis. The experimental results in terms of Accuracy, F-measure, AUC, Recall, Precision, and Balance show the effectiveness of the proposed approach. Finally, Statistical significance tests are carried out to understand the suitability of the proposed model.Comment: 32 pages, 24 Tables, 8 Figures

arXiv.org e-Print Archive

Applying Soft Computing Approaches to Predict Defect Density in Software Product Releases: An Empirical Study

Author: Kumar Rajesh
Kumar Vijay
Sharma Arun
Publication venue: Institute of Informatics, Slovak Academy of Sciences
Publication date: 22/03/2013
Field of study

There is non-linear relationship between software metrics and defects, which results to a complex mapping. Therefore, to focus on the defect density area, it is a critical business requirement of effective and practical approach, which can help find the defect density in software releases. Soft computing provides a better platform to solve the non-linear and complex mapping problem. The aim of this paper is to formulate, build, evaluate, validate and compare two main sections of soft computing, fuzzy logic and artificial neural network approaches in prediction of defect density of subsequent software product releases. In this research, these two approaches are formulated and applied to predict the existence of a defect in file of software release. Both approaches have also been validated against various releases of two commercial software product release data sets. The validation criteria include mean absolute error, root mean square error and graphical analysis. The analysis of the study shows that artificial neural network provides better results compared to Fuzzy Inference System; but applicability of best approach depends on the data availability and the quantum of data

AI Solutions for MDS: Artificial Intelligence Techniques for Misuse Detection and Localisation in Telecommunication Environments

Author: Anyakoha Chukwudi
Bauerdick H.
Gottfried B.
Mintram Robert
Muthuraman S.
Phalp Keith T.
Vincent Jonathan
Publication venue: 'Indiana University Press (Project Muse)'
Publication date: 15/07/2006
Field of study

This report considers the application of Articial Intelligence (AI) techniques to the problem of misuse detection and misuse localisation within telecommunications environments. A broad survey of techniques is provided, that covers inter alia rule based systems, model-based systems, case based reasoning, pattern matching, clustering and feature extraction, articial neural networks, genetic algorithms, arti cial immune systems, agent based systems, data mining and a variety of hybrid approaches. The report then considers the central issue of event correlation, that is at the heart of many misuse detection and localisation systems. The notion of being able to infer misuse by the correlation of individual temporally distributed events within a multiple data stream environment is explored, and a range of techniques, covering model based approaches, `programmed' AI and machine learning paradigms. It is found that, in general, correlation is best achieved via rule based approaches, but that these suffer from a number of drawbacks, such as the difculty of developing and maintaining an appropriate knowledge base, and the lack of ability to generalise from known misuses to new unseen misuses. Two distinct approaches are evident. One attempts to encode knowledge of known misuses, typically within rules, and use this to screen events. This approach cannot generally detect misuses for which it has not been programmed, i.e. it is prone to issuing false negatives. The other attempts to `learn' the features of event patterns that constitute normal behaviour, and, by observing patterns that do not match expected behaviour, detect when a misuse has occurred. This approach is prone to issuing false positives, i.e. inferring misuse from innocent patterns of behaviour that the system was not trained to recognise. Contemporary approaches are seen to favour hybridisation, often combining detection or localisation mechanisms for both abnormal and normal behaviour, the former to capture known cases of misuse, the latter to capture unknown cases. In some systems, these mechanisms even work together to update each other to increase detection rates and lower false positive rates. It is concluded that hybridisation offers the most promising future direction, but that a rule or state based component is likely to remain, being the most natural approach to the correlation of complex events. The challenge, then, is to mitigate the weaknesses of canonical programmed systems such that learning, generalisation and adaptation are more readily facilitated

Bournemouth University Research Online

Software Defect Prediction Based on Classication Rule Mining

Author: Sahana Dulal Chandra
Publication venue
Publication date: 01/01/2013
Field of study

There has been rapid growth of software development. Due to various causes, the software comes with many defects. In Software development process, testing of software is the main phase which reduces the defects of the software. If a developer or a tester can predict the software defects properly then, it reduces the cost, time and eort. In this paper, we show a comparative analysis of software defect prediction based on classifcation rule mining. We propose a scheme for this process and we choose different classication algorithms. Showing the comparison of predictions in software defects analysis. This evaluation analyzes the prediction performance of competing learning schemes for given historical data sets(NASA MDP Data Set). The result of this scheme evaluation shows that we have to choose different classifer rule for different data set

ethesis@nitr

Evolutionary Computing based an Efficient and Cost Effective Software Defect Prediction System

Author: Prof. Bachala Sathyanarayana
Racharla Suresh Kumar
Publication venue: Global Journals Inc. (US)
Publication date: 15/05/2015
Field of study

The earlier defect prediction and fault removal can play a vital role in ensuring software reliability and quality of service In this paper Hybrid Evolutionary computing based Neural Network HENN based software defect prediction model has been developed For HENN an adaptive genetic algorithm A-GA has been developed that alleviates the key existing limitations like local minima and convergence Furthermore the implementation of A-GA enables adaptive crossover and mutation probability selection that strengthens computational efficiency of our proposed system The proposed HENN algorithm has been used for adaptive weight estimation and learning optimization in ANN for defect prediction In addition a novel defect prediction and fault removal cost estimation model has been derived to evaluate the cost effectiveness of the proposed system The simulation results obtained for PROMISE and NASA MDP datasets exhibit the proposed model outperforms Levenberg Marquardt based ANN system LM-ANN and other systems as well And also cost analysis exhibits that the proposed HENN model is approximate 21 66 cost effective as compared to LM-AN