2,225 research outputs found
Statistical Analysis for Revealing Defects in Software Projects
Dissertation presented as the partial requirement for obtaining a Master's degree in Information Management, specialization in Information Systems and Technologies ManagementDefect detection in software is the procedure to identify parts of software that may comprise
defects. Software companies always seek to improve the performance of software projects in terms
of quality and efficiency. They also seek to deliver the soft-ware projects without any defects to the
communities and just in time. The early revelation of defects in software projects is also tried to
avoid failure of those projects, save costs, team effort, and time. Therefore, these companies need to
build an intelligent model capable of detecting software defects accurately and efficiently.
This study seeks to achieve two main objectives. The first goal is to build a statistical model to
identify the critical defect factors that influence software projects. The second objective is to build a
statistical model to reveal defects early in software pro-jects as reasonable accurately. A bibliometric
map (VOSviewer) was used to find the relationships between the common terms in those domains.
The results of this study are divided into three parts:
In the first part The term "software engineering" is connected to "cluster," "regression," and "neural
network." Moreover, the terms "random forest" and "feature selection" are connected to "neural
network," "recall," and "software engineering," "cluster," "regression," and "fault prediction model"
and "software defect prediction" and "defect density."
In the second part We have checked and analyzed 29 manuscripts in detail, summarized their major
contributions, and identified a few research gaps.
In the third part Finally, software companies try to find the critical factors that affect the detection of
software defects and find any of the intelligent or statistical methods that help to build a model
capable of detecting those defects with high accuracy.
Two statistical models (Multiple linear regression (MLR) and logistic regression (LR)) were used to
find the critical factors and through them to detect software defects accurately. MLR is executed by
using two methods which are critical defect factors (CDF) and premier list of software defect factors
(PLSDF). The accuracy of MLR-CDF and MLR-PLSDF is 82.3 and 79.9 respectively. The standard error
of MLR-CDF and MLR-PLSDF is 26% and 28% respectively. In addition, LR is executed by using two
methods which are CDF and PLSDF. The accuracy of LR-CDF and LR-PLSDF is 86.4 and 83.8
respectively. The standard error of LR-CDF and LR-PLSDF is 22% and 25% respectively. Therefore, LRCDF
outperforms on all the proposed models and state-of-the-art methods in terms of accuracy and
standard error
Software quality and reliability prediction using Dempster -Shafer theory
As software systems are increasingly deployed in mission critical applications, accurate quality and reliability predictions are becoming a necessity. Most accurate prediction models require extensive testing effort, implying increased cost and slowing down the development life cycle. We developed two novel statistical models based on Dempster-Shafer theory, which provide accurate predictions from relatively small data sets of direct and indirect software reliability and quality predictors. The models are flexible enough to incorporate information generated throughout the development life-cycle to improve the prediction accuracy.;Our first contribution is an original algorithm for building Dempster-Shafer Belief Networks using prediction logic. This model has been applied to software quality prediction. We demonstrated that the prediction accuracy of Dempster-Shafer Belief Networks is higher than that achieved by logistic regression, discriminant analysis, random forests, as well as the algorithms in two machine learning software packages, See5 and WEKA. The difference in the performance of the Dempster-Shafer Belief Networks over the other methods is statistically significant.;Our second contribution is also based on a practical extension of Dempster-Shafer theory. The major limitation of the Dempsters rule and other known rules of evidence combination is the inability to handle information coming from correlated sources. Motivated by inherently high correlations between early life-cycle predictors of software reliability, we extended Murphy\u27s rule of combination to account for these correlations. When used as a part of the methodology that fuses various software reliability prediction systems, this rule provided more accurate predictions than previously reported methods. In addition, we proposed an algorithm, which defines the upper and lower bounds of the belief function of the combination results. To demonstrate its generality, we successfully applied it in the design of the Online Safety Monitor, which fuses multiple correlated time varying estimations of convergence of neural network learning in an intelligent flight control system
Computer Vision System for Non-Destructive and Contactless Evaluation of Quality Traits in Fresh Rocket Leaves (Diplotaxis Tenuifolia L.)
La tesi di dottorato è incentrata sull'analisi di tecnologie non distruttive per il controllo della
qualità dei prodotti agroalimentari, lungo l'intera filiera agroalimentare. In particolare, la tesi
riguarda l'applicazione del sistema di visione artificiale per valutare la qualità delle foglie di
rucola fresh-cut. La tesi è strutturata in tre parti (introduzione, applicazioni sperimentali e
conclusioni) e in cinque capitoli, rispettivamente il primo e il secondo incentrati sulle
tecnologie non distruttive e in particolare sui sistemi di computer vision per il monitoraggio
della qualità dei prodotti agroalimentari. Il terzo, quarto e quinto capitolo mirano a valutare le
foglie di rucola sulla base della stima di parametri qualitativi, considerando diversi aspetti: (i)
la variabilità dovuta alle diverse pratiche agricole, (ii) la senescenza dei prodotti confezionati
e non, e (iii) lo sviluppo e sfruttamento dei vantaggi di nuovi modelli più semplici rispetto al
machine learning utilizzato negli esperimenti precedenti. Il lavoro di ricerca di questa tesi di
dottorato è stato svolto dall'Università di Foggia, dall'Istituto di Scienze delle Produzioni
Alimentari (ISPA) e dall'Istituto di Tecnologie e Sistemi Industriali Intelligenti per le
Manifatture Avanzate (STIIMA) del Consiglio Nazionale delle Ricerche (CNR). L’attività di
ricerca è stata condotta nell'ambito del Progetto SUS&LOW (Sustaining Low-impact Practices
in Horticulture through Non-destructive Approach to Provide More Information on Fresh
Produce History & Quality), finanziato dal MUR-PRIN 2017, e volto a sostenere la qualitÃ
della produzione e dell'ambiente utilizzando pratiche agricole a basso input e la valutazione
non distruttiva della qualità di prodotti ortofrutticoli.The doctoral thesis focused on the analysis of non-destructive technologies available for the
control quality of agri-food products, along the whole supply chain. In particular, the thesis
concerns the application of computer vision system to evaluate the quality of fresh rocket
leaves. The thesis is structured in three parts (introduction, experimental applications and
conclusions) and in 5 chapters, the first and second focused on non-destructive technologies
and in particular on computer vision systems for monitoring the quality of agri-food products,
respectively. The third, quarter, and fifth chapters aim to assess the rocket leaves based on the
estimation of quality aspects, considering different aspects: (i) the variability due to the
different agricultural practices, (ii) the senescence of packed and unpacked products, and (iii)
development and exploitation of the advantages of new models simpler than the machine
learning used in the previous experiments. The research work of this doctoral thesis was carried
out by the University of Foggia, the Institute of Science of Food Production (ISPA) and the
Institute of Intelligent Industrial Technologies and Systems for Advanced Manufacturing
(STIIMA) of National Research Council (CNR). It was conducted within the Project
SUS&LOW (Sustaining Low-impact Practices in Horticulture through Non-destructive
Approach to Provide More Information on Fresh Produce History & Quality), funded by MUR-
PRIN 2017, and aimed at sustaining quality of production and of the environment using low
input agricultural practices and non-destructive quality evaluation
Software Defect Prediction Using AWEIG+ADACOST Bayesian Algorithm for Handling High Dimensional Data and Class Imbalance Problem
The most important part in software engineering is a software defect prediction. Software defect prediction is defined as a software prediction process from errors, failures, and system errors. Machine learning methods are used by researchers to predict software defects including estimation, association, classification, clustering, and datasets analysis. Datasets of NASA Metrics Data Program (NASA MDP) is one of the metric software that researchers use to predict software defects. NASA MDP datasets contain unbalanced classes and high dimensional data, so they will affect the classification evaluation results to be low. In this research, data with unbalanced classes will be solved by the AdaCost method and high dimensional data will be handled with the Average Weight Information Gain (AWEIG) method, while the classification method that will be used is the Naïve Bayes algorithm. The proposed method is named AWEIG + AdaCost Bayesian. In this experiment, the AWEIG + AdaCost Bayesian algorithm is compared to the Naïve Bayesian algorithm. The results showed the mean of Area Under the Curve (AUC) algorithm AWEIG + AdaCost Bayesian yields better than just a Naïve Bayes algorithm with respectively mean of AUC values are 0.752 and 0.696
Applied Mathematics and Computational Physics
As faster and more efficient numerical algorithms become available, the understanding of the physics and the mathematical foundation behind these new methods will play an increasingly important role. This Special Issue provides a platform for researchers from both academia and industry to present their novel computational methods that have engineering and physics applications
- …