3 research outputs found

    A Particle Swarm Optimized Learning Model of Fault Classification in Web-Apps

    Get PDF
    The term web-app defines the current dynamic pragmatics of the website, where the user has control. Finding faults in such dynamic content is challenging, as to whether the fault is exposed or not depends on its execution path. Moreover, the complexity and uniqueness of each web application make fault assessment an extremely laborious and expensive task. Also, artificial fault injection models are run in controlled and simulated environments, which may not be representative of the real-world fault data. Classifying faults can intelligently enhance the quality of the web-apps by the assessment of probable faults. In this paper, an empirical study is conducted to classify faults in bug reports of three open-source web-apps ( qaManager , bitWeaver , and WebCalendar ) and reviews of two play store web-apps ( Dineout: Reserve a Table and Wynk Music ). Five supervised learning algorithms (naive Bayesian, decision tree, support vector machines, KK -nearest neighbor, and multi-layer perceptron) have been first evaluated based on the conventional term frequency–inverse document frequency (tf-idf) feature extraction method, and subsequently, a feature selection method to improve classifier performance is proposed using particle swarm optimization (a nature-inspired, meta-heuristic algorithm). This paper is a preliminary exploratory study to build an automated tool, which can optimally categorize faults. The empirical analysis validates that the particle swarm optimization for feature selection in fault classification task outperforms the tf-idf filter-based classifiers with an average accuracy gain of about 11% and nearly 26% average feature reduction. The highest accuracy of 93.35% is shown by the decision tree after feature selection

    Feature Selection for Document Classification : Case Study of Meta-heuristic Intelligence and Traditional Approaches

    Get PDF
    Doctor of Philosophy (Computer Engineering), 2020Nowadays, the culture for accessing news around the world is changed from paper to electronic format and the rate of publication for newspapers and magazines on website are increased dramatically. Meanwhile, text feature selection for the automatic document classification (ADC) is becoming a big challenge because of the unstructured nature of text feature, which is called “multi-dimension feature problem”. On the other hand, various powerful schemes dealing with text feature selection are being developed continuously nowadays, but there still exists a research gap for “optimization of feature selection problem (OFSP)”, which can be looked for the global optimal features. Meanwhile, the capacity of meta-heuristic intelligence for knowledge discovery process (KDP) is also become the critical role to overcome NP-hard problem of OFSP by providing effective performance and efficient computation time. Therefore, the idea of meta-heuristic based approach for optimization of feature selection is proposed in this research to search the global optimal features for ADC. In this thesis, case study of meta-heuristic intelligence and traditional approaches for feature selection optimization process in document classification is observed. It includes eleven meta-heuristic algorithms such as Ant Colony search, Artificial Bee Colony search, Bat search, Cuckoo search, Evolutionary search, Elephant search, Firefly search, Flower search, Genetic search, Rhinoceros search, and Wolf search, for searching the optimal feature subset for document classification. Then, the results of proposed model are compared with three traditional search algorithms like Best First search (BFS), Greedy Stepwise (GS), and Ranker search (RS). In addition, the framework of data mining is applied. It involves data preprocessing, feature engineering, building learning model and evaluating the performance of proposed meta-heuristic intelligence-based feature selection using various performance and computation complexity evaluation schemes. In data processing, tokenization, stop-words handling, stemming and lemmatizing, and normalization are applied. In feature engineering process, n-gram TF-IDF feature extraction is used for implementing feature vector and both filter and wrapper approach are applied for observing different cases. In addition, three different classifiers like J48, Naïve Bayes, and Support Vector Machine, are used for building the document classification model. According to the results, the proposed system can reduce the number of selected features dramatically that can deteriorate learning model performance. In addition, the selected global subset features can yield better performance than traditional search according to single objective function of proposed model

    A Particle Swarm Optimized Learning Model of Fault Classification in Web-Apps

    No full text
    corecore