Search CORE

11,805 research outputs found

DISCRIMINANT STEPWISE PROCEDURE

Author: Kubus Mariusz
Publication venue: Wydawnictwo Uniwersytetu Łódzkiego
Publication date: 01/01/2014
Field of study

Stepwise procedure is now probably the most popular tool for automatic feature selection. In the most cases it represents model selection approach which evaluates various feature subsets (so called wrapper). In fact it is heuristic search technique which examines the space of all possible feature subsets. This method is known in the literature under different names and variants. We organize the concepts and terminology, and show several variants of stepwise feature selection from a search strategy point of view. Short review of implementations in R will be given

Biblioteka Nauki - repozytorium artykuÅÃ³w

Directory of Open Access Journals

Repozytorium Uniwersytetu Łódzkiego (University of Lodz Repository)

Two-Stage Bagging Pruning for Reducing the Ensemble Size and Improving the Classification Performance

Author: Chen Bi
Jiang Bo
Shan Guogen
Song Yujie
Zhang Hua
Publication venue: Digital Scholarship@UNLV
Publication date: 01/01/2019
Field of study

Ensemble methods, such as the traditional bagging algorithm, can usually improve the performance of a single classifier. However, they usually require large storage space as well as relatively time-consuming predictions. Many approaches were developed to reduce the ensemble size and improve the classification performance by pruning the traditional bagging algorithms. In this article, we proposed a two-stage strategy to prune the traditional bagging algorithm by combining two simple approaches: accuracy-based pruning (AP) and distance-based pruning (DP). These two methods, as well as their two combinations, “AP+DP” and “DP+AP” as the two-stage pruning strategy, were all examined. Comparing with the single pruning methods, we found that the two-stage pruning methods can furthermore reduce the ensemble size and improve the classification. “AP+DP” method generally performs better than the “DP+AP” method when using four base classifiers: decision tree, Gaussian naive Bayes, K-nearest neighbor, and logistic regression. Moreover, as compared to the traditional bagging, the two-stage method “AP+DP” improved the classification accuracy by 0.88%, 4.06%, 1.26%, and 0.96%, respectively, averaged over 28 datasets under the four base classifiers. It was also observed that “AP+DP” outperformed other three existing algorithms Brag, Nice, and TB assessed on 8 common datasets. In summary, the proposed two-stage pruning methods are simple and promising approaches, which can both reduce the ensemble size and improve the classification accuracy

Directory of Open Access Journals

University of Nevada, Las Vegas Repository