Feature subset selection and ranking for data dimensionality reduction
A new unsupervised forward orthogonal search (FOS) algorithm is introduced for feature selection and ranking. In the new algorithm, features are selected in a stepwise way, one at a time, by estimating the capability of each specified candidate feature subset to represent the overall features in the measurement space. A squared correlation function is employed as the criterion to measure the dependency between features, and this makes the new algorithm easy to implement. The forward orthogonalization strategy, which combines good effectiveness with high efficiency, enables the new algorithm to produce efficient feature subsets with a clear physical interpretation.
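The abstract gives enough to reconstruct the core loop: pick features one at a time, scoring each remaining candidate by how well its component orthogonal to the already-selected features correlates with the full measurement space. Below is a minimal NumPy sketch, assuming the summed squared correlation as the selection score; the paper's exact normalization may differ.

```python
import numpy as np

def fos_rank(X, n_select):
    """Unsupervised forward orthogonal search sketch.
    X: (n_samples, n_features) data matrix; returns selected indices in rank order."""
    Xc = X - X.mean(axis=0)                  # centre every feature column
    R = Xc.copy()                            # residual columns, orthogonalised as we go
    col_ss = np.einsum('ij,ij->j', Xc, Xc)   # per-feature sum of squares
    selected = []
    for _ in range(n_select):
        best_j, best_score = -1, -np.inf
        for j in range(X.shape[1]):
            if j in selected:
                continue
            q = R[:, j]
            qq = q @ q
            if qq < 1e-12:                   # candidate already fully explained
                continue
            # Summed squared correlation between the residual direction q
            # and every original (centred) feature.
            score = np.sum((q @ Xc) ** 2 / np.maximum(qq * col_ss, 1e-12))
            if score > best_score:
                best_j, best_score = j, score
        if best_j < 0:
            break                            # nothing left to explain
        selected.append(best_j)
        # Gram-Schmidt deflation: remove the chosen direction from the
        # remaining residual columns.
        u = R[:, best_j] / np.linalg.norm(R[:, best_j])
        R -= np.outer(u, u @ R)
    return selected
```

The Gram-Schmidt step is what makes the ranking "orthogonal": once a feature is selected, the variance it explains is removed from every remaining candidate before the next round of scoring.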
Practical feature subset selection for machine learning
Machine learning algorithms automatically extract knowledge from machine readable information. Unfortunately, their success is usually dependant on the quality of the data that they operate on. If the data is inadequate, or contains extraneous and irrelevant information, machine learning algorithms may produce less accurate and less understandable results, or may fail to discover anything of use at all. Feature subset selection can result in enhanced performance, a reduced hypothesis search space, and, in some cases, reduced storage requirement. This paper describes a new feature selection algorithm that uses a correlation based heuristic to determine the “goodness” of feature subsets, and evaluates its effectiveness with three common machine learning algorithms. Experiments using a number of standard machine learning data sets are presented. Feature subset selection gave significant improvement for all three algorithm
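The "goodness" heuristic rewards subsets whose features correlate strongly with the class but weakly with each other. A sketch of the standard correlation-based merit in this family follows; the paper's exact formulation may weight terms differently.

```python
import numpy as np

def cfs_merit(corr_fc, corr_ff, subset):
    """Correlation-based subset merit (standard CFS-style form).
    corr_fc: vector of |feature-class| correlations.
    corr_ff: matrix of |feature-feature| correlations with unit diagonal.
    subset:  list of feature indices."""
    k = len(subset)
    r_cf = corr_fc[subset].mean()                    # avg feature-class correlation
    if k == 1:
        return r_cf
    sub = np.ix_(subset, subset)
    r_ff = (corr_ff[sub].sum() - k) / (k * (k - 1))  # avg off-diagonal correlation
    return k * r_cf / np.sqrt(k + k * (k - 1) * r_ff)
```

Intuitively, the numerator grows with relevance to the class while the denominator penalizes redundancy among the chosen features.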
A new genetic algorithm for multi-label correlation-based feature selection.
This paper proposes a new Genetic Algorithm for Multi-Label Correlation-Based Feature Selection (GA-ML-CFS). This GA performs a global search in the space of candidate feature subsets in order to select a high-quality feature subset to be used by a multi-label classification algorithm: in this work, the Multi-Label k-NN algorithm. We compare the results of GA-ML-CFS with those of the previously proposed Hill-Climbing for Multi-Label Correlation-Based Feature Selection (HC-ML-CFS) across 10 multi-label datasets.
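As a rough illustration of the global search, a GA over boolean feature masks might look like the sketch below. The operators shown (binary tournament selection, uniform crossover, bit-flip mutation) are illustrative assumptions, not necessarily the paper's exact design, and `fitness` stands in for the multi-label CFS merit the paper plugs in.

```python
import numpy as np

rng = np.random.default_rng(0)

def ga_feature_search(n_features, fitness, pop=50, gens=100, p_cx=0.9, p_mut=0.01):
    # Population of boolean masks; each mask encodes one candidate subset.
    P = rng.random((pop, n_features)) < 0.5
    fit = np.array([fitness(ind) for ind in P])
    for _ in range(gens):
        children = []
        while len(children) < pop:
            # Binary tournament selection of two parents.
            i = rng.integers(pop, size=2)
            j = rng.integers(pop, size=2)
            a = P[i[np.argmax(fit[i])]].copy()
            b = P[j[np.argmax(fit[j])]].copy()
            if rng.random() < p_cx:
                # Uniform crossover: swap genes at randomly chosen positions.
                m = rng.random(n_features) < 0.5
                a[m], b[m] = b[m], a[m]
            for c in (a, b):
                # Bit-flip mutation toggles each gene with probability p_mut.
                flip = rng.random(n_features) < p_mut
                c[flip] = ~c[flip]
                children.append(c)
        P = np.array(children[:pop])
        fit = np.array([fitness(ind) for ind in P])
    return P[np.argmax(fit)]
```

The multi-label adaptation of the merit function is the paper's contribution and is not reproduced here; any callable from mask to score will drive the search.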
Differential Evolution based feature subset selection
In this paper, a novel feature selection algorithm based on the Differential Evolution (DE) optimization technique is presented. The new algorithm, called DEFS, modifies DE, which is a real-valued optimizer, to suit the problem of feature selection. The proposed DEFS substantially reduces computational cost while delivering strong performance. The DEFS technique is applied to a brain-computer interface (BCI) application and compared with other dimensionality reduction techniques. The practical results indicate the significance of the proposed algorithm in terms of solution optimality, memory requirements, and computational cost.
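DE mutates and recombines real-valued vectors, so using it for subset selection requires mapping real genes to a discrete mask. The sketch below uses the simplest such mapping, thresholding each gene at 0.5, within a standard DE/rand/1/bin loop; the DEFS paper's own encoding is more elaborate and is precisely what cuts its computational cost.

```python
import numpy as np

rng = np.random.default_rng(1)

def de_feature_select(n_features, score, pop=30, gens=200, F=0.5, CR=0.9):
    P = rng.random((pop, n_features))          # real-valued genomes in [0, 1)
    fit = np.array([score(ind > 0.5) for ind in P])
    for _ in range(gens):
        for i in range(pop):
            # DE/rand/1: perturb one random member by a scaled difference
            # of two others.
            a, b, c = rng.choice(pop, size=3, replace=False)
            mutant = P[a] + F * (P[b] - P[c])
            # Binomial crossover; force at least one gene from the mutant.
            cross = rng.random(n_features) < CR
            cross[rng.integers(n_features)] = True
            trial = np.where(cross, mutant, P[i])
            f = score(trial > 0.5)             # threshold genes to get a mask
            if f >= fit[i]:                    # greedy one-to-one replacement
                P[i], fit[i] = trial, f
    return P[np.argmax(fit)] > 0.5
```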
Feature subset selection: a correlation based filter approach
Recent work has shown that feature subset selection can have a positive effect on the performance of machine learning algorithms. Some algorithms can be slowed, or their performance adversely affected, by too much data, some of which may be irrelevant or redundant to the learning task. Feature subset selection, then, is a method of enhancing the performance of learning algorithms, reducing the hypothesis search space, and, in some cases, reducing the storage requirement. This paper describes a feature subset selector that uses a correlation-based heuristic to determine the goodness of feature subsets, and evaluates its effectiveness with three common ML algorithms: a decision tree inducer (C4.5), a naive Bayes classifier, and an instance-based learner (IB1). Experiments using a number of standard data sets drawn from real and artificial domains are presented. Feature subset selection gave significant improvement for all three algorithms, and C4.5 generated smaller decision trees.
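The heuristic only scores subsets; a search strategy must propose them. A plain greedy forward search, shown below as a simple stand-in for the best-first search usually paired with this kind of filter, works with any merit function such as `cfs_merit` above.

```python
def forward_select(n_features, merit, max_size=None):
    """Greedy forward search: repeatedly add the feature that most
    improves the subset merit, stopping when no addition helps."""
    selected, best = [], -float('inf')
    remaining = set(range(n_features))
    while remaining and (max_size is None or len(selected) < max_size):
        cand, cand_merit = None, best
        for f in sorted(remaining):
            m = merit(selected + [f])
            if m > cand_merit:
                cand, cand_merit = f, m
        if cand is None:                 # no feature improves the merit
            break
        selected.append(cand)
        remaining.remove(cand)
        best = cand_merit
    return selected
```

In practice `merit` would close over precomputed correlation matrices, e.g. `lambda s: cfs_merit(corr_fc, corr_ff, s)`.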
Feature Selection Library (MATLAB Toolbox)
Feature Selection Library (FSLib) is a widely applicable MATLAB library for Feature Selection (FS). FS is an essential component of machine learning and data mining that has been studied for many years under many different conditions and in diverse scenarios. FS algorithms aim to rank and select a subset of relevant features according to their degrees of relevance, preference, or importance, as defined in a specific application. Because feature selection reduces the number of features used to train classification models, it alleviates the curse of dimensionality, speeds up the learning process, improves model performance, and enhances data understanding. This short report provides an overview of the feature selection algorithms included in the FSLib MATLAB toolbox, spanning filter, embedded, and wrapper methods.
Comment: Feature Selection Library (FSLib) 201
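As a flavor of what the filter methods in such a toolbox compute (a generic illustration, not an FSLib API call): score each feature independently against the labels and sort by relevance.

```python
import numpy as np

def filter_rank(X, y):
    """Rank features by |Pearson correlation| with the labels,
    most relevant first. X: (n_samples, n_features); y: numeric labels."""
    Xc = X - X.mean(axis=0)
    yc = y - y.mean()
    denom = np.linalg.norm(Xc, axis=0) * np.linalg.norm(yc)
    scores = np.abs(yc @ Xc) / np.maximum(denom, 1e-12)
    return np.argsort(scores)[::-1]
```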
Exploring Language-Independent Emotional Acoustic Features via Feature Selection
We propose a novel feature selection strategy to discover language-independent acoustic features that tend to be responsible for emotions regardless of language, linguistic content, and other factors. Experimental results suggest that the discovered language-independent feature subset yields performance comparable to the full feature set on various emotional speech corpora.
Comment: 15 pages, 2 figures, 6 tables
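The abstract does not spell out the selection strategy, but one natural baseline for "language independence" is to keep only the features that rank highly in every corpus. A hypothetical sketch of that idea, with per-corpus rankings produced by any filter method:

```python
def stable_features(rankings, top_k=100):
    """Keep features ranked in the top_k of every corpus.
    rankings: list of arrays, each giving feature indices ranked per corpus."""
    common = set(rankings[0][:top_k])
    for r in rankings[1:]:
        common &= set(r[:top_k])
    return sorted(common)
```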
