Unsupervised spectral sub-feature learning for hyperspectral image classification
Spectral pixel classification is one of the principal techniques used in hyperspectral image (HSI) analysis. In this article, we propose an unsupervised feature learning method for classification of hyperspectral images. The proposed method learns a dictionary of sub-feature basis representations from the spectral domain, which allows effective use of the correlated spectral data. The learned dictionary is then used to encode convolutional samples from the hyperspectral input pixels into an expanded but sparse feature space. Expanded hyperspectral feature representations enable linear separation between object classes present in an image. To evaluate the proposed method, we performed experiments on several commonly used HSI data sets acquired at different locations and by different sensors. Our experimental results show that the proposed method outperforms other pixel-wise classification methods that make use of unsupervised feature extraction approaches. Additionally, even though our approach does not use any prior knowledge or labelled training data to learn features, it yields results that are advantageous or comparable, in terms of classification accuracy, with respect to recent semi-supervised methods.
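The dictionary-and-sparse-encoding step described above can be sketched as follows. This is a minimal illustration on synthetic data: the dictionary is simply a random sample of pixels and the encoder is a soft-threshold rule, both hypothetical stand-ins for the paper's learned sub-feature bases.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy stand-in for hyperspectral pixels: n samples with b spectral bands
# (hypothetical data; the paper uses real HSI scenes).
n, b, k = 200, 16, 32
X = rng.normal(size=(n, b))
X /= np.linalg.norm(X, axis=1, keepdims=True)

# "Dictionary" of unit-norm spectral atoms. Here we just sample pixels;
# the paper learns sub-feature bases from the spectral domain.
D = X[rng.choice(n, size=k, replace=False)]          # shape (k, b)

def encode(x, D, alpha=0.25):
    """Soft-threshold encoding: correlate with each atom, keep strong responses."""
    z = D @ x                                         # responses to the k atoms
    return np.maximum(z - alpha, 0.0)                 # sparse, expanded feature

Z = np.vstack([encode(x, D) for x in X])              # (n, k) sparse features
```

The key point the abstract makes is visible here: the encoded representation `Z` is wider than the input (k > b) yet mostly zero, which is what makes the classes closer to linearly separable.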
Unsupervised Feature Selection with Adaptive Structure Learning
The problem of feature selection has attracted considerable interest over the
past decade. Traditional unsupervised methods select the features which can
faithfully preserve the intrinsic structures of data, where the intrinsic
structures are estimated using all the input features of data. However, the
estimated intrinsic structures are unreliable/inaccurate when the redundant and
noisy features are not removed. Therefore, we face a dilemma: one needs the
true structures of the data to identify the informative features, and one needs
the informative features to accurately estimate those true structures. To
address this, we propose a unified learning framework which performs structure
learning and feature selection simultaneously. The structures are adaptively
learned from the results of feature selection, and the informative features are
reselected to preserve the refined structures of data. By leveraging the
interactions between these two essential tasks, we are able to capture accurate
structures and select more informative features. Experimental results on many
benchmark data sets demonstrate that the proposed method outperforms many
state-of-the-art unsupervised feature selection methods.
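The alternating scheme the abstract describes can be sketched as follows. The Gaussian similarity graph and the Laplacian-style smoothness score below are assumed stand-ins for the paper's structure-learning and selection criteria, on synthetic data with a hidden two-cluster structure carried by the first few features.

```python
import numpy as np

rng = np.random.default_rng(1)
n, d, m = 80, 20, 5                      # samples, features, features to keep
labels = rng.integers(0, 2, n)           # hidden two-cluster structure
X = rng.normal(size=(n, d))
X[:, :m] += 4.0 * labels[:, None]        # only the first m features carry it

selected = np.arange(d)                  # start from all features
for _ in range(3):                       # alternate structure estimation / selection
    Xs = X[:, selected]
    # Structure: Gaussian similarity graph from the *currently selected* features.
    d2 = ((Xs[:, None, :] - Xs[None, :, :]) ** 2).sum(-1)
    W = np.exp(-d2 / d2.mean())
    L = np.diag(W.sum(1)) - W            # graph Laplacian
    # Score every feature by smoothness on that graph (lower = fits structure),
    # then reselect the m smoothest features.
    F = X - X.mean(0)
    scores = np.einsum('nf,nm,mf->f', F, L, F) / (F ** 2).sum(0)
    selected = np.argsort(scores)[:m]
```

Each pass refines the graph using only the features chosen in the previous pass, which is the interaction between the two tasks that the abstract credits for the improved selection.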
Unsupervised feature-learning for galaxy SEDs with denoising autoencoders
With the increasing number of deep multi-wavelength galaxy surveys, the
spectral energy distribution (SED) of galaxies has become an invaluable tool
for studying the formation of their structures and their evolution. In this
context, standard analysis relies on simple spectro-photometric selection
criteria based on a few SED colors. While this fully supervised classification
has already yielded clear achievements, it is not optimal for extracting relevant
information from the data. In this article, we propose to employ very recent
advances in machine learning, and more precisely in feature learning, to derive
a data-driven diagram. We show that the proposed approach based on denoising
autoencoders recovers the bi-modality in the galaxy population in an
unsupervised manner, without using any prior knowledge on galaxy SED
classification. This technique has been compared to principal component
analysis (PCA) and to standard color/color representations. In addition,
preliminary results illustrate that this enables the capturing of extra
physically meaningful information, such as redshift dependence, galaxy mass
evolution and variation over the specific star formation rate. PCA also yields
an unsupervised representation with physical properties, such as mass and
sSFR, although it separates out other characteristics (bimodality, redshift
evolution) less well than denoising autoencoders.
Comment: 11 pages and 15 figures. To be published in A&
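A minimal denoising autoencoder of the kind described can be sketched in a few lines. The toy "SEDs" (two spectral shapes standing in for a bimodal galaxy population), network size, and training settings below are illustrative assumptions, not the paper's setup.

```python
import numpy as np

rng = np.random.default_rng(2)

# Toy "SEDs": two smooth spectral shapes plus noise (hypothetical data;
# the paper uses real multi-wavelength photometry).
n, b, h = 400, 12, 2
t = np.linspace(0.0, 1.0, b)
pop = rng.integers(0, 2, n)
X = np.where(pop[:, None] == 0, np.exp(-3.0 * t), t ** 1.5)
X = X + 0.05 * rng.normal(size=X.shape)

# Single-hidden-layer denoising autoencoder with tied weights.
W = 0.1 * rng.normal(size=(b, h))
b1, b2 = np.zeros(h), np.zeros(b)
sigmoid = lambda z: 1.0 / (1.0 + np.exp(-z))

mse0 = np.mean((sigmoid(X @ W + b1) @ W.T + b2 - X) ** 2)
lr = 0.5
for _ in range(500):
    Xn = X + 0.1 * rng.normal(size=X.shape)   # corrupt the input
    H = sigmoid(Xn @ W + b1)                  # encode
    R = H @ W.T + b2                          # decode (linear output, tied weights)
    E = R - X                                 # reconstruct the *clean* SED
    G = (E @ W) * H * (1.0 - H)               # backprop through the sigmoid
    W -= lr / n * (Xn.T @ G + E.T @ H)        # gradient combines both weight roles
    b1 -= lr / n * G.sum(0)
    b2 -= lr / n * E.sum(0)

mse1 = np.mean((sigmoid(X @ W + b1) @ W.T + b2 - X) ** 2)
codes = sigmoid(X @ W + b1)                    # 2-D unsupervised representation
```

The two-unit code plays the role of the data-driven diagram: it is learned purely from reconstruction, with no labels, so any bimodality it reveals comes from the data itself.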
Feature Selection for Linear SVM with Provable Guarantees
We give two provably accurate feature-selection techniques for the linear
SVM. The algorithms run in deterministic and randomized time respectively. Our
algorithms can be used in an unsupervised or supervised setting. The supervised
approach is based on sampling features from support vectors. We prove that the
margin in the feature space is preserved to within ε-relative error of
the margin in the full feature space in the worst-case. In the unsupervised
setting, we also provide worst-case guarantees of the radius of the minimum
enclosing ball, thereby ensuring comparable generalization as in the full
feature space and resolving an open problem posed in Dasgupta et al. We present
extensive experiments on real-world datasets to support our theory and to
demonstrate that our method is competitive and often better than prior
state-of-the-art, for which there are no known provable guarantees.
Comment: Appearing in Proceedings of 18th AISTATS, JMLR W&CP, vol 38, 201
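The unsupervised sampling idea can be sketched as follows. The squared-column-norm importance scores are an assumed stand-in (the paper's supervised variant samples features using the support vectors); the rescaling makes the sampled Gram matrix an unbiased estimate of the full one, which is what underlies margin and radius preservation.

```python
import numpy as np

rng = np.random.default_rng(3)
n, d, r = 60, 400, 250                 # samples, features, features to sample
X = rng.normal(size=(n, d))

# Importance scores: squared column norms of the data (an unsupervised,
# hypothetical stand-in for the paper's feature scores).
p = (X ** 2).sum(axis=0)
p /= p.sum()

# Sample r feature indices with replacement, rescale so that inner products
# (hence the kernel the linear SVM sees) are preserved in expectation.
idx = rng.choice(d, size=r, replace=True, p=p)
Xs = X[:, idx] / np.sqrt(r * p[idx])

full = X @ X.T                          # Gram matrix on all features
approx = Xs @ Xs.T                      # Gram matrix on sampled features
rel_err = np.linalg.norm(approx - full) / np.linalg.norm(full)
```

Since a linear SVM's margin depends on the data only through these inner products, keeping `approx` close to `full` is the mechanism by which the sampled feature space retains comparable generalization.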