Compressively Sensed Image Recognition

Author: Aslan Sinem
Degerli Aysen
Gabbouj Moncef
Sankur Bulent
Yamac Mehmet
Publication date: 01/01/2018

Compressive Sensing (CS) theory asserts that sparse signal reconstruction is possible from a small number of linear measurements. Although CS enables low-cost linear sampling, it requires non-linear and costly reconstruction. Recent works show that compressive image classification is possible in the CS domain without reconstructing the signal. In this work, we introduce a DCT-based method that extracts binary discriminative features directly from CS measurements. These CS measurements can be obtained by using (i) a random or a pseudo-random measurement matrix, or (ii) a measurement matrix whose elements are learned from the training data to optimize the given classification task. We further introduce feature fusion by concatenating the Bag of Words (BoW) representation of our binary features with one of two state-of-the-art CNN-based feature vectors. We show that our fused feature outperforms the state of the art in both cases. Comment: 6 pages, submitted/accepted, EUVIP 201
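The linear measurement step mentioned in this abstract can be illustrated with a minimal sketch. It assumes option (i), a random Gaussian measurement matrix, and the names `gaussian_measurement_matrix` and `measure` are hypothetical. It does not implement the paper's DCT-based feature extraction or the learned measurement matrix of option (ii).

```python
import random

def gaussian_measurement_matrix(m, n, seed=0):
    """Return an m x n matrix with i.i.d. N(0, 1/m) entries (assumed choice)."""
    rng = random.Random(seed)
    scale = 1.0 / m ** 0.5
    return [[rng.gauss(0.0, 1.0) * scale for _ in range(n)] for _ in range(m)]

def measure(phi, x):
    """Compute the CS measurements y = phi @ x as plain dot products."""
    return [sum(p * v for p, v in zip(row, x)) for row in phi]

n, m = 64, 16                        # signal length and number of measurements (m << n)
x = [0.0] * n
x[3], x[20], x[41] = 1.5, -2.0, 0.7  # a sparse signal with three nonzero entries
phi = gaussian_measurement_matrix(m, n)
y = measure(phi, x)                  # 16 numbers stand in for the 64-sample signal
```

A classifier working in the CS domain would take `y` as input directly instead of first reconstructing `x`.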

arXiv.org e-Print Archive

Ege University Institutional Repository

VTT Research System

Preconditioned Data Sparsification for Big Data with Applications to PCA and K-means

Author: Becker Stephen
Pourkamali-Anaraki Farhad
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 19/09/2016

We analyze a compression scheme for large data sets that randomly keeps a small percentage of the components of each data sample. The benefit is that the output is a sparse matrix and therefore subsequent processing, such as PCA or K-means, is significantly faster, especially in a distributed-data setting. Furthermore, the sampling is single-pass and applicable to streaming data. The sampling mechanism is a variant of previous methods proposed in the literature combined with a randomized preconditioning to smooth the data. We provide guarantees for PCA in terms of the covariance matrix, and guarantees for K-means in terms of the error in the center estimators at a given step. We present numerical evidence to show both that our bounds are nearly tight and that our algorithms provide a real benefit when applied to standard test data sets, as well as providing certain benefits over related sampling approaches. Comment: 28 pages, 10 figure
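The sampling idea in this abstract, keeping a few components of each sample, can be sketched as follows. The sketch assumes a fixed number of components kept uniformly at random and rescaled by `n / keep` so that each entry is unbiased in expectation. The function name `sparsify` is hypothetical, and the randomized preconditioning step from the abstract is omitted.

```python
import random

def sparsify(sample, keep, rng):
    """Keep `keep` randomly chosen components and rescale them by n / keep.

    Returns a sparse {index: value} mapping. With this rescaling the expected
    value of every entry equals the original entry (an assumed design choice).
    """
    n = len(sample)
    kept = rng.sample(range(n), keep)
    scale = n / keep
    return {i: sample[i] * scale for i in kept}

rng = random.Random(0)
sample = [1.0, 2.0, 3.0, 4.0]
sparse = sparsify(sample, keep=2, rng=rng)   # only 2 of the 4 entries are stored
```

Each sample is processed once and independently of the others, which is why a scheme of this kind suits streaming data.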

arXiv.org e-Print Archive

CU Scholar Institutional Repository

Crossref

Dimension Reduction by Mutual Information Discriminant Analysis

Author: Shadvar Ali
Publication date: 01/01/2012

In the past few decades, researchers have proposed many discriminant analysis (DA) algorithms for the study of high-dimensional data in a variety of problems. Most DA algorithms for feature extraction are based on transformations that simultaneously maximize the between-class scatter and minimize the within-class scatter matrices. This paper presents a novel DA algorithm for feature extraction using mutual information (MI). However, it is not always easy to obtain an accurate estimation for high-dimensional MI. In this paper, we propose an efficient method for feature extraction that is based on one-dimensional MI estimations. We will refer to this algorithm as mutual information discriminant analysis (MIDA). The performance of this proposed method was evaluated using UCI databases. The results indicate that MIDA provides robust performance over different data sets with different characteristics and that MIDA always performs better than, or at least comparable to, the best performing algorithms. Comment: 13 pages, 3 tables, International Journal of Artificial Intelligence & Application
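A one-dimensional MI estimate of the kind this abstract relies on can be sketched with a plug-in estimator over counts. This assumes the feature has already been discretized, and the function name `mutual_information` is hypothetical. It is not the MIDA algorithm itself, only the basic quantity between one feature and the class labels.

```python
from collections import Counter
from math import log2

def mutual_information(feature, labels):
    """Plug-in estimate of I(feature; labels) in bits from empirical counts."""
    n = len(feature)
    joint = Counter(zip(feature, labels))
    feature_counts = Counter(feature)
    label_counts = Counter(labels)
    mi = 0.0
    for (f, c), count in joint.items():
        # p(f, c) * log2( p(f, c) / (p(f) * p(c)) ), written in terms of counts
        mi += (count / n) * log2(count * n / (feature_counts[f] * label_counts[c]))
    return mi

labels = [0, 0, 1, 1]
informative = mutual_information([0, 0, 1, 1], labels)    # feature determines the class
uninformative = mutual_information([0, 1, 0, 1], labels)  # feature independent of class
```

Because only one feature is involved at a time, the estimate needs a two-way count table and avoids the high-dimensional density estimation the abstract describes as difficult.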

arXiv.org e-Print Archive

CiteSeerX

Randomized Dimensionality Reduction for k-means Clustering

Author: Boutsidis Christos
Drineas Petros
Mahoney Michael W.
Zouzias Anastasios
Publication date: 01/01/2013

We study the topic of dimensionality reduction for k-means clustering. Dimensionality reduction encompasses the union of two approaches: feature selection and feature extraction. A feature selection based algorithm for k-means clustering selects a small subset of the input features and then applies k-means clustering on the selected features. A feature extraction based algorithm for k-means clustering constructs a small set of new artificial features and then applies k-means clustering on the constructed features. Despite the significance of k-means clustering as well as the wealth of heuristic methods addressing it, provably accurate feature selection methods for k-means clustering are not known. On the other hand, two provably accurate feature extraction methods for k-means clustering are known in the literature; one is based on random projections and the other is based on the singular value decomposition (SVD). This paper makes further progress towards a better understanding of dimensionality reduction for k-means clustering. Namely, we present the first provably accurate feature selection method for k-means clustering and, in addition, we present two feature extraction methods. The first feature extraction method is based on random projections and it improves upon the existing results in terms of time complexity and number of features needed to be extracted. The second feature extraction method is based on fast approximate SVD factorizations and it also improves upon the existing results in terms of time complexity. The proposed algorithms are randomized and provide constant-factor approximation guarantees with respect to the optimal k-means objective value. Comment: IEEE Transactions on Information Theory, to appea
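The feature extraction route described in this abstract, constructing a few artificial features and then clustering them, can be sketched as a random projection followed by plain Lloyd iterations. The random sign matrix, the dimensions, and all function names here are illustrative assumptions. The sketch makes no claim to the approximation guarantees or running times of the paper's algorithms.

```python
import random

def random_sign_projection(points, r, rng):
    """Map d-dimensional points to r dimensions with a random +/- 1/sqrt(r) matrix."""
    d = len(points[0])
    scale = 1.0 / r ** 0.5
    matrix = [[rng.choice((-scale, scale)) for _ in range(r)] for _ in range(d)]
    return [[sum(p[i] * matrix[i][j] for i in range(d)) for j in range(r)]
            for p in points]

def sq_dist(a, b):
    return sum((x - y) ** 2 for x, y in zip(a, b))

def assign(points, centers):
    """Label each point with the index of its nearest center."""
    return [min(range(len(centers)), key=lambda c: sq_dist(p, centers[c]))
            for p in points]

def kmeans(points, k, rng, iters=20):
    """Plain Lloyd iterations; returns one cluster label per point."""
    centers = [list(p) for p in rng.sample(points, k)]
    for _ in range(iters):
        labels = assign(points, centers)
        for c in range(k):
            members = [p for p, l in zip(points, labels) if l == c]
            if members:  # keep the old center if a cluster becomes empty
                centers[c] = [sum(col) / len(members) for col in zip(*members)]
    return assign(points, centers)

rng = random.Random(0)
d, r = 40, 8                                   # original and reduced dimensions
cluster_a = [[rng.gauss(0.0, 0.5) for _ in range(d)] for _ in range(20)]
cluster_b = [[rng.gauss(10.0, 0.5) for _ in range(d)] for _ in range(20)]
points = cluster_a + cluster_b
reduced = random_sign_projection(points, r, rng)
labels = kmeans(reduced, 2, rng)               # cluster on 8 features instead of 40
```

Each distance computation inside the clustering loop touches `r` coordinates instead of `d`, which is where the time saving of feature extraction comes from.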

arXiv.org e-Print Archive

CiteSeerX

Entropy-based feature extraction for electromagnetic discharges classification in high-voltage power generation

Author: Boreham Philip
Mitiche Imene
Morison Gordon
Nesbitt Alan
Stewart Brian G.
Publication venue: 'MDPI AG'
Publication date: 01/07/2018

This work exploits four entropy measures known as Sample, Permutation, Weighted Permutation, and Dispersion Entropy to extract relevant information from Electromagnetic Interference (EMI) discharge signals that are useful in fault diagnosis of High-Voltage (HV) equipment. Multi-class classification algorithms are used to classify or distinguish between various discharge sources such as Partial Discharges (PD), Exciter, Arcing, micro Sparking and Random Noise. The signals were measured and recorded at different sites, followed by an EMI expert’s data analysis in order to identify and label the discharge source type contained within the signal. The classification was performed both within each site and across all sites. The system performs well in both cases, with extremely high classification accuracy within site. This work demonstrates the ability to extract relevant entropy-based features from time-resolved EMI discharge signals with minimal computation, making the system ideal for a potential application to online condition monitoring based on EMI
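Of the four measures named in this abstract, permutation entropy is the simplest to illustrate. The sketch below follows the standard definition (Shannon entropy of ordinal patterns in sliding windows) and assumes a delay of one sample and order 3. The function name and parameters are hypothetical and are not taken from the paper.

```python
import random
from collections import Counter
from math import factorial, log

def permutation_entropy(signal, order=3, normalize=True):
    """Shannon entropy of the ordinal patterns found in windows of length `order`."""
    patterns = Counter(
        # the ordinal pattern is the index order that sorts the window
        tuple(sorted(range(order), key=lambda i: signal[start + i]))
        for start in range(len(signal) - order + 1)
    )
    total = sum(patterns.values())
    h = -sum((c / total) * log(c / total) for c in patterns.values())
    # dividing by log(order!) maps the result into [0, 1]
    return h / log(factorial(order)) if normalize else h

rng = random.Random(0)
ramp = list(range(100))                           # fully regular: one pattern only
noise = [rng.random() for _ in range(500)]        # irregular: all patterns occur
pe_ramp = permutation_entropy(ramp)
pe_noise = permutation_entropy(noise)
```

The computation is a single pass of small sorts over the signal, in line with the abstract's point that such features need minimal computation.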

Multidisciplinary Digital Publishing Institute

University of Strathclyde Institutional Repository

Directory of Open Access Journals

ResearchOnline@GCU