659 research outputs found
Wavelet-based techniques for speech recognition
In this thesis, new wavelet-based techniques have been developed for the
extraction of features from speech signals for the purpose of automatic speech
recognition (ASR). One of the advantages of the wavelet transform over the short
time Fourier transform (STFT) is its capability to process non-stationary signals.
Since speech signals are not strictly stationary the wavelet transform is a better
choice for time-frequency transformation of these signals. In addition it has
compactly supported basis functions, thereby reducing the amount of
computation as opposed to STFT where an overlapping window is needed. [Continues.
Overview of Wavelet Analysis and Applications to Engineering
Dr. Alessio Medda’s research interest are in digital signal and image processing, mathematical theory and applications. His experience is in signal modeling and analysis for data interpretation and representation applied to a variety of cases.Presented on October 24, 2014 at 1:00 p.m. in the Jesse W. Mason Building, room 3133.Runtime: 65:26 minute
Recommended from our members
Seismological data acquisition and signal processing using wavelets
This thesis was submitted for the degree of Doctor of Philosophy and awarded by Brunel University.This work deals with two main fields:
a) The design, built, installation, test, evaluation, deployment and maintenance of Seismological Network of Crete (SNC) of the Laboratory of Geophysics and Seismology (LGS) at Technological Educational Institute (TEI) at Chania.
b) The use of Wavelet Transform (WT) in several applications during the operation of the aforementioned network.
SNC began its operation in 2003. It is designed and built in order to provide denser network coverage, real time data transmission to CRC, real time telemetry, use of wired ADSL lines and dedicated private satellite links, real time data processing and estimation of source parameters as well as rapid dissemination of results. All the above are implemented using commercial hardware and software which is modified and where is necessary, author designs and deploy additional software modules. Up to now (July 2008) SNC has recorded 5500 identified events (around 970 more than those reported by national bulletin the same period) and its seismic catalogue is complete for magnitudes over 3.2, instead national catalogue which was complete for magnitudes over 3.7 before the operation of SNC.
During its operation, several applications at SNC used WT as a signal processing tool.
These applications benefited from the adaptation of WT to non-stationary signals such as the seismic signals. These applications are:
HVSR method. WT used to reveal undetectable non-stationarities in order to eliminate errors in site’s fundamental frequency estimation. Denoising. Several wavelet denoising schemes compared with the widely used in seismology band-pass filtering in order to prove the superiority of wavelet denoising and to choose the most appropriate scheme for different signal to noise ratios of seismograms.
EEWS. WT used for producing magnitude prediction equations and epicentral estimations from the first 5 secs of P wave arrival. As an alternative analysis tool for detection of significant indicators in temporal patterns of seismicity. Multiresolution wavelet analysis of seismicity used to estimate (in a several years time period) the time where the maximum emitted earthquake energy was observed
A Panorama on Multiscale Geometric Representations, Intertwining Spatial, Directional and Frequency Selectivity
The richness of natural images makes the quest for optimal representations in
image processing and computer vision challenging. The latter observation has
not prevented the design of image representations, which trade off between
efficiency and complexity, while achieving accurate rendering of smooth regions
as well as reproducing faithful contours and textures. The most recent ones,
proposed in the past decade, share an hybrid heritage highlighting the
multiscale and oriented nature of edges and patterns in images. This paper
presents a panorama of the aforementioned literature on decompositions in
multiscale, multi-orientation bases or dictionaries. They typically exhibit
redundancy to improve sparsity in the transformed domain and sometimes its
invariance with respect to simple geometric deformations (translation,
rotation). Oriented multiscale dictionaries extend traditional wavelet
processing and may offer rotation invariance. Highly redundant dictionaries
require specific algorithms to simplify the search for an efficient (sparse)
representation. We also discuss the extension of multiscale geometric
decompositions to non-Euclidean domains such as the sphere or arbitrary meshed
surfaces. The etymology of panorama suggests an overview, based on a choice of
partially overlapping "pictures". We hope that this paper will contribute to
the appreciation and apprehension of a stream of current research directions in
image understanding.Comment: 65 pages, 33 figures, 303 reference
Wavelet methods in speech recognition
In this thesis, novel wavelet techniques are developed to improve parametrization of
speech signals prior to classification. It is shown that non-linear operations carried out
in the wavelet domain improve the performance of a speech classifier and consistently
outperform classical Fourier methods. This is because of the localised nature of the
wavelet, which captures correspondingly well-localised time-frequency features
within the speech signal. Furthermore, by taking advantage of the approximation
ability of wavelets, efficient representation of the non-stationarity inherent in speech
can be achieved in a relatively small number of expansion coefficients. This is an
attractive option when faced with the so-called 'Curse of Dimensionality' problem of
multivariate classifiers such as Linear Discriminant Analysis (LDA) or Artificial
Neural Networks (ANNs). Conventional time-frequency analysis methods such as the
Discrete Fourier Transform either miss irregular signal structures and transients due to
spectral smearing or require a large number of coefficients to represent such
characteristics efficiently. Wavelet theory offers an alternative insight in the
representation of these types of signals.
As an extension to the standard wavelet transform, adaptive libraries of wavelet and
cosine packets are introduced which increase the flexibility of the transform. This
approach is observed to be yet more suitable for the highly variable nature of speech
signals in that it results in a time-frequency sampled grid that is well adapted to
irregularities and transients. They result in a corresponding reduction in the
misclassification rate of the recognition system. However, this is necessarily at the
expense of added computing time.
Finally, a framework based on adaptive time-frequency libraries is developed which
invokes the final classifier to choose the nature of the resolution for a given
classification problem. The classifier then performs dimensionaIity reduction on the
transformed signal by choosing the top few features based on their discriminant power. This approach is compared and contrasted to an existing discriminant wavelet
feature extractor.
The overall conclusions of the thesis are that wavelets and their relatives are capable
of extracting useful features for speech classification problems. The use of adaptive
wavelet transforms provides the flexibility within which powerful feature extractors
can be designed for these types of application
Motion Artifacts Correction from Single-Channel EEG and fNIRS Signals using Novel Wavelet Packet Decomposition in Combination with Canonical Correlation Analysis
The electroencephalogram (EEG) and functional near-infrared spectroscopy
(fNIRS) signals, highly non-stationary in nature, greatly suffers from motion
artifacts while recorded using wearable sensors. This paper proposes two robust
methods: i) Wavelet packet decomposition (WPD), and ii) WPD in combination with
canonical correlation analysis (WPD-CCA), for motion artifact correction from
single-channel EEG and fNIRS signals. The efficacy of these proposed techniques
is tested using a benchmark dataset and the performance of the proposed methods
is measured using two well-established performance matrices: i) Difference in
the signal to noise ratio ({\Delta}SNR) and ii) Percentage reduction in motion
artifacts ({\eta}). The proposed WPD-based single-stage motion artifacts
correction technique produces the highest average {\Delta}SNR (29.44 dB) when
db2 wavelet packet is incorporated whereas the greatest average {\eta} (53.48%)
is obtained using db1 wavelet packet for all the available 23 EEG recordings.
Our proposed two-stage motion artifacts correction technique i.e. the WPD-CCA
method utilizing db1 wavelet packet has shown the best denoising performance
producing an average {\Delta}SNR and {\eta} values of 30.76 dB and 59.51%,
respectively for all the EEG recordings. On the other hand, the two-stage
motion artifacts removal technique i.e. WPD-CCA has produced the best average
{\Delta}SNR (16.55 dB, utilizing db1 wavelet packet) and largest average {\eta}
(41.40%, using fk8 wavelet packet). The highest average {\Delta}SNR and {\eta}
using single-stage artifacts removal techniques (WPD) are found as 16.11 dB and
26.40%, respectively for all the fNIRS signals using fk4 wavelet packet. In
both EEG and fNIRS modalities, the percentage reduction in motion artifacts
increases by 11.28% and 56.82%, respectively when two-stage WPD-CCA techniques
are employed.Comment: 25 pages, 10 figures and 2 table
Automatic condition monitoring system for crack detection in rotating machinery
Maintenance is essential to prevent catastrophic failures in rotating machinery. A crack can cause a failure with costly processes of reparation, especially in a rotating shaft.
In this study, the Wavelet Packets transform energy combined with Artificial Neural Networks with Radial Basis Function architecture (RBF-ANN) are applied to vibration signals to detect cracks in a rotating shaft. Data were obtained from a rig where the shaft rotates under its own weight, at steady state at different crack conditions. Nine defect conditions were induced in the shaft (with depths from 4% to 50% of the shaft diameter). The parameters for Wavelet Packets transform and RBF-ANN are selected to optimize its success rates results. Moreover, ‘Probability of Detection’ curves were calculated showing probabilities of detection close to 100% of the cases tested from the smallest crack size with a 1.77% of false alarms.The authors would like to thank the Spanish Government for financing through the CDTI project RANKINE21 IDI-20101560
Multitaper power spectrum estimation and thresholding: Wavelet packets versus wavelets
Published versio
- …