1,106 research outputs found
Secure Automatic Speaker Verification Systems
The growing number of voice-enabled devices and applications consider automatic speaker verification (ASV) a fundamental component. However, maximum outreach for ASV in critical domains e.g., financial services and health care, is not possible unless we overcome security breaches caused by voice cloning, and replayed audios collectively known as the spoofing attacks. The audio spoofing attacks over ASV systems on one hand strictly limit the usability of voice-enabled applications; and on the other hand, the counterfeiter also remains untraceable. Therefore, to overcome these vulnerabilities, a secure ASV (SASV) system is presented in this dissertation. The proposed SASV system is based on the concept of novel sign modified acoustic local ternary pattern (sm-ALTP) features and asymmetric bagging-based classifier-ensemble. The proposed audio representation approach clusters the high and low-frequency components in audio frames by normally distributing frequency components against a convex function. Then, the neighborhood statistics are applied to capture the user specific vocal tract information. This information is then utilized by the classifier ensemble that is based on the concept of weighted normalized voting rule to detect various spoofing attacks. Contrary to the existing ASV systems, the proposed SASV system not only detects the conventional spoofing attacks (i.e. voice cloning, and replays), but also the new attacks that are still unexplored by the research community and a requirement of the future. In this regard, a concept of cloned replays is presented in this dissertation, where, replayed audios contains the microphone characteristics as well as the voice cloning artifacts. This depicts the scenario when voice cloning is applied in real-time. The voice cloning artifacts suppresses the microphone characteristics thus fails replay detection modules and similarly with the amalgamation of microphone characteristics the voice cloning detection gets deceived. Furthermore, the proposed scheme can be utilized to obtain a possible clue against the counterfeiter through voice cloning algorithm detection module that is also a novel concept proposed in this dissertation. The voice cloning algorithm detection module determines the voice cloning algorithm used to generate the fake audios. Overall, the proposed SASV system simultaneously verifies the bonafide speakers and detects the voice cloning attack, cloning algorithm used to synthesize cloned audio (in the defined settings), and voice-replay attacks over the ASVspoof 2019 dataset. In addition, the proposed method detects the voice replay and cloned voice replay attacks over the VSDC dataset. Rigorous experimentation against state-of-the-art approaches also confirms the robustness of the proposed research
Fusion of Spectral Reflectance and Derivative Information for Robust Hyperspectral Land Cover Classification
Developments in sensor technology have made high resolution hyperspectral remote sensing data available to the remote sensing analyst for ground cover classification and target recognition tasks. Further, with limited ground-truth data in many real-life operating scenarios, such hyperspectral classification systems often employ dimensionality reduction algorithms. In this thesis, the efficacy of spectral derivative features for hyperspectral analysis is studied. These studies are conducted within the context of both single and multiple classifier systems. Finally, a modification of existing classification techniques is proposed and tested on spectral reflectance and derivative features that adapts the classification systems to the characteristics of the dataset under consideration. Experimental results are reported with handheld, airborne and spaceborne hyperspectral data. Efficacy of the proposed approaches (using spectral derivatives and single or multiple classifiers) as quantified by the overall classification accuracy (expressed in percentage), is significantly greater than that of these systems when exploiting only reflectance information
Development of an unsupervised remote sensing methodology of detect surface leakage from terrestrial CO2 storage sites
Imperial Users onl
Velocity Dealiased Spectral Estimators of Range Migrating Targets using a Single Low-PRF Wideband Waveform
Wideband radars are promising systems that may provide numerous advantages, like simultaneous detection of slow and fast moving targets, high range-velocity resolution classification, and electronic countermeasures. Unfortunately, classical processing algorithms are challenged by the range-migration phenomenon that occurs then for fast moving targets. We
propose a new approach where the range migration is used rather as an asset to retrieve information about target velocitiesand, subsequently, to obtain a velocity dealiased mode. More specifically three new complex spectral estimators are devised in case of a single low-PRF (pulse repetition frequency) wideband waveform. The new estimation schemes enable one to decrease the
level of sidelobes that arise at ambiguous velocities and, thus, to enhance the discrimination capability of the radar. Synthetic data and experimental data are used to assess the performance of the proposed estimators
A Comprehensive Survey of Deep Learning in Remote Sensing: Theories, Tools and Challenges for the Community
In recent years, deep learning (DL), a re-branding of neural networks (NNs),
has risen to the top in numerous areas, namely computer vision (CV), speech
recognition, natural language processing, etc. Whereas remote sensing (RS)
possesses a number of unique challenges, primarily related to sensors and
applications, inevitably RS draws from many of the same theories as CV; e.g.,
statistics, fusion, and machine learning, to name a few. This means that the RS
community should be aware of, if not at the leading edge of, of advancements
like DL. Herein, we provide the most comprehensive survey of state-of-the-art
RS DL research. We also review recent new developments in the DL field that can
be used in DL for RS. Namely, we focus on theories, tools and challenges for
the RS community. Specifically, we focus on unsolved challenges and
opportunities as it relates to (i) inadequate data sets, (ii)
human-understandable solutions for modelling physical phenomena, (iii) Big
Data, (iv) non-traditional heterogeneous data sources, (v) DL architectures and
learning algorithms for spectral, spatial and temporal data, (vi) transfer
learning, (vii) an improved theoretical understanding of DL systems, (viii)
high barriers to entry, and (ix) training and optimizing the DL.Comment: 64 pages, 411 references. To appear in Journal of Applied Remote
Sensin
Towards music perception by redundancy reduction and unsupervised learning in probabilistic models
PhDThe study of music perception lies at the intersection of several disciplines: perceptual
psychology and cognitive science, musicology, psychoacoustics, and acoustical
signal processing amongst others. Developments in perceptual theory over the last
fifty years have emphasised an approach based on Shannon’s information theory and
its basis in probabilistic systems, and in particular, the idea that perceptual systems
in animals develop through a process of unsupervised learning in response to natural
sensory stimulation, whereby the emerging computational structures are well adapted
to the statistical structure of natural scenes. In turn, these ideas are being applied to
problems in music perception.
This thesis is an investigation of the principle of redundancy reduction through
unsupervised learning, as applied to representations of sound and music.
In the first part, previous work is reviewed, drawing on literature from some of the
fields mentioned above, and an argument presented in support of the idea that perception
in general and music perception in particular can indeed be accommodated within
a framework of unsupervised learning in probabilistic models.
In the second part, two related methods are applied to two different low-level representations.
Firstly, linear redundancy reduction (Independent Component Analysis)
is applied to acoustic waveforms of speech and music. Secondly, the related method of
sparse coding is applied to a spectral representation of polyphonic music, which proves
to be enough both to recognise that the individual notes are the important structural elements,
and to recover a rough transcription of the music.
Finally, the concepts of distance and similarity are considered, drawing in ideas
about noise, phase invariance, and topological maps. Some ecologically and information
theoretically motivated distance measures are suggested, and put in to practice in
a novel method, using multidimensional scaling (MDS), for visualising geometrically
the dependency structure in a distributed representation.Engineering and Physical Science Research Counci
Privacy-Protecting Techniques for Behavioral Data: A Survey
Our behavior (the way we talk, walk, or think) is unique and can be used as a biometric trait. It also correlates with sensitive attributes like emotions. Hence, techniques to protect individuals privacy against unwanted inferences are required. To consolidate knowledge in this area, we systematically reviewed applicable anonymization techniques. We taxonomize and compare existing solutions regarding privacy goals, conceptual operation, advantages, and limitations. Our analysis shows that some behavioral traits (e.g., voice) have received much attention, while others (e.g., eye-gaze, brainwaves) are mostly neglected. We also find that the evaluation methodology of behavioral anonymization techniques can be further improved
Recommended from our members
Application of Higher-Order Statistics and Subspace-Based Techniques to the Analysis and Diagnosis of Electrocardiogram Signals
The first and main contribution of this research work is the higher-order statistics (HOS)-based non-linear analysis and subsequent diagnosis of abnormal electrocardiogram (ECG) signals, particularly myocardial ischaemia. In the time domain; the second-, third-, and the fourth-order cumulants have been used in the analysis. In the frequency domain; up to the tenth-order polyspectra have been exploited. This HOS-based analysis of normal and ischaemic electrocardiogram signals has led to the identification of certain key discriminant features for the two physiological states of the heart. These features are then fed to different backpropagation-based multiple layer perceptrons for classification. The second contribution is a proposed new methodology to discriminate patients with angina pectoris or with old myocardial infarction (MI) during the first 60 seconds of stress test (or in some cases using rest ECG). It is based on the pseudo-spectral Multiple Signal Classification (MUSIC) and has the potential of being highly sensitive diagnostic signal processing tool. The third contribution is the development of a novel higher-order statistics, high-resolution estimator for quadratically coupled frequencies based on subspace spectral estimation.
Extensive studies of cumulants, bispectra and bicoherence-squared of normal and ischaemic ECG signals collected from MIT and ST-T European databases has enabled us to see key discriminant features in both the third- and fourth-order cumulant domains. In the frequency domain, the polyspectral study has been extended to the lOth-order poly spectra. By calculating one-dimensional polyspectrum slices using an algorithm developed by Zhou and Giannakis (1995) a considerable reduction in the CPU time has been achieved. Furthermore, Zhou’s algorithm has been further extended to estimate the polycoherency slices which are used to characterise non-linearities in normal and ischaemic ECG signals. An important finding in this thesis is the decrease of the order of non-linearity representing the electrocardiogram signals of ischaemic patients.
This thesis also includes the results of a pilot study involving eighteen healthy subjects (MIT database) and confirmed that the ECG signal is non-Gaussian, cyclostationary and quasi periodic. Combined spectral and bispectral analysis of the signal revealed that there are unique harmonic characteristics for the P-wave, QRS complex and T-wave and other frequencies due to harmonic interactions.
In this work three linear and one non-linear adaptive filtering/predictions techniques have been applied to noisy ECG signals and their respective performances appraised. It is shown that the Kalman filter gives the best mean-square error MSE error but its comparatively long execution time and problems arising from ill-conditioning of the state-error covariance matrix render it of limited use in ECG applications. It is also shown that the LMS-based quadratic and cubic Volterra filters are the most superior for the ECG signal prediction.
For ECG classifications; three multi-layer perceptrons employing back-propagation and modified back-propagation algorithms, and using two sets from the higher-order most discriminant features as their inputs, have yielded fairly high classification rates
Machine Learning for Fluid Mechanics
The field of fluid mechanics is rapidly advancing, driven by unprecedented
volumes of data from field measurements, experiments and large-scale
simulations at multiple spatiotemporal scales. Machine learning offers a wealth
of techniques to extract information from data that could be translated into
knowledge about the underlying fluid mechanics. Moreover, machine learning
algorithms can augment domain knowledge and automate tasks related to flow
control and optimization. This article presents an overview of past history,
current developments, and emerging opportunities of machine learning for fluid
mechanics. It outlines fundamental machine learning methodologies and discusses
their uses for understanding, modeling, optimizing, and controlling fluid
flows. The strengths and limitations of these methods are addressed from the
perspective of scientific inquiry that considers data as an inherent part of
modeling, experimentation, and simulation. Machine learning provides a powerful
information processing framework that can enrich, and possibly even transform,
current lines of fluid mechanics research and industrial applications.Comment: To appear in the Annual Reviews of Fluid Mechanics, 202
- …