Search CORE

1,106 research outputs found

Secure Automatic Speaker Verification Systems

Author: Aljasem Muteb
Publication venue: DigitalCommons@WayneState
Publication date: 01/01/2021
Field of study

The growing number of voice-enabled devices and applications consider automatic speaker verification (ASV) a fundamental component. However, maximum outreach for ASV in critical domains e.g., financial services and health care, is not possible unless we overcome security breaches caused by voice cloning, and replayed audios collectively known as the spoofing attacks. The audio spoofing attacks over ASV systems on one hand strictly limit the usability of voice-enabled applications; and on the other hand, the counterfeiter also remains untraceable. Therefore, to overcome these vulnerabilities, a secure ASV (SASV) system is presented in this dissertation. The proposed SASV system is based on the concept of novel sign modified acoustic local ternary pattern (sm-ALTP) features and asymmetric bagging-based classifier-ensemble. The proposed audio representation approach clusters the high and low-frequency components in audio frames by normally distributing frequency components against a convex function. Then, the neighborhood statistics are applied to capture the user specific vocal tract information. This information is then utilized by the classifier ensemble that is based on the concept of weighted normalized voting rule to detect various spoofing attacks. Contrary to the existing ASV systems, the proposed SASV system not only detects the conventional spoofing attacks (i.e. voice cloning, and replays), but also the new attacks that are still unexplored by the research community and a requirement of the future. In this regard, a concept of cloned replays is presented in this dissertation, where, replayed audios contains the microphone characteristics as well as the voice cloning artifacts. This depicts the scenario when voice cloning is applied in real-time. The voice cloning artifacts suppresses the microphone characteristics thus fails replay detection modules and similarly with the amalgamation of microphone characteristics the voice cloning detection gets deceived. Furthermore, the proposed scheme can be utilized to obtain a possible clue against the counterfeiter through voice cloning algorithm detection module that is also a novel concept proposed in this dissertation. The voice cloning algorithm detection module determines the voice cloning algorithm used to generate the fake audios. Overall, the proposed SASV system simultaneously verifies the bonafide speakers and detects the voice cloning attack, cloning algorithm used to synthesize cloned audio (in the defined settings), and voice-replay attacks over the ASVspoof 2019 dataset. In addition, the proposed method detects the voice replay and cloned voice replay attacks over the VSDC dataset. Rigorous experimentation against state-of-the-art approaches also confirms the robustness of the proposed research

Digital Commons@Wayne State University

Fusion of Spectral Reflectance and Derivative Information for Robust Hyperspectral Land Cover Classification

Author: Kalluri Hemanth Reddy
Publication venue: Scholars Junction
Publication date: 07/12/2009
Field of study

Developments in sensor technology have made high resolution hyperspectral remote sensing data available to the remote sensing analyst for ground cover classification and target recognition tasks. Further, with limited ground-truth data in many real-life operating scenarios, such hyperspectral classification systems often employ dimensionality reduction algorithms. In this thesis, the efficacy of spectral derivative features for hyperspectral analysis is studied. These studies are conducted within the context of both single and multiple classifier systems. Finally, a modification of existing classification techniques is proposed and tested on spectral reflectance and derivative features that adapts the classification systems to the characteristics of the dataset under consideration. Experimental results are reported with handheld, airborne and spaceborne hyperspectral data. Efficacy of the proposed approaches (using spectral derivatives and single or multiple classifiers) as quantified by the overall classification accuracy (expressed in percentage), is significantly greater than that of these systems when exploiting only reflectance information

Mississippi State University Libraries ETD database

Scholars Junction - Mississippi State University Institutional Repository

Development of an unsupervised remote sensing methodology of detect surface leakage from terrestrial CO2 storage sites

Author: Govindan Rajesh
Govindan Rajesh
Publication venue
Publication date: 01/01/2011
Field of study

Imperial Users onl

Spiral - Imperial College Digital Repository

Detecting autism, emotions and social signals using AdaBoost

Author: Busa-Fekete Róbert
Gosztolya Gábor
Tóth László
Publication venue: Interspeech
Publication date: 01/01/2013
Field of study

SZTE Publicatio Repozitórium - SZTE - Repository of Publications

Velocity Dealiased Spectral Estimators of Range Migrating Targets using a Single Low-PRF Wideband Waveform

Author: Besson Olivier
Bidon Stéphanie
Deudon François
Tourneret Jean-Yves
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2013
Field of study

Wideband radars are promising systems that may provide numerous advantages, like simultaneous detection of slow and fast moving targets, high range-velocity resolution classification, and electronic countermeasures. Unfortunately, classical processing algorithms are challenged by the range-migration phenomenon that occurs then for fast moving targets. We propose a new approach where the range migration is used rather as an asset to retrieve information about target velocitiesand, subsequently, to obtain a velocity dealiased mode. More specifically three new complex spectral estimators are devised in case of a single low-PRF (pulse repetition frequency) wideband waveform. The new estimation schemes enable one to decrease the level of sidelobes that arise at ambiguous velocities and, thus, to enhance the discrimination capability of the radar. Synthetic data and experimental data are used to assess the performance of the proposed estimators

Crossref

Scientific Publications of the University of Toulouse II Le Mirail

Open Archive Toulouse Archive Ouverte

A Comprehensive Survey of Deep Learning in Remote Sensing: Theories, Tools and Challenges for the Community

Author: Ball John E.
Anderson Derek T.
Chan Chee Seng
Publication venue
Publication date: 01/01/2017
Field of study

In recent years, deep learning (DL), a re-branding of neural networks (NNs), has risen to the top in numerous areas, namely computer vision (CV), speech recognition, natural language processing, etc. Whereas remote sensing (RS) possesses a number of unique challenges, primarily related to sensors and applications, inevitably RS draws from many of the same theories as CV; e.g., statistics, fusion, and machine learning, to name a few. This means that the RS community should be aware of, if not at the leading edge of, of advancements like DL. Herein, we provide the most comprehensive survey of state-of-the-art RS DL research. We also review recent new developments in the DL field that can be used in DL for RS. Namely, we focus on theories, tools and challenges for the RS community. Specifically, we focus on unsolved challenges and opportunities as it relates to (i) inadequate data sets, (ii) human-understandable solutions for modelling physical phenomena, (iii) Big Data, (iv) non-traditional heterogeneous data sources, (v) DL architectures and learning algorithms for spectral, spatial and temporal data, (vi) transfer learning, (vii) an improved theoretical understanding of DL systems, (viii) high barriers to entry, and (ix) training and optimizing the DL.Comment: 64 pages, 411 references. To appear in Journal of Applied Remote Sensin

arXiv.org e-Print Archive

Crossref

FigShare

Towards music perception by redundancy reduction and unsupervised learning in probabilistic models

Author: Abdallah Samer A.
Publication venue: 'Queen Mary University of London'
Publication date: 01/01/2002
Field of study

PhDThe study of music perception lies at the intersection of several disciplines: perceptual psychology and cognitive science, musicology, psychoacoustics, and acoustical signal processing amongst others. Developments in perceptual theory over the last fifty years have emphasised an approach based on Shannon’s information theory and its basis in probabilistic systems, and in particular, the idea that perceptual systems in animals develop through a process of unsupervised learning in response to natural sensory stimulation, whereby the emerging computational structures are well adapted to the statistical structure of natural scenes. In turn, these ideas are being applied to problems in music perception. This thesis is an investigation of the principle of redundancy reduction through unsupervised learning, as applied to representations of sound and music. In the first part, previous work is reviewed, drawing on literature from some of the fields mentioned above, and an argument presented in support of the idea that perception in general and music perception in particular can indeed be accommodated within a framework of unsupervised learning in probabilistic models. In the second part, two related methods are applied to two different low-level representations. Firstly, linear redundancy reduction (Independent Component Analysis) is applied to acoustic waveforms of speech and music. Secondly, the related method of sparse coding is applied to a spectral representation of polyphonic music, which proves to be enough both to recognise that the individual notes are the important structural elements, and to recover a rough transcription of the music. Finally, the concepts of distance and similarity are considered, drawing in ideas about noise, phase invariance, and topological maps. Some ecologically and information theoretically motivated distance measures are suggested, and put in to practice in a novel method, using multidimensional scaling (MDS), for visualising geometrically the dependency structure in a distributed representation.Engineering and Physical Science Research Counci

Queen Mary Research Online

Privacy-Protecting Techniques for Behavioral Data: A Survey

Author: Arias-Cabarcos Patricia
Hanisch Simon
Parra-Arnau Javier
Strufe Thorsten
Publication venue: arxiv
Publication date: 12/11/2021
Field of study

Our behavior (the way we talk, walk, or think) is unique and can be used as a biometric trait. It also correlates with sensitive attributes like emotions. Hence, techniques to protect individuals privacy against unwanted inferences are required. To consolidate knowledge in this area, we systematically reviewed applicable anonymization techniques. We taxonomize and compare existing solutions regarding privacy goals, conceptual operation, advantages, and limitations. Our analysis shows that some behavioral traits (e.g., voice) have received much attention, while others (e.g., eye-gaze, brainwaves) are mostly neglected. We also find that the evaluation methodology of behavioral anonymization techniques can be further improved

KITopen

Recommended from our members

Application of Higher-Order Statistics and Subspace-Based Techniques to the Analysis and Diagnosis of Electrocardiogram Signals

Author: El-Khafif S. H.
Publication venue
Publication date
Field of study

The first and main contribution of this research work is the higher-order statistics (HOS)-based non-linear analysis and subsequent diagnosis of abnormal electrocardiogram (ECG) signals, particularly myocardial ischaemia. In the time domain; the second-, third-, and the fourth-order cumulants have been used in the analysis. In the frequency domain; up to the tenth-order polyspectra have been exploited. This HOS-based analysis of normal and ischaemic electrocardiogram signals has led to the identification of certain key discriminant features for the two physiological states of the heart. These features are then fed to different backpropagation-based multiple layer perceptrons for classification. The second contribution is a proposed new methodology to discriminate patients with angina pectoris or with old myocardial infarction (MI) during the first 60 seconds of stress test (or in some cases using rest ECG). It is based on the pseudo-spectral Multiple Signal Classification (MUSIC) and has the potential of being highly sensitive diagnostic signal processing tool. The third contribution is the development of a novel higher-order statistics, high-resolution estimator for quadratically coupled frequencies based on subspace spectral estimation. Extensive studies of cumulants, bispectra and bicoherence-squared of normal and ischaemic ECG signals collected from MIT and ST-T European databases has enabled us to see key discriminant features in both the third- and fourth-order cumulant domains. In the frequency domain, the polyspectral study has been extended to the lOth-order poly spectra. By calculating one-dimensional polyspectrum slices using an algorithm developed by Zhou and Giannakis (1995) a considerable reduction in the CPU time has been achieved. Furthermore, Zhou’s algorithm has been further extended to estimate the polycoherency slices which are used to characterise non-linearities in normal and ischaemic ECG signals. An important finding in this thesis is the decrease of the order of non-linearity representing the electrocardiogram signals of ischaemic patients. This thesis also includes the results of a pilot study involving eighteen healthy subjects (MIT database) and confirmed that the ECG signal is non-Gaussian, cyclostationary and quasi periodic. Combined spectral and bispectral analysis of the signal revealed that there are unique harmonic characteristics for the P-wave, QRS complex and T-wave and other frequencies due to harmonic interactions. In this work three linear and one non-linear adaptive filtering/predictions techniques have been applied to noisy ECG signals and their respective performances appraised. It is shown that the Kalman filter gives the best mean-square error MSE error but its comparatively long execution time and problems arising from ill-conditioning of the state-error covariance matrix render it of limited use in ECG applications. It is also shown that the LMS-based quadratic and cubic Volterra filters are the most superior for the ECG signal prediction. For ECG classifications; three multi-layer perceptrons employing back-propagation and modified back-propagation algorithms, and using two sets from the higher-order most discriminant features as their inputs, have yielded fairly high classification rates

City Research Online

Machine Learning for Fluid Mechanics

Author: Brunton Steven
Koumoutsakos Petros
Noack Bernd
Publication venue: 'Annual Reviews'
Publication date: 04/01/2020
Field of study

The field of fluid mechanics is rapidly advancing, driven by unprecedented volumes of data from field measurements, experiments and large-scale simulations at multiple spatiotemporal scales. Machine learning offers a wealth of techniques to extract information from data that could be translated into knowledge about the underlying fluid mechanics. Moreover, machine learning algorithms can augment domain knowledge and automate tasks related to flow control and optimization. This article presents an overview of past history, current developments, and emerging opportunities of machine learning for fluid mechanics. It outlines fundamental machine learning methodologies and discusses their uses for understanding, modeling, optimizing, and controlling fluid flows. The strengths and limitations of these methods are addressed from the perspective of scientific inquiry that considers data as an inherent part of modeling, experimentation, and simulation. Machine learning provides a powerful information processing framework that can enrich, and possibly even transform, current lines of fluid mechanics research and industrial applications.Comment: To appear in the Annual Reviews of Fluid Mechanics, 202

arXiv.org e-Print Archive