604 research outputs found

    Replay detection in voice biometrics: an investigation of adaptive and non-adaptive front-ends

    Full text link
    Among various physiological and behavioural traits, speech has gained popularity as an effective mode of biometric authentication. Even though they are gaining popularity, automatic speaker verification systems are vulnerable to malicious attacks, known as spoofing attacks. Among various types of spoofing attacks, replay attack poses the biggest threat due to its simplicity and effectiveness. This thesis investigates the importance of 1) improving front-end feature extraction via novel feature extraction techniques and 2) enhancing spectral components via adaptive front-end frameworks to improve replay attack detection. This thesis initially focuses on AM-FM modelling techniques and their use in replay attack detection. A novel method to extract the sub-band frequency modulation (FM) component using the spectral centroid of a signal is proposed, and its use as a potential acoustic feature is also discussed. Frequency Domain Linear Prediction (FDLP) is explored as a method to obtain the temporal envelope of a speech signal. The temporal envelope carries amplitude modulation (AM) information of speech resonances. Several features are extracted from the temporal envelope and the FDLP residual signal. These features are then evaluated for replay attack detection and shown to have significant capability in discriminating genuine and spoofed signals. Fusion of AM and FM-based features has shown that AM and FM carry complementary information that helps distinguish replayed signals from genuine ones. The importance of frequency band allocation when creating filter banks is studied as well to further advance the understanding of front-ends for replay attack detection. Mechanisms inspired by the human auditory system that makes the human ear an excellent spectrum analyser have been investigated and integrated into front-ends. Spatial differentiation, a mechanism that provides additional sharpening to auditory filters is one of them that is used in this work to improve the selectivity of the sub-band decomposition filters. Two features are extracted using the improved filter bank front-end: spectral envelope centroid magnitude (SECM) and spectral envelope centroid frequency (SECF). These are used to establish the positive effect of spatial differentiation on discriminating spoofed signals. Level-dependent filter tuning, which allows the ear to handle a large dynamic range, is integrated into the filter bank to further improve the front-end. This mechanism converts the filter bank into an adaptive one where the selectivity of the filters is varied based on the input signal energy. Experimental results show that this leads to improved spoofing detection performance. Finally, deep neural network (DNN) mechanisms are integrated into sub-band feature extraction to develop an adaptive front-end that adjusts its characteristics based on the sub-band signals. A DNN-based controller that takes sub-band FM components as input, is developed to adaptively control the selectivity and sensitivity of a parallel filter bank to enhance the artifacts that differentiate a replayed signal from a genuine signal. This work illustrates gradient-based optimization of a DNN-based controller using the feedback from a spoofing detection back-end classifier, thus training it to reduce spoofing detection error. The proposed framework has displayed a superior ability in identifying high-quality replayed signals compared to conventional non-adaptive frameworks. All techniques proposed in this thesis have been evaluated on well-established databases on replay attack detection and compared with state-of-the-art baseline systems

    Spoofing Detection in Voice Biometrics: Cochlear Modelling and Perceptually Motivated Features

    Full text link
    The automatic speaker verification (ASV) system is one of the most widely adopted biometric technology. However, ASV is vulnerable to spoofing attacks that can significantly affect its reliability. Among the different variants of spoofing attacks, replay attacks pose a major threat as they do not require any expert knowledge to implement and are difficult to detect. The primary focus of this thesis is on understanding and developing biologically inspired models and techniques to detect replay attacks. This thesis develops a novel framework for implementing an active cochlear filter model as a frontend spectral analyser for spoofing attack detection to leverage the remarkable sensitivity and selectivity of the mammalian auditory system over a broad range of intensities and frequencies. In particular, the developed model aims to mimic the active mechanism in the cochlea, enabling sharp frequency tuning and level-dependent compression, which amplifies and tune to low energy signal to make a broad dynamic range of signals audible. Experimental evaluations of the developed models in the context of replay detection systems exhibit a significant performance improvement, highlighting the potential benefits of the use of biologically inspired front ends. In addition, since replay detection relies on the discerning channel characteristics and the effect of the acoustic environment, acoustic cues essential for speech perception such as amplitude- and frequency-modulation (AM, FM) features are also investigated. Finally, to capture discriminative cues present in the temporal domain, the temporal masking psychoacoustic phenomenon in auditory processing is exploited, and the usefulness of the masking pattern is investigated. This led to a novel feature parameterisation which helps improve replay attack detection

    Physical Layer Defenses Against Primary User Emulation Attacks

    Get PDF
    Cognitive Radio (CR) is a promising technology that works by detecting unused parts of the spectrum and automatically reconfiguring the communication system\u27s parameters in order to operate in the available communication channels while minimizing interference. CR enables efficient use of the Radio Frequency (RF) spectrum by generating waveforms that can coexist with existing users in licensed spectrum bands. Spectrum sensing is one of the most important components of CR systems because it provides awareness of its operating environment, as well as detecting the presence of primary (licensed) users of the spectrum

    Machine Learning Mitigants for Speech Based Cyber Risk

    Get PDF
    Statistical analysis of speech is an emerging area of machine learning. In this paper, we tackle the biometric challenge of Automatic Speaker Verification (ASV) of differentiating between samples generated by two distinct populations of utterances, those of an authentic human voice and those generated by a synthetic one. Solving such an issue through a statistical perspective foresees the definition of a decision rule function and a learning procedure to identify the optimal classifier. Classical state-of-the-art countermeasures rely on strong assumptions such as stationarity or local-stationarity of speech that may be atypical to encounter in practice. We explore in this regard a robust non-linear and non-stationary signal decomposition method known as the Empirical Mode Decomposition combined with the Mel-Frequency Cepstral Coefficients in a novel fashion with a refined classifier technique known as multi-kernel Support Vector machine. We undertake significant real data case studies covering multiple ASV systems using different datasets, including the ASVSpoof 2019 challenge database. The obtained results overwhelmingly demonstrate the significance of our feature extraction and classifier approach versus existing conventional methods in reducing the threat of cyber-attack perpetrated by synthetic voice replication seeking unauthorised access

    Point process modeling and estimation: advances in the analysis of dynamic neural spiking data

    Full text link
    A common interest of scientists in many fields is to understand the relationship between the dynamics of a physical system and the occurrences of discrete events within such physical system. Seismologists study the connection between mechanical vibrations of the Earth and the occurrences of earthquakes so that future earthquakes can be better predicted. Astrophysicists study the association between the oscillating energy of celestial regions and the emission of photons to learn the Universe's various objects and their interactions. Neuroscientists study the link between behavior and the millisecond-timescale spike patterns of neurons to understand higher brain functions. Such relationships can often be formulated within the framework of state-space models with point process observations. The basic idea is that the dynamics of the physical systems are driven by the dynamics of some stochastic state variables and the discrete events we observe in an interval are noisy observations with distributions determined by the state variables. This thesis proposes several new methodological developments that advance the framework of state-space models with point process observations at the intersection of statistics and neuroscience. In particular, we develop new methods 1) to characterize the rhythmic spiking activity using history-dependent structure, 2) to model population spike activity using marked point process models, 3) to allow for real-time decision making, and 4) to take into account the need for dimensionality reduction for high-dimensional state and observation processes. We applied these methods to a novel problem of tracking rhythmic dynamics in the spiking of neurons in the subthalamic nucleus of Parkinson's patients with the goal of optimizing placement of deep brain stimulation electrodes. We developed a decoding algorithm that can make decision in real-time (for example, to stimulate the neurons or not) based on various sources of information present in population spiking data. Lastly, we proposed a general three-step paradigm that allows us to relate behavioral outcomes of various tasks to simultaneously recorded neural activity across multiple brain areas, which is a step towards closed-loop therapies for psychological diseases using real-time neural stimulation. These methods are suitable for real-time implementation for content-based feedback experiments

    Cognitive and Neural Map Representations in Schizophrenia

    Get PDF
    An ability to build structured cognitive maps of the world may lie at the heart of understanding cognitive features of schizophrenia. In rodents, cognitive map representations are supported by sequential hippocampal place cell reactivations during rest (offline), known as replay. These events occur in the context of local high frequency ripple oscillations, and whole-brain default mode network (DMN) activation. Genetic mouse models of schizophrenia also report replay and ripple abnormalities. Here, I investigate the behavioural and neural signatures of structured internal representations in people with a diagnosis of schizophrenia (PScz, n = 29) and matched control participants (n = 28) using magnetoencephalography (MEG). Participants were asked to infer correct sequential relationships between task pictures by applying a pre-learned task template to visual experiences containing these pictures. In Chapter 3 I show that, during a post-task rest session, controls exhibited fast spontaneous neural reactivation of task state representations that replayed inferred relationships. Replay was coincident with increased ripple power in hippocampus, which may be related to NMDAR availability (Chapter 4). PScz showed both reduced replay and augmented ripple power, convergent with genetic mouse models. These abnormalities were linked to impairments in behavioural acquisition of task structure, and to its subsequent representation in visually evoked neural responses. In Chapter 5 I explore the temporal coupling between replay onsets and DMN activation. I show an impairment in this association in PScz, which related to subsequent mnemonic maintenance of learned task structure, complementing previous reports of DMN abnormalities in the condition. Finally, in Chapter 6, using a separate verbal fluency task, I show that PScz exhibit evidence of reduced use of (semantic) associative information when sampling concepts from memory. Together, my results provide support for a hypothesis that schizophrenia is associated with abnormalities in neural and behavioural correlates of cognitive map representation

    Security and Privacy for Modern Wireless Communication Systems

    Get PDF
    The aim of this reprint focuses on the latest protocol research, software/hardware development and implementation, and system architecture design in addressing emerging security and privacy issues for modern wireless communication networks. Relevant topics include, but are not limited to, the following: deep-learning-based security and privacy design; covert communications; information-theoretical foundations for advanced security and privacy techniques; lightweight cryptography for power constrained networks; physical layer key generation; prototypes and testbeds for security and privacy solutions; encryption and decryption algorithm for low-latency constrained networks; security protocols for modern wireless communication networks; network intrusion detection; physical layer design with security consideration; anonymity in data transmission; vulnerabilities in security and privacy in modern wireless communication networks; challenges of security and privacy in node–edge–cloud computation; security and privacy design for low-power wide-area IoT networks; security and privacy design for vehicle networks; security and privacy design for underwater communications networks

    Condition monitoring of pharmaceutical powder compression during tabletting using acoustic emission

    Get PDF
    This thesis was submitted for the degree of Doctor of Philosophy and awarded by Brunel University.This research project aimed to develop a condition monitoring system for the final production quality of pharmaceutical tablets and detection capping and lamination during powder compression process using the acoustic emission (AE) method. Pharmaceutical tablet manufacturers obliged by regulatory bodies to test the tablet's physical properties such as hardness, dissolution and disintegration before the tablets are released to the market. Most of the existing methods and techniques for testing and monitoring these tablet's properties are performed at the tablet post-compression stage. Furthermore, these tests are destructive in nature. Early experimental investigations revealed that the AE energy that is generated during powder compression is directly proportional to the peak force that is required to crush the tablet, i. e. crushing strength. Further laboratory and industrial experimental investigation have been conducted to study the relationship between the AE signals and the compression conditions. Traditional AE signal features such as energy, count, peak amplitude, average signal level, event duration and rise time were recorded. AE data analysis with the aid of advanced classification algorithm, fuzzy C-mean clustering showed that the AE energy is a very useful parameter in tablet condition monitoring. It was found that the AE energy that is generated during powder compression is sensitive to the process and is directly proportional to the compression speed, particle size, homogeneity of mixture and the amount of material present. Also this AE signal is dependent upon the type of material used as the tablet filler. Acoustic emission has been shown to be a useful technique for characterising some of the complex physical changes which occur during tabletting. Capping and lamination are serious problems that are encountered during tabletting. A capped or laminated tablet is one which no longer retains its mechanical integrity and exhibit low strength characteristics. Capping and lamination can be caused by a number of factors such as excessive pressure, insufficient binder in the granules and poor material flowabilities. However, capping and lamination can also occur randomly and they are also dependent upon the material used in tabletting. It was possible to identify a capped or laminated tablet by monitoring the AE energy level during continuous on-line monitoring of tabletting. Capped tablets indicated by low level of AE energy. The proposed condition monitoring system aimed to set the AE energy threshold that could discriminate between capped and non-capped tablets. This was based upon statistical distributions of the AE energy values for both the capped and non-capped tablets. The system aims to minimise the rate of false alarms (indication of capping when in reality capping has not occurred) and the rate of missed detection (an indication of non capping, when in reality capping has occurred). A novel approach that employs both the AE method and the receiver operating characteristic (ROC) curve was proposed for the on-line detection of capping and lamination during tabletting. The proposed system employs AE energy as the discriminating parameter to detect between capped and non-capped tablets. The ROC curve was constructed from the area under the two distributions of both capped and non-capped tablet. This curve shows a trade-off between the probabilities of true detection rate and false alarm rate for capped and non-capped tablet. A two-graph receiver operating characteristic (ROC) curve was presented as a modification of the original ROC curve to enable an operator to directly select the desired energy threshold for tablet monitoring. This plot shows the ROC co-ordinate as a function of the threshold value over the entire threshold (AE energy) range for all test outcomes. An alternative way of deciding a threshold based on the slope of the ROC curve was also developed. The slope of the ROC curve represents the optimal operating point on the curve. It depends upon the penalties cost of capping and the prevalence of capping. Sets of guidelines have been outlined for decision making i.e. threshold setting. These guidelines take into account both the prevalence of capping in manufacturing and the cost associated with various outcomes of tablet formation. The proposed condition monitoring system also relates AE monitoring to non-AE measurement as it enable an operator predicting tablet hardness and disintegration form the AE energy, a relationship which was established in this research

    Contributions to adaptive equalization and timing recovery for optical storage systems

    Get PDF
    no abstrac
    • …
    corecore