Search CORE

144 research outputs found

ANALYSING REPLAY SPOOFING COUNTERMEASURE PERFORMANCE UNDER VARIED CONDITIONS

Author: Benetos E
Chettri B
Sturm BL
Publication venue
Publication date: 06/08/2018
Field of study

Deep Generative Variational Autoencoding for Replay Spoof Detection in Automatic Speaker Verification

Author: Benetos E
Chettri B
Kinnunen T
Publication venue: 'Elsevier BV'
Publication date: 02/03/2020
Field of study

Automatic speaker verification (ASV) systems are highly vulnerable to presentation attacks, also called spoofing attacks. Replay is among the simplest attacks to mount - yet difficult to detect reliably. The generalization failure of spoofing countermeasures (CMs) has driven the community to study various alternative deep learning CMs. The majority of them are supervised approaches that learn a human-spoof discriminator. In this paper, we advocate a different, deep generative approach that leverages from powerful unsupervised manifold learning in classification. The potential benefits include the possibility to sample new data, and to obtain insights to the latent features of genuine and spoofed speech. To this end, we propose to use variational autoencoders (VAEs) as an alternative backend for replay attack detection, via three alternative models that differ in their class-conditioning. The first one, similar to the use of Gaussian mixture models (GMMs) in spoof detection, is to train independently two VAEs - one for each class. The second one is to train a single conditional model (C-VAE) by injecting a one-hot class label vector to the encoder and decoder networks. Our final proposal integrates an auxiliary classifier to guide the learning of the latent space. Our experimental results using constant-Q cepstral coefficient (CQCC) features on the ASVspoof 2017 and 2019 physical access subtask datasets indicate that the C-VAE offers substantial improvement in comparison to training two separate VAEs for each class. On the 2019 dataset, the C-VAE outperforms the VAE and the baseline GMM by an absolute 9-10% in both equal error rate (EER) and tandem detection cost function (t-DCF) metrics. Finally, we propose VAE residuals --- the absolute difference of the original input and the reconstruction as features for spoofing detection. The proposed frontend approach augmented with a convolutional neural network classifier demonstrated substantial improvement over the VAE backend use case

arXiv.org e-Print Archive

Queen Mary Research Online

Embedded Based Smart ICU-For Intelligent Patient Monitoring

Author: Anisha Chettri, Mr. S. Raja, Prof. B. Vinodh Kumar
Publication venue: 'Auricle Technologies, Pvt., Ltd.'
Publication date: 31/05/2018
Field of study

Smart ICUs are networks of audio-visual communication and computer systems that link critical care doctors and nurses (intensivists) to intensive care units (ICUs) in other, remote hospitals. The intensivists in the “command center” can communicate by voice with the remote ICU personnel and can receive video communication and clinical data about the patients. Direct patient care is provided by the doctors and nurses in the remote ICU who do not have to be intensivists themselves. In recent years there has been an increase in the number of patients needing ICU care without a corresponding increase in the supply of intensivists. Smart ICUs can be a valuable resource for hospitals faced with the need to expand capacity and improve care for a growing elderly population. Evidence from some early-adopter hospitals indicates that it can leverage management of patient care by intensivists, reduce mortality rates, and reduce LOS. However, positive outcomes appear to depend on the organizational environment into which the Smart ICU is introduced. The dramatic improvements in mortality and LOS reported by some early-adopter hospitals have not been matched in most. The limited research available suggests that the best outcomes may occur in ICUs that: Can make organizational arrangements to support the management of patient care by intensivists using Smart ICU; Have little or no intensivist staff available to them in the absence of Smart ICU; Have relatively high severity-adjusted mortality and LOS rates; Are located in remote or rural areas where safe and efficient transfer of patients to regional centers for advanced critical care presents difficulties. Smart ICU connects a central command center staffed by intensivists with patients in distant ICUs. Continuous, real-time audio, video, and electronic reports of vital signs connect the command center to the patients’ bedsides. Computer-managed decision support systems track each patient’s status and give alerts when negative trends are detected and when changes in treatment patterns are scheduled. The patient data include physiological status (e.g., ECG and blood oxygenation), treatment (e.g., the infusion rate for a specific medicine or the settings on a respirator), and medical records.

International Journal on Recent and Innovation Trends in Computing and Communication

Subband modeling for spoofing detection in automatic speaker verification

Author: Benetos E
Chettri B
Kinnunen T
Odyssey 2020: The Speaker and Language Recognition Workshop
Publication venue: 'The International Fiscal Association of Korea'
Publication date: 01/11/2020
Field of study

Spectrograms - time-frequency representations of audio signals - have found widespread use in neural network-based spoofing detection. While deep models are trained on the fullband spectrum of the signal, we argue that not all frequency bands are useful for these tasks. In this paper, we systematically investigate the impact of different subbands and their importance on replay spoofing detection on two benchmark datasets: ASVspoof 2017 v2.0 and ASVspoof 2019 PA. We propose a joint subband modelling framework that employs n different sub-networks to learn subband specific features. These are later combined and passed to a classifier and the whole network weights are updated during training. Our findings on the ASVspoof 2017 dataset suggest that the most discriminative information appears to be in the first and the last 1 kHz frequency bands, and the joint model trained on these two subbands shows the best performance outperforming the baselines by a large margin. However, these findings do not generalise on the ASVspoof 2019 PA dataset. This suggests that the datasets available for training these models do not reflect real world replay conditions suggesting a need for careful design of datasets for training replay spoofing countermeasures

Queen Mary Research Online

A novel challenge method with aeromonas salmonicida in rainbow trout for evaluation of furunculosis vaccines

Author: Buchmann K.
Chettri J. K.
Dalsgaard Inger
Kania Per
Krossøy B.
Marana M. H.
Skov J.
Publication venue: European Association of Fish Pathologists (EAFP)
Publication date: 01/01/2015
Field of study

Online Research Database In Technology

Aeromonas salmonicida infection in vaccinated rainbow trout: influence of challenge methods and environmental factors on challenge success

Author: Buchmann K.
Chettri J. K.
Dalsgaard Inger
Jaafar R. M.
Kania P. W.
Krossøy B.
Skov J.
Publication venue: European Association of Fish Pathologists (EAFP)
Publication date: 01/01/2015
Field of study

Online Research Database In Technology

Analysing the predictions of a CNN-based replay spoofing detection system

Author: 2018 IEEE Workshop on Spoken Language Technology
BENETOS E
CHETTRI B
MISHRA S
STURM B
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 27/09/2018
Field of study

Playing recorded speech samples of an enrolled speaker - "replay attack" - is a simple approach to bypass an automatic speaker verification (ASV) system. The vulnerability of ASV systems to such attacks has been acknowledged and studied, but there has been no research into what spoofing detection systems are actually learning to discriminate. In this paper, we analyse the local behaviour of a replay spoofing detection system based on convolutional neural networks (CNNs) adapted from a state-of-the-art CNN (LCNN-FFT) submitted at the ASVspoof 2017 challenge. We generate temporal and spectral explanations for predictions of the model using the SLIME algorithm. Our findings suggest that in most instances of spoofing the model is using information in the first 400 milliseconds of each audio instance to make the class prediction. Knowledge of the characteristics that spoofing detection systems are exploiting can help build less vulnerable ASV systems, other spoofing detection systems, as well as better evaluation databases

Queen Mary Research Online

Recommended from our members

Intracellular Photophysics of an Osmium Complex bearing an Oligothiophene Extended Ligand

Author: Breckmann Jannik
Bäuerle Peter
Cameron Colin G.
Chettri Avinash
Cole Houston D.
Dietzek Benjamin
Eggeling Christian
Lagerholm Christoffer B.
McFarland Sherri A.
Nauroozi Djawed
Rau Sven
Reglinski Katharina
Roque John A. III
Schmid Sylvia
Schneider Kilian R.A.
Stumper Anne
Publication venue: Weinheim : Wiley-VCH
Publication date: 01/01/2020
Field of study

This contribution describes the excited-state properties of an Osmium-complex when taken up into human cells. The complex 1 [Os(bpy)2(IP-4T)](PF6)2 with bpy=2,2′-bipyridine and IP-4T=2-{5′-[3′,4′-diethyl-(2,2′-bithien-5-yl)]-3,4-diethyl-2,2′-bithiophene}imidazo[4,5-f][1,10]phenanthroline) can be discussed as a candidate for photodynamic therapy in the biological red/NIR window. The complex is taken up by MCF7 cells and localizes rather homogeneously within in the cytoplasm. To detail the sub-ns photophysics of 1, comparative transient absorption measurements were carried out in different solvents to derive a model of the photoinduced processes. Key to rationalize the excited-state relaxation is a long-lived 3ILCT state associated with the oligothiophene chain. This model was then tested with the complex internalized into MCF7 cells, since the intracellular environment has long been suspected to take big influence on the excited state properties. In our study of 1 in cells, we were able to show that, though the overall model remained the same, the excited-state dynamics are affected strongly by the intracellular environment. Our study represents the first in depth correlation towards ex-vivo and in vivo ultrafast spectroscopy for a possible photodrug. © 2020 The Authors. Published by Wiley-VCH Gmb

Repositorium für Naturwissenschaften und Technik (TIB Hannover)

Digitale Bibliothek Thüringen

Ensemble Models for Spoofing Detection in Automatic Speaker Verification

Author: 20th Annual Conference of the International Speech Communication Association (INTERSPEECH 2019)
Benetos E
Chettri B
Martinez Ramirez M
Morfi V
Stoller D
Sturm B
Publication venue: International Speech Communication Association (ISCA)
Publication date: 15/09/2019
Field of study

Detecting spoofing attempts of automatic speaker verification (ASV) systems is challenging, especially when using only one modelling approach. For robustness, we use both deep neural networks and traditional machine learning models and combine them as ensemble models through logistic regression. They are trained to detect logical access (LA) and physical access (PA) attacks on the dataset released as part of the ASV Spoofing and Countermeasures Challenge 2019. We propose dataset partitions that ensure different attack types are present during training and validation to improve system robustness. Our ensemble model outperforms all our single models and the baselines from the challenge for both attack types. We investigate why some models on the PA dataset strongly outperform others and find that spoofed recordings in the dataset tend to have longer silences at the end than genuine ones. By removing them, the PA task becomes much more challenging, with the tandem detection cost function (t-DCF) of our best single model rising from 0.1672 to 0.5018 and equal error rate (EER) increasing from 5.98% to 19.8% on the development set

Queen Mary Research Online