Search CORE

566 research outputs found

Synthetic Speech Discrimination using Pitch Pattern Statistics Derived from Image Analysis

Author: Leon Phillip L. De
Stewart Bryan
Yamagishi Junichi
Publication venue
Publication date: 01/09/2012
Field of study

VOICE ACTIVITY DETECTION USING A SLIDING-WINDOW, MAXIMUM MARGIN CLUSTERING APPROACH

Author: Phillip De Leon
Salvador Sanchez
Publication venue
Publication date: 24/04/2020
Field of study

ABSTRACT Recently, an unsupervised, data clustering algorithm based on maximum margin, i.e. support vector machine (SVM) was reported. The maximum margin clustering (MMC) algorithm was later applied to the problem of voice activity detection, however, the application did not allow for real-time detection which is important in speech processing applications. In this paper, we propose a voice activity detector (VAD) based on a sliding window, MMC algorithm which allows for real-time detection. Our system requires a separate initialization stage which imposes an initial detection delay, however, once initialized the system can operate in real-time. Using TIMIT speech under several NOISEX-92 noise backgrounds at various SNRs, we show that our average speech and non-speech hit rates are better than state-of-the-art VADs

CiteSeerX

Normalized, HOS-Based, Blind Speech Separation Algorithms

Author: Phillip De Leon
Yunsheng Ma
Publication venue
Publication date: 06/03/2020
Field of study

Abstrac

CiteSeerX

Detection of voice conversion spoofing attacks using voiced speech

Author: De Leon Phillip L.
Roedig Utz
Sankar Arun
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 17/01/2023
Field of study

Speech consists of voiced and unvoiced segments that differ in their production process and exhibit different characteristics. In this paper, we investigate the spectral differences between bonafide and spoofed speech for voiced and unvoiced speech segments. We observe that the largest spectral differences lie in the 0–4 kHz band of voiced speech. Based on this observation, we propose a low-complexity, pre-processing stage which subsamples voiced frames prior to spoofing detection. The proposed pre-processing stage is applied to two systems, LFCC+GMM and IA/IF+KNN that differ entirely on the features and classifier used for spoofing detection. Our results show improvement with both systems in detection of the ASVspoof 2019 A17 voice conversion attack, which is recognized to have one of the highest spoofing capabilities. We also show improvements in the A18 and A19 voice conversion attacks for the IA/IF+KNN system. The resulting A17 EERs are lower than all reported systems where the A17 spoofing attack is the worst attack except the Capsule Network. Finally, we note that the proposed pre-processing stage reduces the speech date by more than 4× due to subsampling and using only voiced frames but at the same time maintaining similar pooled EER as that for the baseline systems, which may be advantageous for resource constrained spoofing detectors

Cork Open Research Archive

Introduction to the Issue on Spoofing and Countermeasures for Automatic Speaker Verification

Author: Evans Nicholas
Kinnunen Tomi H.
Leon Phillip De
Trancoso Isabel
Yamagishi Junichi
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2017
Field of study

Edinburgh Research Explorer

EURECOM Repository

Quantitative assessment of paravalvular regurgitation following transcatheter aortic valve replacement

Author: Gareth Crouch
Phillip J Tully
Jayme Bennetts
Ajay Sinhal
Craig Bradbrook
Amy L Penhall
Carmine G De Pasquale
Robert A Baker
Joseph B Selvanayagam
M Gotzmann
CR Smith
GP Ussia
M Vasa-Nicotera
JG Webb
AP Kappetein
EV Gelfand
S Globits
SG Myerson
L Sondergaard
E Altiok
GR Hartlage
HB Ribeiro
RS Gabriel
WA Zoghbi
WA Zoghbi
MA Sherif
P Genereux
MB Leon
M Gotzmann
K Takagi
SG Myerson
SK Kodali
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 10/02/2014
Field of study

Paravalvular aortic regurgitation (PAR) following transcatheter aortic valve implantation (TAVI) is well acknowledged. Despite improvements, echocardiographic measurement of PAR largely remains qualitative. Cardiovascular magnetic resonance (CMR) directly quantifies AR with accuracy and reproducibility. We compared CMR and transthoracic echocardiography (TTE) analysis of pre-operative and post-operative aortic regurgitation in patients undergoing both TAVI and surgical aortic valve replacement (AVR).Gareth Crouch, Phillip J Tully, Jayme Bennetts, Ajay Sinhal, Craig Bradbrook, Amy L Penhall, Carmine G De Pasquale, Robert A Baker, and Joseph B Selvanayaga

arXiv.org e-Print Archive

Crossref

EDP Sciences OAI-PMH repository (1.2.0)

Adelaide Research & Scholarship

Springer - Publisher Connector

Numérisation de Documents Anciens Mathématiques

ART

Hal-Diderot