Search CORE

17 research outputs found

Overview of BTAS 2016 Speaker Anti-spoofing Competition

Author: Chen N.
de Assis Angeloni M.
Dinkel H.
Gonçalves A. R.
Korshunov Pavel
Marcel Sébastien
Mello A. G. Souza
Muckenhirn Hannah
Neto M. U.
Paul D.
Qian Y.
Saha G.
Sahidullah Md
Simões F. O.
Stuchi J. A.
Violato R. P. Velloso
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 19/07/2016
Field of study

This paper provides an overview of the Speaker Anti-spoofing Competition organized by Biometric group at Idiap Research Institute for the IEEE International Conference on Biometrics: Theory, Applications, and Systems (BTAS 2016). The competition used AVspoof database, which contains a comprehensive set of presentation attacks, including, (i) direct replay attacks when a genuine data is played back using a laptop and two phones (Samsung Galaxy S4 and iPhone 3G), (ii) synthesized speech replayed with a laptop, and (iii) speech created with a voice conversion algorithm, also replayed with a laptop. The paper states competition goals, describes the database and the evaluation protocol, discusses solutions for spoofing or presentation attack detection submitted by the participants, and presents the results of the evaluation

Infoscience - École polytechnique fédérale de Lausanne

Crossref

The I4U Mega Fusion and Collaboration for NIST Speaker Recognition Evaluation 2016

Author: Ajili M.
Alegre F.
Ambikairajah E.
Aronowitz H.
Bahmaninezhad F.
Bonastre J. F.
Bousquet P. M.
Busch C.
Chng E. S.
Delgado H.
Evans N.
Fauve B.
Halonen M.
Hansen J. H.L.
Hautamäki V.
Isadskiy S.
Jin R.
Kanervisto A.
Kheder W. B.
Kinnunen T.
Larcher A.
Le Lan G.
Lee K. A.
Li H.
Li Haizhou
Lim Z. H.
Lin W. W.
Liu Gang
Ma B.
Ma J.
Mak M. W.
Matrouf D.
Nautsch A.
Nguyen T. H.
Qian Q.
Rao W.
Rathgeb C.
Rouvier M.
Saeidi R.
Sahidullah M.
Sarkar A. K.
Sethu V.
Sizov A.
Sriskandaraja K.
Stafylakis T.
Sun H.
Tan Z. H.
Thomsen D. A.L.
Todisco M.
Tzimiropoulos G.
Vestman V.
Wang G.
Wang Tianzhou
Wang Z.
Xiao X.
Xu C.
Xu H.
Xue J.
Zhang C.
Zhao Q.
Zhao T.
Zhu S.
Publication venue: 'International Speech Communication Association'
Publication date: 01/01/2017
Field of study

The 2016 speaker recognition evaluation (SRE'16) is the latest edition in the series of benchmarking events conducted by the National Institute of Standards and Technology (NIST). I4U is a joint entry to SRE'16 as the result from the collaboration and active exchange of information among researchers from sixteen Institutes and Universities across 4 continents. The joint submission and several of its 32 sub-systems were among top-performing systems. A lot of efforts have been devoted to two major challenges, namely, unlabeled training data and dataset shift from Switchboard-Mixer to the new Call My Net dataset. This paper summarizes the lessons learned, presents our shared view from the sixteen research groups on recent advances, major paradigm shift, and common tool chain used in speaker recognition as we have witnessed in SRE'16. More importantly, we look into the intriguing question of fusing a large ensemble of sub-systems and the potential benefit of large-scale collaboration.Peer reviewe

Aaltodoc Publication Archive

VBN

Overview of BTAS 2016 Speaker Anti-spoofing Competition

Author: Korshunov Pavel
Marcel Sébastien
Muckenhirn Hannah
Gonçalves A. R.
Mello A. G. Souza
Violato R. P. Velloso
Simões F. O.
Neto M. U.
de Assis Angeloni M.
Stuchi J. A.
Dinkel H.
Chen N.
Qian Y.
Paul D.
Saha G.
Sahidullah Md
Publication venue
Publication date: 01/01/2016
Field of study

Infoscience - École polytechnique fédérale de Lausanne

Crossref

Publikationer från Linköpings universitet

Digitala Vetenskapliga Arkivet - Academic Archive On-line

Archivio istituzionale della ricerca - Università di Padova

Analysis of Voice for Parkinson’s Disease Persons Using Dynamic Time Warping Technique

Author: M Sahidullah
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

An Improved Signal Subspace Algorithm for Speech Enhancement

Author: A. Samal
M. Dendrinos
M. Sahidullah
Y. Ephraim
Y. Ephraim
Y. Hu
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2014
Field of study

Part 1: Digital ServicesInternational audienceMost of the algorithms for speech enhancement are designed to improve the speech listening comfort. However the frequency spectrum character is destroyed seriously after the speech enhancement. To achieve better speech listening comfort with less frequency spectral damages, we present an improved signal subspace algorithm for speech enhancement. Compared with the traditional signal space method, the improved algorithm can decrease the Mel-frequency Cepstral Coefficients (MFCC) distance, an evaluation measure which means less frequency spectral damages to the voice and keep the voices’ intelligence at the same time. Besides, the method can enlarge the distance of the easily confused voices, which means the improvement of the voice recognition ratio. Thus we get the purpose of the speech enhancement. The improved algorithm is used in a speech recognition program and has a good performance

Crossref

How to construct perfect and worse-than-coin-flip spoofing countermeasures:a word of warning on shortcut learning

Author: Gonzalez Hautamäki R. (Rosa)
Kinnunen T. (Tomi)
Sahidullah M. (Md)
Shim H.-j. (Hye-jin)
Publication venue: 'International Speech Communication Association'
Publication date: 01/01/2023
Field of study

Abstract Shortcut learning, or ‘Clever Hans effect’ refers to situations where a learning agent (e.g., deep neural networks) learns spurious correlations present in data, resulting in biased models. We focus on finding shortcuts in deep learning based spoofing countermeasures (CMs) that predict whether a given utterance is spoofed or not. While prior work has addressed specific data artifacts, such as silence, no general normative framework has been explored for analyzing shortcut learning in CMs. In this study, we propose a generic approach to identifying shortcuts by introducing systematic interventions on the training and test sides, including the boundary cases of ‘near-perfect’ and ‘worse than coin flip’ (label flip). By using three different models, ranging from classic to state-of-the-art, we demonstrate the presence of shortcut learning in five simulated conditions. We also analyze the results using a regression model to understand how biases affect the class-conditional score statistics

University of Oulu Repository - Jultika

Original Articles Predictors of Mortality in Ventilated Neonates in Intensive Care Unit

Author: Al Mamun
M Monir Hossain
Mahfuza Shirin
Md. Nurul Akhtar
Md. Sahidullah
Mohammad Abdullah
Publication venue
Publication date
Field of study

Background: A large number of neonates in intensive care unit require mechanical ventilation due to various conditions and have a high mortality. To reduce the high mortality in this group of neonates, identification of risk factors is important. Objective: This study was undertaken to find out the predictors of mortality in ventilated neonates in the Intensive Care Unit. Methods: This study was carried out in the Intensive Care Unit of Dhaka Shish

CiteSeerX

Plasma alpha-2-macroglobulin level in moderate to severe psoriasis

Author: Khondoker Anwarul Islam
Md. Sahidullah Sikder
Mohammed Saiful Islam Bhuiyan
S. M. Manzurul Haque
Sheikh Md. Khorshed Alam
Publication venue: 'Bangladesh Journals Online (JOL)'
Publication date: 01/11/2017
Field of study

Psoriasis is a T-cell mediated chronic inflammatory diseases where pro-inflammatory mediators are involved in its pathogenesis. Alpha-2-macroglobulin (α-2M) is a panproteinase inhibitor having unique clearing role of different cytokines. This study was conducted on 30 patients with moderate to severe psoriasis to see the plasma level of α-2M and was compared with the normal healthy controls. Patients who were already selected for systemic treatment (methotrexate) and consented for routine blood test for monitoring at baseline and 12 weeks after treatment were enrolled along with 10 healthy controls. The venous blood (5 mL) was collected and the plasma alfa-2 macroglobulin was estimated using the enzyme-linked immunosorbent assay. The mean plasma α-2M level was 3.0 ± 0.4 g/L among the normal healthy persons, and 2.8 ± 0.7 g/L among the untreated patients of psoriasis (p>0.05). Its level among the patients with psoriasis after systemic antipsoriatic drugs was 2.8 ± 0.6 g/L which was not significantly different from the baseline level (p>0.05). The study shows that the plasma α-2M level in psoriasis is not different comparing with normal healthy persons

Crossref

Directory of Open Access Journals

Bangabandhu Sheikh Mujib Medical University Journal

Rethinking environmental sound classification using convolutional neural networks: optimized parameter tuning of single feature extraction

Author: H Ali
J Salamon
M Sahidullah
RN Shepard
S Abdoli
S Chu
SH Jung
V Boddapati
Y Su
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Towards creating a reference based self-learning model for improving human machine interaction

Author: G Plouffe
H Li
HP Gupta
J Aron
J Ding
KP Murphy
L-A Perez-Gaspar
M Chen
M Faundez-Zanuy
M Sahidullah
N Klboz
R Yu
SU Park
Z Zhang
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref