The SJTU System for Short-duration Speaker Verification Challenge 2021

Chen, Zhengyang; Han, Bing; Qian, Yanmin; Zhou, Zhikai

The SJTU System for Short-duration Speaker Verification Challenge 2021

Authors: Zhengyang Chen
Bing Han
Yanmin Qian
Zhikai Zhou
Publication date: 3 August 2022
Publisher

Abstract

This paper presents the SJTU system for both text-dependent and text-independent tasks in short-duration speaker verification (SdSV) challenge 2021. In this challenge, we explored different strong embedding extractors to extract robust speaker embedding. For text-independent task, language-dependent adaptive snorm is explored to improve the system performance under the cross-lingual verification condition. For text-dependent task, we mainly focus on the in-domain fine-tuning strategies based on the model pre-trained on large-scale out-of-domain data. In order to improve the distinction between different speakers uttering the same phrase, we proposed several novel phrase-aware fine-tuning strategies and phrase-aware neural PLDA. With such strategies, the system performance is further improved. Finally, we fused the scores of different systems, and our fusion systems achieved 0.0473 in Task1 (rank 3) and 0.0581 in Task2 (rank 8) on the primary evaluation metric.Comment: Published by Interspeech 202

Similar works

Full text

Available Versions

arXiv.org e-Print Archive

oai:arXiv.org:2208.01933

Last time updated on 06/10/2022