unknown

Modified-Prior PLDA Based Speaker Recognition System

Abstract

为减弱注册语音与测试语音时长不一致对说话人识别性能的负面影响,提出一个概率修正PldA建模方法.根据语音时长自适应改变传统PldA模型中I-VECTOr的概率分布函数,提高PldA对每个说话人每段语音的时长表征能力,以增强说话人类别的区分度.为验证基于概率修正PldA模型的有效性,进行了nIST SrE10 COrECOrE测试集在3种不同时长的评测实验,以及nIST 2014 I-VECTOr MACHInE lEArnIng CHAllEngE测试任务.结果表明,相较于传统的PldA训练模型,通过语音时长的约束提高了说话人识别性能.To reduce the negative impact on the performance of speaker recognition systems due to the duration mismatch between enrollment utterance and test utterance,a modified-prior PLDA method is proposed.The probability distribution function of i-vector was modified by incorporating the covariance matrix with duration of each utterance of each speaker during the PLDA training,which further improved the discriminant capability of speaker classification.To evaluate the robustness of the proposed modified-prior PLDA method,extensive experiments were performed on NIST SRE10 core-core task(female part)in duration mismatch conditions and NIST 2014 i-vector machine learning challenge.Experimental results demonstrated that the duration-based modified-prior PLDA method achieved better performance compared with the traditional PLDA.国家自然科学基金资助项目(61105026

    Similar works