Search CORE

2 research outputs found

Utterance Augmentation for Speaker Recognition

Author: Chen Zhengying
Chu Andrea
Fang Yeming
Feng Gang
Moreno Mengibar Pedro
Moreno Ignacio Lopez
Pelecanos Jason
Shi Jin
Wang Quan
Publication venue: Technical Disclosure Commons
Publication date: 18/05/2020
Field of study

The speaker recognition problem is to automatically recognize a person from their voice. The training of a speaker recognition model typically requires a very large training corpus, e.g., multiple voice samples from a very large number of individuals. In the diverse domains of application of speaker recognition, it is often impractical to obtain a training corpus of the requisite size. This disclosure describes techniques that augment utterances, e.g., by cutting, splitting, shuffling, etc., such that the need for collections of raw voice samples from individuals is substantially reduced. In effect, the original model works better on the augmented utterances on the target domain

Technical Disclosure Common