A Speaker De-Identification System Based on Sound Processing

Costandache, Mihai-Andrei; Gifu, Daniela; Iftene, Adrian

A Speaker De-Identification System Based on Sound Processing

Authors: Mihai-Andrei Costandache
Daniela Gifu
Adrian Iftene
Publication date: 9 August 2021
Publisher: AIS Electronic Library (AISeL)

Abstract

In the context of products employing speech recognition, where the speech signal is sent from the device to centralized servers that process data, or simply products that involve data storage on servers, privacy for audio data is an important issue, just as it is for other types of data. Ignoring privacy has consequences for both, speakers (information leaks) and server administrators (legal issues). In this paper, we propose a speaker de-identification solution based on sound processing, altering voice characteristics, along with an API. Our solution consisting of pitch shift and noise mix (the latter is an optional augmentation method) has a great speaker de-identification performance, without an important loss in terms of word intelligibility. It is worth mentioning that sometimes the recordings may not be easy to understand in the initial (i.e., not de-identified) form, due to the speaker’s pronunciation, talking speed, and other related factors

Similar works

Full text

Open in the Core reader

Download PDF

Available Versions

AIS Electronic Library (AISeL)

oai:aisel.aisnet.org:isd2014-1...

Last time updated on 16/11/2021