Silent-speech enhancement using body-conducted vocal-tract resonance signals

Hirahara,  Tatsuya; Nakajima,  Yoshitaka; Nakamura,  Keigo; Otani,  Makoto; Shikano,  Kiyohiro; Shimizu,  Shota; Toda,  Tomoki

Silent-speech enhancement using body-conducted vocal-tract resonance signals

Authors: Tatsuya Hirahara
Yoshitaka Nakajima
Keigo Nakamura
Makoto Otani
Kiyohiro Shikano
Shota Shimizu
Tomoki Toda
Publication date: 1 April 2010
Publisher: 'Elsevier BV'
Doi

Abstract

The physical characteristics of weak body-conducted vocal-tract resonance signals called non-audible murmur (NAM) and the acoustic characteristics of three sensors developed for detecting these signals have been investigated. NAM signals attenuate 50 dB at 1 kHz; this attenuation consists of 30-dB full-range attenuation due to air-to-body transmission loss and 10 dB/octave spectral decay due to a sound propagation loss within the body. These characteristics agree with the spectral characteristics of measured NAM signals. The sensors have a sensitivity of between 41 and 58 dB [V/Pa] at I kHz, and the mean signal-to-noise ratio of the detected signals was 15 dB. On the basis of these investigations, three types of silent-speech enhancement systems were developed: (1) simple, direct amplification of weak vocal-tract resonance signals using a wired urethane-elastomer NAM microphone, (2) simple, direct amplification using a wireless urethane-elastomer-duplex NAM microphone, and (3) transformation of the weak vocal-tract resonance signals sensed by a soft-silicone NAM microphone into whispered speech using statistical conversion. Field testing of the systems showed that they enable voice impaired people to communicate verbally using body-conducted vocal-tract resonance signals. Listening tests demonstrated that weak body-conducted vocal-tract resonance sounds can be transformed into intelligible whispered speech sounds. Using these systems, people with voice impairments can re-acquire speech communication with less effort. (C) 2009 Elsevier B.V. All rights reserved.ArticleSPEECH COMMUNICATION. 52(4):301-313 (2010)journal articl

Similar works

Full text

Open in the Core reader

Download PDF

Available Versions

Shinshu University Institutional Repository

oai:soar-ir.repo.nii.ac.jp:000...

Last time updated on 19/12/2022